Skip to content

Latest commit

 

History

History
290 lines (222 loc) · 24.3 KB

langchain.md

File metadata and controls

290 lines (222 loc) · 24.3 KB

LangChain Features, Usage, and Comparisons

  • LangChain is a framework for developing applications powered by language models. (1) Be data-aware: connect a language model to other sources of data. (2) Be agentic: Allow a language model to interact with its environment. doc:ref / blog:ref / git GitHub Repo stars

  • It highlights two main value props of the framework:

    1. Components: modular abstractions and implementations for working with language models, with easy-to-use features.
    2. Use-Case Specific Chains: chains of components that assemble in different ways to achieve specific use cases, with customizable interfaces.cite: ref
    • LangChain 0.2: full separation of langchain and langchain-community. ref [May 2024]

    • Towards LangChain 0.1 ref [Dec 2023]

    • Basic LangChain building blocks ref [2023]

      '''
      LLMChain: A LLMChain is the most common type of chain. It consists of a PromptTemplate, a model (either an LLM or a ChatModel), and an optional output parser.
      '''
      chain = prompt | model | parser
    • LLMChain: Deprecated since version 0.1.17: Use RunnableSequence, e.g., prompt | llm instead.

    • LangChain has shifted towards the Runnable interface since version 0.1.17.

    • Imperative (programmatic) approach: The Runnable interface (formerly LLMChain) for flexible, programmatic chain building.

    • Declarative approach: LangChain Expression Language (LCEL) offers a declarative syntax for chain composition, enabling features like async, batch, and streaming operations with the | operator for combining functionalities.

Macro and Micro-orchestration

  • Macro-orchestration in LLM pipelines involves high-level design and management of complex workflows, integrating multiple LLMs and other components.
  • Micro-orchestration x-ref
  • LangGraph in LangChain, and Burr

LangChain Feature Matrix & Cheetsheet

LangChain features and related libraries

LangChain chain type: Chains & Summarizer

  • Chains ref
    • SimpleSequentialChain: A sequence of steps with single input and output. Output of one step is input for the next.
    • SequentialChain: Like SimpleSequentialChain but handles multiple inputs and outputs at each step.
    • MultiPromptChain: Routes inputs to specialized sub-chains based on content. Ideal for different prompts for different tasks.
  • Summarizer
    • stuff: Sends everything at once in LLM. If it's too long, an error will occur.
    • map_reduce: Summarizes by dividing and then summarizing the entire summary.
    • refine: (Summary + Next document) => Summary
    • map_rerank: Ranks by score and summarizes to important points.

LangChain Agent & Memory

LangChain Agent

  1. If you're using a text LLM, first try zero-shot-react-description.
  2. If you're using a Chat Model, try chat-zero-shot-react-description.
  3. If you're using a Chat Model and want to use memory, try conversational-react-description.
  4. self-ask-with-search: Measuring and Narrowing the Compositionality Gap in Language Models [7 Oct 2022]
  5. react-docstore: ReAct: Synergizing Reasoning and Acting in Language Models [6 Oct 2022]
  6. Agent Type
class AgentType(str, Enum):
    """Enumerator with the Agent types."""

    ZERO_SHOT_REACT_DESCRIPTION = "zero-shot-react-description"
    REACT_DOCSTORE = "react-docstore"
    SELF_ASK_WITH_SEARCH = "self-ask-with-search"
    CONVERSATIONAL_REACT_DESCRIPTION = "conversational-react-description"
    CHAT_ZERO_SHOT_REACT_DESCRIPTION = "chat-zero-shot-react-description"
    CHAT_CONVERSATIONAL_REACT_DESCRIPTION = "chat-conversational-react-description"
    STRUCTURED_CHAT_ZERO_SHOT_REACT_DESCRIPTION = (
        "structured-chat-zero-shot-react-description"
    )
    OPENAI_FUNCTIONS = "openai-functions"
    OPENAI_MULTI_FUNCTIONS = "openai-multi-functions"
  • ReAct vs MRKL (miracle)

    ReAct is inspired by the synergies between "acting" and "reasoning" which allow humans to learn new tasks and make decisions or reasoning.

    MRKL stands for Modular Reasoning, Knowledge and Language and is a neuro-symbolic architecture that combines large language models, external knowledge sources, and discrete reasoning

    cite: ref [28 Apr 2023]
    zero-shot-react-description: Uses ReAct to select tools based on their descriptions. Any number of tools can be used, each requiring a description.
    react-docstore: Uses ReAct to manage a docstore with two required tools: Search and Lookup. These tools must be named exactly as specified. It follows the original ReAct paper's example from Wikipedia.

    • MRKL in LangChain uses zero-shot-react-description, implementing ReAct. The original ReAct framework is used in the react-docstore agent. MRKL was published on May 1, 2022, earlier than ReAct on October 6, 2022.

LangChain Memory

  1. ConversationBufferMemory: Stores the entire conversation history.
  2. ConversationBufferWindowMemory: Stores recent messages from the conversation history.
  3. Entity Memory: Stores and retrieves entity-related information.
  4. Conversation Knowledge Graph Memory: Stores entities and relationships between entities.
  5. ConversationSummaryMemory: Stores summarized information about the conversation.
  6. ConversationSummaryBufferMemory: Stores summarized information about the conversation with a token limit.
  7. ConversationTokenBufferMemory: Stores tokens from the conversation.
  8. VectorStore-Backed Memory: Leverages vector space models for storing and retrieving information.

Criticism to LangChain

  • The Problem With LangChain: ref / git [14 Jul 2023] GitHub Repo stars
  • What’s your biggest complaint about langchain?: ref [May 2023]
  • LangChain Is Pointless: ref [Jul 2023]

    LangChain has been criticized for making simple things relatively complex, which creates unnecessary complexity and tribalism that hurts the up-and-coming AI ecosystem as a whole. The documentation is also criticized for being bad and unhelpful.

  • How to Build Ridiculously Complex LLM Pipelines with LangGraph! [17 Sep 2024 ]

    LangChain does too much, and as a consequence, it does many things badly. Scaling beyond the basic use cases with LangChain is a challenge that is often better served with building things from scratch by using the underlying APIs.

LangChain vs Competitors

Prompting Frameworks

  • LangChain [Oct 2022] | LlamaIndex [Nov 2022] | Microsoft Semantic Kernel [Feb 2023] | Microsoft guidance [Nov 2022] | Azure ML Promt flow [Jun 2023] | DSPy [Jan 2023] GitHub Repo stars GitHub Repo stars GitHub Repo stars GitHub Repo stars GitHub Repo stars GitHub Repo stars
  • Prompting Framework (PF): Prompting Frameworks for Large Language Models: A Survey git GitHub Repo stars
  • What Are Tools Anyway?: 1. For a small number (e.g., 5–10) of tools, LMs can directly select from contexts. However, with a larger number (e.g., hundreds), an additional retrieval step involving a retriever model is often necessary. 2. LM-used tools incl. Tool creation and reuse. Tool is not useful when machine translation, summarization, and sentiment analysis (among others). 3. Evaluation metrics [18 Mar 2024]

LangChain vs LlamaIndex

  • Basically LlamaIndex is a smart storage mechanism, while LangChain is a tool to bring multiple tools together. cite [14 Apr 2023]

  • LangChain offers many features and focuses on using chains and agents to connect with external APIs. In contrast, LlamaIndex is more specialized and excels at indexing data and retrieving documents.

LangChain vs Semantic Kernel

LangChain Semantic Kernel
Memory Memory
Tookit Plugin (pre. Skill)
Tool LLM prompts (semantic functions)
native C# or Python code (native function)
Agent Planner (Deprecated) -> Agent
Chain Steps, Pipeline
Tool Connector (Deprecated) -> Plugin

LangChain vs Semantic Kernel vs Azure Machine Learning Prompt flow

  • What's the difference between LangChain and Semantic Kernel?

    LangChain has many agents, tools, plugins etc. out of the box. More over, LangChain has 10x more popularity, so has about 10x more developer activity to improve it. On other hand, Semantic Kernel architecture and quality is better, that's quite promising for Semantic Kernel. ref [11 May 2023]

  • What's the difference between Azure Machine Learing PromptFlow and Semantic Kernel?

    1. Low/No Code vs C#, Python, Java
    2. Focused on Prompt orchestrating vs Integrate LLM into their existing app.
  • Promptflow is not intended to replace chat conversation flow. Instead, it’s an optimized solution for integrating Search and Open Source Language Models. By default, it supports Python, LLM, and the Prompt tool as its fundamental building blocks.

  • Using Prompt flow with Semantic Kernel: ref [07 Sep 2023]

Prompt Template Language

Handlebars.js Jinja2 Prompt Template
Conditions {{#if user}}
  Hello {{user}}!
{{else}}
  Hello Stranger!
{{/if}}
{% if user %}
  Hello {{ user }}!
{% else %}
  Hello Stranger!
{% endif %}
Branching features such as "if", "for", and code blocks are not part of SK's template language.
Loop {{#each items}}
  Hello {{this}}
{{/each}}
{% for item in items %}
  Hello {{ item }}
{% endfor %}
By using a simple language, the kernel can also avoid complex parsing and external dependencies.
LangChain Library guidance. LangChain.js LangChain, Azure ML prompt flow Semantic Kernel
URL ref ref ref
  • Semantic Kernel supports HandleBars and Jinja2. [Mar 2024]

LlamaIndex

  • LlamaIndex (formerly GPT Index) is a data framework for LLM applications to ingest, structure, and access private or domain-specific data. The high-level API allows users to ingest and query their data in a few lines of code. High-Level Concept: ref / doc:ref / blog:ref / git [Nov 2022] GitHub Repo stars

    Fun fact this core idea was the initial inspiration for GPT Index (the former name of LlamaIndex) 11/8/2022 - almost a year ago!. cite / Walking Down the Memory Maze: Beyond Context Limit through Interactive Reading

    1. Build a data structure (memory tree)
    2. Transverse it via LLM prompting
  • LlamaIndex Toolkits:

    • LlamaHub: A library of data loaders for LLMs git [Feb 2023] GitHub Repo stars
    • LlamaIndex CLI: a command line tool to generate LlamaIndex apps ref [Nov 2023]
    • LlamaParse: A unique parsing tool for intricate documents git [Feb 2024] GitHub Repo stars
  • LlamaIndex showcase > examples ref: e.g., Contract Review, Patient Case Summary, and Auto Insurance Claims Workflow. [9 Jan 2025]

LlamaIndex integration with Azure AI

High-Level Concepts

  • Query engine vs Chat engine

    1. The query engine wraps a retriever and a response synthesizer into a pipeline, that will use the query string to fetch nodes (sentences or paragraphs) from the index and then send them to the LLM (Language and Logic Model) to generate a response
    2. The chat engine is a quick and simple way to chat with the data in your index. It uses a context manager to keep track of the conversation history and generate relevant queries for the retriever. Conceptually, it is a stateful analogy of a Query Engine.
  • Storage Context vs Settings (p.k.a. Service Context)

    • Both the Storage Context and Service Context are data classes.

      1. Introduced in v0.10.0, ServiceContext is replaced to Settings object.
      2. Storage Context is responsible for the storage and retrieval of data in Llama Index, while the Service Context helps in incorporating external context to enhance the search experience.
      3. The Service Context is not directly involved in the storage or retrieval of data, but it helps in providing a more context-aware and accurate search experience.
    # The storage context container is a utility container for storing nodes, indices, and vectors.
    class StorageContext:
      docstore: BaseDocumentStore
      index_store: BaseIndexStore
      vector_store: VectorStore
      graph_store: GraphStore
    # NOTE: Deprecated, use llama_index.settings.Settings. The service context container is a utility container for LlamaIndex index and query classes.
    class ServiceContext:
      llm_predictor: BaseLLMPredictor
      prompt_helper: PromptHelper
      embed_model: BaseEmbedding
      node_parser: NodeParser
      llama_logger: LlamaLogger
      callback_manager: CallbackManager
    @dataclass
    class _Settings:
      # lazy initialization
      _llm: Optional[LLM] = None
      _embed_model: Optional[BaseEmbedding] = None
      _callback_manager: Optional[CallbackManager] = None
      _tokenizer: Optional[Callable[[str], List[Any]]] = None
      _node_parser: Optional[NodeParser] = None
      _prompt_helper: Optional[PromptHelper] = None
      _transformations: Optional[List[TransformComponent]] = None

LlamaIndex Tutorial

  • LlamaIndex Overview (Japanese) [17 Jul 2023]

  • Fine-Tuning a Linear Adapter for Any Embedding Model: Fine-tuning the embeddings model requires you to reindex your documents. With this approach, you do not need to re-embed your documents. Simply transform the query instead. [7 Sep 2023]

  • 4 RAG techniques implemented in llama_index / cite [20 Sep 2023] / git GitHub Repo stars

    Expand: 4 RAG techniques
    1. SQL Router Query Engine: Query router that can reference your vector database or SQL database

    2. Sub Question Query Engine: Break down the complex question into sub-questions

    3. Recursive Retriever + Query Engine: Reference node relationships, rather than only finding a node (chunk) that is most relevant.

    4. Self Correcting Query Engines: Use an LLM to evaluate its own output.

  • LlamaIndex Tutorial: A Complete LlamaIndex Guide [18 Oct 2023]