Langchain
npx machina-cli add skill muhammederem/chief/langchain --openclaw
LangChain Framework
Overview
LangChain is a framework for developing applications powered by language models. It provides interfaces for LLMs, prompt management, memory, chains, agents, and tools.
Installation
pip install langchain langchain-openai langchain-community
pip install chromadb # For vector store
Core Components
Language Models
from langchain_openai import ChatOpenAI
from langchain_community.llms import HuggingFacePipeline
# OpenAI
llm = ChatOpenAI(
    model="gpt-4",
    temperature=0.7,
    api_key="your-api-key"
)
# Hugging Face
from transformers import AutoModelForCausalLM, AutoTokenizer, pipeline
model = AutoModelForCausalLM.from_pretrained("gpt2")
tokenizer = AutoTokenizer.from_pretrained("gpt2")
pipe = pipeline("text-generation", model=model, tokenizer=tokenizer)
llm = HuggingFacePipeline(pipeline=pipe)
Prompts
from langchain.prompts import PromptTemplate, ChatPromptTemplate
# Simple prompt
prompt = PromptTemplate(
    input_variables=["product"],
    template="What is a good name for a company that makes {product}?"
)
# Chat prompt
prompt = ChatPromptTemplate.from_messages([
    ("system", "You are a helpful assistant."),
    ("human", "{input}")
])
Chains
from langchain.chains import LLMChain
# Basic chain
chain = LLMChain(llm=llm, prompt=prompt)
result = chain.run(product="colorful socks")
# Sequential chain
from langchain.chains import SimpleSequentialChain
chain1 = LLMChain(llm=llm, prompt=prompt1)
chain2 = LLMChain(llm=llm, prompt=prompt2)
combined_chain = SimpleSequentialChain(chains=[chain1, chain2])
RAG (Retrieval Augmented Generation)
Basic RAG Pipeline
from langchain_community.document_loaders import TextLoader
from langchain.text_splitter import RecursiveCharacterTextSplitter
from langchain_community.vectorstores import Chroma
from langchain_openai import OpenAIEmbeddings
from langchain.chains import RetrievalQA
# Load documents
loader = TextLoader("document.txt")
documents = loader.load()
# Split documents
text_splitter = RecursiveCharacterTextSplitter(
    chunk_size=1000,
    chunk_overlap=200
)
splits = text_splitter.split_documents(documents)
# Create vector store
vectorstore = Chroma.from_documents(
    documents=splits,
    embedding=OpenAIEmbeddings()
)
# Create RAG chain
retriever = vectorstore.as_retriever(search_kwargs={"k": 3})
qa_chain = RetrievalQA.from_chain_type(
    llm=llm,
    chain_type="stuff",
    retriever=retriever
)
# Query
result = qa_chain.run("What is the document about?")
Custom RAG with Prompts
from langchain.prompts import PromptTemplate
prompt_template = """
Use the following pieces of context to answer the question at the end.
If you don't know the answer, just say that you don't know, don't try to make up an answer.
Context: {context}
Question: {question}
Answer:
"""
PROMPT = PromptTemplate(
    template=prompt_template,
    input_variables=["context", "question"]
)
qa_chain = RetrievalQA.from_chain_type(
    llm=llm,
    chain_type="stuff",
    retriever=retriever,
    chain_type_kwargs={"prompt": PROMPT}
)
Memory
Conversation Memory
from langchain.memory import ConversationBufferMemory
memory = ConversationBufferMemory()
memory.save_context({"input": "Hi"}, {"output": "Hello"})
memory.load_memory_variables({})
Memory in Chains
from langchain.chains import ConversationChain
conversation = ConversationChain(
    llm=llm,
    memory=ConversationBufferMemory()
)
response = conversation.run("Hi, I'm John")
response = conversation.run("What's my name?")
Memory Types
- ConversationBufferMemory: Stores all conversation history
- ConversationBufferWindowMemory: Keeps last k exchanges
- ConversationSummaryMemory: Summarizes conversation
- ConversationKGMemory: Uses knowledge graph
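The buffer-window idea behind ConversationBufferWindowMemory can be sketched in plain Python. This is a conceptual toy, not LangChain's implementation: only the last k exchanges are kept, so old turns drop out of the prompt context.

```python
from collections import deque

class WindowBufferMemory:
    """Keeps only the last k exchanges, like ConversationBufferWindowMemory."""
    def __init__(self, k=3):
        self.exchanges = deque(maxlen=k)  # old exchanges fall off automatically

    def save_context(self, user_input, output):
        self.exchanges.append((user_input, output))

    def load_memory_variables(self):
        # Render the retained window as a single history string
        return "\n".join(f"Human: {u}\nAI: {a}" for u, a in self.exchanges)

memory = WindowBufferMemory(k=2)
memory.save_context("Hi", "Hello")
memory.save_context("I'm John", "Nice to meet you")
memory.save_context("What's my name?", "John")
# Only the last 2 exchanges survive; the "Hi"/"Hello" turn was evicted
history = memory.load_memory_variables()
```

The trade-off is the same as in the real class: a small window keeps prompts short and cheap, at the cost of forgetting earlier turns.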
Agents
OpenAI Functions Agent
from langchain import hub
from langchain.agents import AgentExecutor, create_openai_functions_agent
from langchain.tools import Tool
from langchain_openai import ChatOpenAI
# Define tools
def calculator(input_str):
    # Note: eval is unsafe on untrusted input; use a math parser in production
    return str(eval(input_str))
tools = [
    Tool(
        name="Calculator",
        func=calculator,
        description="Useful for math calculations"
    )
]
# Create agent; pull a ready-made functions-agent prompt from LangChain Hub
llm = ChatOpenAI(model="gpt-4")
prompt = hub.pull("hwchase17/openai-functions-agent")
agent = create_openai_functions_agent(llm, tools, prompt)
# Create executor
agent_executor = AgentExecutor(
    agent=agent,
    tools=tools,
    verbose=True
)
# Run
result = agent_executor.invoke({"input": "What is 2 + 2?"})
Custom Tools
from langchain.tools import StructuredTool
from pydantic import BaseModel, Field

class SearchInput(BaseModel):
    query: str = Field(description="Search query")

def search_func(query: str) -> str:
    # Implement search
    return f"Results for {query}"

search_tool = StructuredTool.from_function(
    func=search_func,
    name="Search",
    description="Search the web",
    args_schema=SearchInput
)
Document Loaders
Common Loaders
from langchain_community.document_loaders import (
    TextLoader,
    PyPDFLoader,
    WebBaseLoader,
    GitLoader,
    YoutubeLoader
)
# Text
loader = TextLoader("file.txt")
# PDF
loader = PyPDFLoader("document.pdf")
# Web
loader = WebBaseLoader("https://example.com")
# Git repository (clones the repo to a local path, then loads its files)
loader = GitLoader(
    clone_url="https://github.com/username/repo",
    repo_path="./repo",
    branch="main"
)
# YouTube
loader = YoutubeLoader.from_youtube_url(
    "https://youtube.com/watch?v=xxx",
    add_video_info=True
)
Custom Loader
from langchain.schema import Document

class CustomLoader:
    def load(self) -> list[Document]:
        # Load data (self._fetch_data is a placeholder for your own data source)
        data = self._fetch_data()
        # Convert to documents
        documents = [
            Document(page_content=item["text"], metadata=item["meta"])
            for item in data
        ]
        return documents
Text Splitters
Character Text Splitter
from langchain.text_splitter import CharacterTextSplitter
splitter = CharacterTextSplitter(
    separator="\n\n",
    chunk_size=1000,
    chunk_overlap=200,
    length_function=len
)
Recursive Character Splitter
from langchain.text_splitter import RecursiveCharacterTextSplitter
splitter = RecursiveCharacterTextSplitter(
    chunk_size=1000,
    chunk_overlap=200,
    separators=["\n\n", "\n", " ", ""]
)
Code Splitter
from langchain.text_splitter import RecursiveCharacterTextSplitter, Language
splitter = RecursiveCharacterTextSplitter.from_language(
    language=Language.PYTHON,
    chunk_size=1000,
    chunk_overlap=200
)
Vector Stores
Pinecone
from langchain_community.vectorstores import Pinecone
from langchain_openai import OpenAIEmbeddings
vectorstore = Pinecone.from_documents(
    documents=documents,
    embedding=OpenAIEmbeddings(),
    index_name="my-index"
)
ChromaDB
from langchain_community.vectorstores import Chroma
vectorstore = Chroma.from_documents(
    documents=documents,
    embedding=OpenAIEmbeddings(),
    persist_directory="./chroma_db"
)
Weaviate
from langchain_community.vectorstores import Weaviate
vectorstore = Weaviate.from_documents(
    documents=documents,
    embedding=OpenAIEmbeddings(),
    index_name="MyIndex"
)
Output Parsers
Structured Output
from langchain.output_parsers import StructuredOutputParser, ResponseSchema
from langchain.prompts import PromptTemplate
# Define schema
schemas = [
    ResponseSchema(name="name", description="The person's name"),
    ResponseSchema(name="age", description="The person's age")
]
output_parser = StructuredOutputParser.from_response_schemas(schemas)
# Use in prompt
format_instructions = output_parser.get_format_instructions()
prompt = PromptTemplate(
    template="Answer the user question.\n{format_instructions}\n{question}",
    input_variables=["question"],
    partial_variables={"format_instructions": format_instructions}
)
Best Practices
1. Prompt Engineering
- Be specific and clear
- Provide examples (few-shot)
- Use appropriate delimiters
- Specify output format
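These guidelines combine naturally in a few-shot prompt. A hand-rolled sketch with explicit `###` delimiters and a stated output format (LangChain's FewShotPromptTemplate automates this pattern; the task and labels here are illustrative):

```python
def build_few_shot_prompt(instruction, examples, query):
    """Assemble a few-shot prompt: instruction, delimited examples, then the task."""
    parts = [instruction, "Respond with the label only."]  # explicit output format
    for text, label in examples:
        parts.append(f"### Example\nInput: {text}\nLabel: {label}")
    parts.append(f"### Task\nInput: {query}\nLabel:")
    return "\n\n".join(parts)

prompt = build_few_shot_prompt(
    "Classify the sentiment of the input as positive or negative.",
    [("I love this!", "positive"), ("Terrible service.", "negative")],
    "The socks are great.",
)
```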
2. Chain Design
- Keep chains simple and focused
- Use appropriate chain type
- Handle errors gracefully
- Add validation
3. RAG Optimization
- Chunk size: 500-1500 tokens
- Overlap: 10-20% of chunk size
- Use appropriate embeddings
- Tune retrieval (k value, search type)
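The chunking arithmetic behind those numbers: each new chunk starts `chunk_size - chunk_overlap` characters after the previous one, so neighboring chunks share the overlap region. A minimal sliding-window splitter illustrating the idea (a conceptual sketch, not RecursiveCharacterTextSplitter, which also respects separators):

```python
def split_with_overlap(text, chunk_size=1000, chunk_overlap=200):
    """Slide a window of chunk_size, stepping by chunk_size - chunk_overlap."""
    step = chunk_size - chunk_overlap  # e.g. 800 when overlap is 20% of the chunk
    chunks = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        if start + chunk_size >= len(text):
            break  # last window already covers the tail
    return chunks

# 2500 characters -> windows at offsets 0, 800, 1600
chunks = split_with_overlap("x" * 2500, chunk_size=1000, chunk_overlap=200)
```

The overlap is what keeps a sentence that straddles a chunk boundary fully retrievable from at least one chunk.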
4. Agent Design
- Provide clear tool descriptions
- Limit tool set to essentials
- Add error handling
- Monitor and log agent actions
5. Performance
- Cache embeddings
- Use batch processing
- Stream responses for long text
- Optimize vector store queries
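Caching embeddings can be as simple as keying vectors by a hash of the text, so repeated texts never trigger a second (billed) API call. A toy in-memory sketch of the idea; LangChain's CacheBackedEmbeddings does this against a persistent store, and `fake_embed` below is a stand-in, not a real embeddings API:

```python
import hashlib

class CachedEmbedder:
    def __init__(self, embed_fn):
        self.embed_fn = embed_fn
        self.cache = {}
        self.misses = 0

    def embed(self, text):
        key = hashlib.sha256(text.encode()).hexdigest()
        if key not in self.cache:
            self.misses += 1  # only compute (and pay for) unseen texts
            self.cache[key] = self.embed_fn(text)
        return self.cache[key]

# Stand-in embedding function; replace with a real embeddings model call
fake_embed = lambda text: [float(len(text)), float(sum(map(ord, text)) % 97)]
embedder = CachedEmbedder(fake_embed)
v1 = embedder.embed("hello")
v2 = embedder.embed("hello")  # second call is served from the cache
```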
Integration
- Vector Databases: Pinecone, Weaviate, Qdrant, Chroma
- LLMs: OpenAI, Anthropic, Hugging Face, Cohere
- Tools: Web search, databases, APIs, file systems
- Deployment: FastAPI, Streamlit, AWS Lambda
Source
https://github.com/muhammederem/chief/blob/main/.claude/skills/ml-ai/langchain/SKILL.md
Overview
LangChain is a framework for developing applications powered by language models. It provides modular interfaces for LLMs, prompt management, memory, chains, agents, and tools to orchestrate AI workflows.
How This Skill Works
Developers compose LLMs, prompts, and memory into reusable chains or agents. LangChain supports basic and complex chains, prompt templates, and memory to preserve context, while optional components like a vector store enable Retrieval Augmented Generation for document-based QA.
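The compose-into-chains idea can be shown with a dependency-free toy: a "chain" is just prompt formatting, a model call, and output parsing glued into one callable. This is a conceptual stand-in, not LangChain's API, and `fake_llm` is a placeholder for a real model:

```python
def make_chain(template, llm, parser):
    """Compose prompt formatting, an LLM call, and output parsing into one callable."""
    def chain(**variables):
        prompt = template.format(**variables)
        raw = llm(prompt)
        return parser(raw)
    return chain

# A fake LLM so the sketch runs offline; swap in a real model call in practice
fake_llm = lambda prompt: f"ANSWER: echo of [{prompt}]"
parser = lambda text: text.removeprefix("ANSWER: ").strip()

chain = make_chain("Name a company that makes {product}.", fake_llm, parser)
result = chain(product="socks")
```

Each stage is swappable independently, which is the same property that makes LangChain chains reusable across prompts, backends, and parsers.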
When to Use It
- You need a chat assistant that remembers past conversations and can use tools
- You want to build a retrieval-augmented generation (RAG) system for documents
- You need to chain multiple prompt steps together (basic or sequential chains)
- You want to experiment with different LLM backends (OpenAI, HuggingFace) in one workflow
- You’re building an agent that reasons and acts using tools (ReAct-style)
Quick Start
- Step 1: Install dependencies like langchain, langchain-openai, langchain-community and chromadb for vector storage
- Step 2: Instantiate an LLM (e.g., OpenAI ChatOpenAI or a HuggingFacePipeline) as shown in examples
- Step 3: Create a Prompt (PromptTemplate or ChatPromptTemplate) and build a Chain (LLMChain or SimpleSequentialChain) to run a sample task
Best Practices
- Use PromptTemplate or ChatPromptTemplate to keep prompts modular and reusable
- Leverage ConversationBufferMemory or other memory types to persist context across turns
- For RAG, tune chunk_size, chunk_overlap, and retriever k to balance coverage and latency
- Store and retrieve embeddings with a vector store like Chroma using OpenAIEmbeddings
- Compose steps with LLMChain and SimpleSequentialChain to build scalable workflows
Example Use Cases
- A naming assistant that uses a simple prompt template to brainstorm a company name for a product
- A document QA system that loads text, splits it into chunks, stores embeddings in Chroma, and answers questions via RetrievalQA
- A memory-enabled chat bot using ConversationBufferMemory to maintain context across interactions
- A multi-step content generator built from a sequence of prompts connected by SimpleSequentialChain
- An experimentation setup switching between OpenAI and HuggingFace backends for the same workflow