langchain-models

Initialize and use LangChain chat models - includes provider selection (OpenAI, Anthropic, Google), model configuration, and invocation patterns

christian-bromann 3 1 Updated 5mo ago

GitHub

Install

npx skillscat add christian-bromann/langchain-skills/skills-langchain-models-python

Install via the SkillsCat registry.

SKILL.md

langchain-models (Python)

Overview

Chat models are the core of LangChain applications. They take messages as input and return AI-generated messages as output. LangChain provides a unified interface across multiple providers (OpenAI, Anthropic, Google, etc.).

Key Concepts:

init_chat_model(): Universal initialization for any provider
Provider-specific classes: Direct initialization (ChatOpenAI, ChatAnthropic, etc.)
Messages: Structured input/output format (HumanMessage, AIMessage, etc.)
Invocation patterns: invoke(), stream(), batch()

When to Use Each Provider

Provider	Best For	Models	Strengths
OpenAI	General purpose, reasoning	GPT-4.1, GPT-5	Strong reasoning, large context
Anthropic	Safety, analysis	Claude Sonnet/Opus	Safety, long context, vision
Google	Multimodal, speed	Gemini 2.5	Fast, multimodal, cost-effective
AWS Bedrock	Enterprise, compliance	Multiple providers	Security, compliance, variety
Azure OpenAI	Enterprise OpenAI	GPT models	Enterprise features, SLAs

Decision Tables

Choosing a Model

Use Case	Recommended Model	Why
Complex reasoning	GPT-5, Claude Opus	Best logical capabilities
Fast responses	Gemini Flash, GPT-4.1-mini	Low latency
Vision tasks	GPT-4.1, Claude Sonnet, Gemini	Multimodal support
Long context	Claude Opus, Gemini	100k+ token windows
Cost-effective	GPT-4.1-mini, Gemini Flash	Lower pricing
Enterprise/compliance	Azure OpenAI, AWS Bedrock	Security features

Initialization Methods

Method	When to Use	Example
`init_chat_model("provider:model")`	Quick switching between providers	`init_chat_model("openai:gpt-4.1")`
Provider class	Need provider-specific features	`ChatOpenAI(model="gpt-4.1")`
With configuration	Custom parameters needed	Temperature, max tokens, etc.

Code Examples

Basic Model Initialization

from langchain.chat_models import init_chat_model

# Universal initialization - easiest way
model = init_chat_model("openai:gpt-4.1")

# Or with provider shorthand
model2 = init_chat_model("gpt-4.1")  # Defaults to OpenAI

# API key from environment (recommended)
import os
os.environ["OPENAI_API_KEY"] = "your-api-key"
model3 = init_chat_model("openai:gpt-4.1")

Provider-Specific Initialization

from langchain_openai import ChatOpenAI
from langchain_anthropic import ChatAnthropic
from langchain_google_genai import ChatGoogleGenerativeAI
import os

# OpenAI
openai = ChatOpenAI(
    model="gpt-4.1",
    temperature=0.7,
    max_tokens=1000,
    api_key=os.getenv("OPENAI_API_KEY"),
)

# Anthropic
anthropic = ChatAnthropic(
    model="claude-sonnet-4-5-20250929",
    temperature=0,
    max_tokens=2000,
    api_key=os.getenv("ANTHROPIC_API_KEY"),
)

# Google
google = ChatGoogleGenerativeAI(
    model="gemini-2.5-flash-lite",
    temperature=0.5,
    google_api_key=os.getenv("GOOGLE_API_KEY"),
)

Simple Invocation

from langchain.chat_models import init_chat_model

model = init_chat_model("gpt-4.1")

# String input (converted to HumanMessage)
response = model.invoke("What is LangChain?")
print(response.content)

# Message array input
response2 = model.invoke([
    {"role": "user", "content": "Hello!"}
])
print(response2.content)

Streaming Responses

from langchain.chat_models import init_chat_model

model = init_chat_model("gpt-4.1")

# Stream tokens as they arrive
for chunk in model.stream("Explain quantum computing"):
    print(chunk.content, end="", flush=True)

Batch Processing

from langchain.chat_models import init_chat_model

model = init_chat_model("gpt-4.1")

# Process multiple inputs in parallel
results = model.batch([
    "What is AI?",
    "What is ML?",
    "What is LangChain?"
])

for i, result in enumerate(results):
    print(f"Answer {i + 1}: {result.content}")

Multi-turn Conversation

from langchain.chat_models import init_chat_model

model = init_chat_model("gpt-4.1")

# Build conversation history
messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "What's the capital of France?"},
]

response1 = model.invoke(messages)
messages.append({"role": "assistant", "content": response1.content})

# Continue conversation
messages.append({"role": "user", "content": "What's its population?"})
response2 = model.invoke(messages)

print(response2.content)  # Knows we're talking about Paris

Model Configuration Options

from langchain_openai import ChatOpenAI

model = ChatOpenAI(
    model="gpt-4.1",
    
    # Control randomness (0 = deterministic, 1 = creative)
    temperature=0.7,
    
    # Limit response length
    max_tokens=500,
    
    # Alternative sampling method
    top_p=0.9,
    
    # Penalize repetition
    frequency_penalty=0.5,
    presence_penalty=0.5,
    
    # Stop generation at these strings
    stop=["\n\n", "END"],
    
    # Timeout for requests (seconds)
    request_timeout=30,
    
    # Max retries on failure
    max_retries=3,
)

Azure OpenAI

from langchain_openai import AzureChatOpenAI
import os

azure = AzureChatOpenAI(
    azure_endpoint=os.getenv("AZURE_OPENAI_ENDPOINT"),
    api_key=os.getenv("AZURE_OPENAI_API_KEY"),
    api_version="2024-02-15-preview",
    deployment_name="your-deployment-name",
)

AWS Bedrock

from langchain_aws import ChatBedrock

# AWS credentials from environment or ~/.aws/credentials
bedrock = ChatBedrock(
    model_id="anthropic.claude-3-5-sonnet-20240620-v1:0",
    region_name="us-east-1",
    # Credentials automatically loaded from environment
)

Model Selection Helper

from langchain.chat_models import init_chat_model

def get_model(task: str):
    model_map = {
        "reasoning": "openai:gpt-5",
        "fast": "google_genai:gemini-2.5-flash-lite",
        "vision": "openai:gpt-4.1",
        "long_context": "anthropic:claude-sonnet-4-5-20250929",
        "cost_effective": "openai:gpt-4.1-mini",
    }
    
    return init_chat_model(model_map.get(task, "openai:gpt-4.1"))

# Usage
reasoning_model = get_model("reasoning")
fast_model = get_model("fast")

Error Handling

from langchain.chat_models import init_chat_model
from openai import RateLimitError, AuthenticationError

model = init_chat_model("gpt-4.1")

try:
    response = model.invoke("Hello!")
    print(response.content)
except RateLimitError:
    print("Rate limit exceeded")
except AuthenticationError:
    print("Invalid API key")
except Exception as e:
    print(f"Error: {e}")

Async Invocation

from langchain.chat_models import init_chat_model
import asyncio

async def main():
    model = init_chat_model("gpt-4.1")
    
    # Async invoke
    response = await model.ainvoke("Hello!")
    print(response.content)
    
    # Async stream
    async for chunk in model.astream("Explain AI"):
        print(chunk.content, end="", flush=True)
    
    # Async batch
    results = await model.abatch([
        "What is AI?",
        "What is ML?",
    ])
    for result in results:
        print(result.content)

asyncio.run(main())

Checking Model Capabilities

from langchain.chat_models import init_chat_model

model = init_chat_model("gpt-4.1")

# Check if model supports features
print("Supports streaming:", hasattr(model, "stream"))
print("Supports tool calling:", hasattr(model, "bind_tools"))
print("Supports structured output:", hasattr(model, "with_structured_output"))

Boundaries

What You CAN Configure

✅ Model Selection: Any supported model from any provider
✅ Temperature: Control randomness (0-1)
✅ Max Tokens: Limit response length
✅ Stop Sequences: Define where to stop generation
✅ Timeout/Retries: Control request behavior
✅ API Keys: Per-model or from environment
✅ Provider-specific Options: Each provider has unique features

What You CANNOT Configure

❌ Model Training Data: Models are pre-trained
❌ Model Architecture: Can't modify internal structure
❌ Token Costs: Set by provider
❌ Rate Limits: Set by provider (can manage with queues)
❌ Model Capabilities: Vision/tool support is model-specific

Gotchas

1. API Key Not Found

# ❌ Problem: Missing API key
model = init_chat_model("openai:gpt-4.1")
model.invoke("Hello")  # Error: API key not found

# ✅ Solution: Set environment variable
import os
os.environ["OPENAI_API_KEY"] = "sk-..."
model = init_chat_model("openai:gpt-4.1")

# OR pass directly
from langchain_openai import ChatOpenAI
model = ChatOpenAI(
    model="gpt-4.1",
    api_key="sk-...",
)

2. Model Name Typos

# ❌ Problem: Wrong model name
model = init_chat_model("gpt4")  # Error!

# ✅ Solution: Use correct format
model = init_chat_model("openai:gpt-4.1")
# Or provider shorthand
model2 = init_chat_model("gpt-4.1")

3. Response Content Access

# ❌ Problem: Wrong property access
response = model.invoke("Hello")
print(response)  # AIMessage object, not string

# ✅ Solution: Access .content property
print(response.content)  # "Hello! How can I help you?"

# Or convert to string
print(str(response))

4. Streaming Requires Iteration

# ❌ Problem: Not iterating stream
stream = model.stream("Hello")
print(stream)  # Generator object, not chunks

# ✅ Solution: Use for loop
for chunk in model.stream("Hello"):
    print(chunk.content, end="", flush=True)

5. Temperature Confusion

# ❌ Problem: Wrong temperature range
model = ChatOpenAI(
    temperature=10,  # Too high! Should be 0-1
)

# ✅ Solution: Use 0-1 range
deterministic = ChatOpenAI(temperature=0)  # Always same
balanced = ChatOpenAI(temperature=0.7)  # Default
creative = ChatOpenAI(temperature=1)  # Maximum randomness

6. Token Limits

# ❌ Problem: Input + output exceeds model limit
long_text = "..." * 50000  # Very long text
model = init_chat_model("gpt-4.1")  # 128k context
model.invoke(long_text)  # May succeed

model2 = init_chat_model("gpt-4.1-mini")  # 16k context
model2.invoke(long_text)  # Error: context too long

# ✅ Solution: Check input length or use larger context model
import tiktoken

enc = tiktoken.encoding_for_model("gpt-4.1")
tokens = enc.encode(long_text)
print(f"Input tokens: {len(tokens)}")

if len(tokens) > 100000:
    # Use Claude with 200k context
    model = init_chat_model("anthropic:claude-opus-4")

7. Sync vs Async Confusion

# ❌ Problem: Using sync in async context
async def process():
    model = init_chat_model("gpt-4.1")
    response = model.invoke("Hello")  # Blocks async loop!

# ✅ Solution: Use async methods
async def process():
    model = init_chat_model("gpt-4.1")
    response = await model.ainvoke("Hello")  # Non-blocking
    
    async for chunk in model.astream("Hello"):
        print(chunk.content)

langchain-models

Install

langchain-models (Python)

Overview

When to Use Each Provider

Decision Tables

Choosing a Model

Initialization Methods

Code Examples

Basic Model Initialization

Provider-Specific Initialization

Simple Invocation

Streaming Responses

Batch Processing

Multi-turn Conversation

Model Configuration Options

Azure OpenAI

AWS Bedrock

Model Selection Helper

Error Handling

Async Invocation

Checking Model Capabilities

Boundaries

What You CAN Configure

What You CANNOT Configure

Gotchas

1. API Key Not Found

2. Model Name Typos

3. Response Content Access

4. Streaming Requires Iteration

5. Temperature Confusion

6. Token Limits

7. Sync vs Async Confusion

Links to Documentation

Categories

Install

langchain-models

Install

langchain-models (Python)

Overview

When to Use Each Provider

Decision Tables

Choosing a Model

Initialization Methods

Code Examples

Basic Model Initialization

Provider-Specific Initialization

Simple Invocation

Streaming Responses

Batch Processing

Multi-turn Conversation

Model Configuration Options

Azure OpenAI

AWS Bedrock

Model Selection Helper

Error Handling

Async Invocation

Checking Model Capabilities

Boundaries

What You CAN Configure

What You CANNOT Configure

Gotchas

1. API Key Not Found

2. Model Name Typos

3. Response Content Access

4. Streaming Requires Iteration

5. Temperature Confusion

6. Token Limits

7. Sync vs Async Confusion

Links to Documentation

Categories

Install

Recommended Skills