@lightfastai/ai-sdk

Production-ready AI agent framework with built-in observability, caching, and type safety.

Source: github.com/lightfastai/ai-sdk
Issues: github.com/lightfastai/ai-sdk/issues
Changelog: CHANGELOG.md

ESM-only. Requires Node ≥ 18.

Installation

npm install @lightfastai/ai-sdk
# or
pnpm add @lightfastai/ai-sdk
# or
yarn add @lightfastai/ai-sdk

Quick Start

import { createAgent } from '@lightfastai/ai-sdk/agent';
import { createTool } from '@lightfastai/ai-sdk/tool';
import { z } from 'zod';

// Define your runtime context type
interface AppRuntimeContext {
  userId: string;
  agentId: string;
}

// Create a tool with context awareness
const searchTool = createTool<AppRuntimeContext>({
  description: 'Search the web',
  inputSchema: z.object({
    query: z.string(),
  }),
  execute: async ({ query }, context) => {
    console.log(`User ${context.userId} searching for: ${query}`);
    // Your search implementation
    return `Search results for: ${query}`;
  },
});

// Create your tools object
const tools = {
  webSearch: searchTool,
};

// Create an agent with strong typing
const agent = createAgent<AppRuntimeContext, typeof tools>({
  name: 'assistant',
  system: 'You are a helpful AI assistant.',
  tools,
  model: gateway('anthropic/claude-3-sonnet'),
  createRuntimeContext: ({ sessionId, resourceId }) => ({
    userId: resourceId,
    agentId: 'assistant',
  }),
});

Production Setup

1. Environment Configuration

Create environment variables for production:

# AI Gateway
AI_GATEWAY_API_KEY=your-gateway-key

# Redis for memory/caching
KV_REST_API_URL=https://your-redis-url.upstash.io
KV_REST_API_TOKEN=your-redis-token

# Observability (Braintrust)
BRAINTRUST_API_KEY=your-braintrust-key
BRAINTRUST_PROJECT_NAME=your-project-name

# OpenTelemetry (optional)
OTEL_EXPORTER_OTLP_ENDPOINT=https://api.braintrust.dev/otel
OTEL_EXPORTER_OTLP_HEADERS=api-key=your-key

2. Complete Production Example

import { gateway } from '@ai-sdk/gateway';
import { createAgent } from '@lightfastai/ai-sdk/agent';
import { fetchRequestHandler } from '@lightfastai/ai-sdk/server/adapters/fetch';
import { RedisMemory } from '@lightfastai/ai-sdk/memory/adapters/redis';
import { AnthropicProviderCache, ClineConversationStrategy } from '@lightfastai/ai-sdk/cache';
import { smoothStream, stepCountIs, wrapLanguageModel } from 'ai';
import { BraintrustMiddleware, initLogger, traced } from 'braintrust';
import { randomUUID } from 'node:crypto';

// Initialize observability
initLogger({
  apiKey: process.env.BRAINTRUST_API_KEY,
  projectName: process.env.BRAINTRUST_PROJECT_NAME || 'my-app',
});

// Define your tools
const tools = {
  // Your tool implementations
};

// Production-ready route handler
export async function POST(req: Request) {
  const { sessionId, agentId } = await req.json();
  
  // Initialize memory with Redis
  const memory = new RedisMemory({
    url: process.env.KV_REST_API_URL,
    token: process.env.KV_REST_API_TOKEN,
  });

  // Wrap handler with tracing
  return traced(
    async () => {
      return fetchRequestHandler({
        agent: createAgent<AppRuntimeContext, typeof tools>({
          name: agentId,
          system: 'Your system prompt here',
          tools,
          
          // Production caching strategy (proven with Claude)
          cache: new AnthropicProviderCache({
            strategy: new ClineConversationStrategy({
              cacheSystemPrompt: true,
              recentUserMessagesToCache: 2,
            }),
          }),
          
          // Runtime context creation
          createRuntimeContext: ({ sessionId, resourceId }) => ({
            userId: resourceId,
            agentId,
          }),
          
          // Model with observability middleware
          model: wrapLanguageModel({
            model: gateway('anthropic/claude-3-sonnet'),
            middleware: BraintrustMiddleware({ debug: true }),
          }),
          
          // Anthropic-specific options (thinking mode)
          providerOptions: {
            anthropic: {
              thinking: {
                type: 'enabled',
                budgetTokens: 32000, // Generous budget for complex reasoning
              },
            },
          },
          
          // Streaming configuration
          experimental_transform: smoothStream({
            delayInMs: 25,
            chunking: 'word',
          }),
          
          // Step limits
          stopWhen: stepCountIs(30),
          
          // OpenTelemetry configuration
          experimental_telemetry: {
            isEnabled: !!process.env.OTEL_EXPORTER_OTLP_HEADERS,
            metadata: {
              agentId,
              sessionId,
              userId: 'user-id',
            },
          },
          
          // Type-safe event handlers
          onChunk({ chunk }) {
            if (chunk.type === 'tool-call') {
              // TypeScript knows your exact tool names here
              console.log(`Tool called: ${chunk.toolName}`);
            }
          },
          
          onFinish({ finishReason, usage, reasoningText }) {
            console.log('Stream finished', {
              reason: finishReason,
              totalTokens: usage?.totalTokens,
              hasReasoning: !!reasoningText,
            });
          },
          
          onError(error) {
            console.error('Agent error:', error);
            // Send to error tracking service
          },
        }),
        sessionId,
        memory,
        req,
        resourceId: 'user-id',
        generateId: randomUUID,
        enableResume: true, // Enable stream resumption
      });
    },
    {
      type: 'function',
      name: `POST /api/agent/${agentId}`,
    }
  );
}

3. Caching Strategies

Anthropic Cache Control (Recommended for Claude)

import { AnthropicProviderCache, ClineConversationStrategy } from '@lightfastai/ai-sdk/cache';

// Proven strategy from Cline AI assistant
const cache = new AnthropicProviderCache({
  strategy: new ClineConversationStrategy({
    cacheSystemPrompt: true,          // Cache system prompt
    recentUserMessagesToCache: 2,     // Cache last 2 user messages
  }),
});

This strategy:

Caches the system prompt (stable, reused across conversations)
Caches recent user messages (often referenced in follow-ups)
Optimizes for token efficiency with Claude models

4. Memory Adapters

Redis Memory (Production)

import { RedisMemory } from '@lightfastai/ai-sdk/memory/adapters/redis';

const memory = new RedisMemory({
  url: process.env.KV_REST_API_URL,
  token: process.env.KV_REST_API_TOKEN,
});

In-Memory (Development)

import { InMemoryMemory } from '@lightfastai/ai-sdk/memory/adapters/in-memory';

const memory = new InMemoryMemory();

5. Tool Development

Type-Safe Tool Creation

import { createTool } from '@lightfastai/ai-sdk/tool';
import { z } from 'zod';

// Tools receive merged context: SystemContext & RequestContext & AppRuntimeContext
const fileWriteTool = createTool<AppRuntimeContext>({
  description: 'Write content to a file',
  inputSchema: z.object({
    path: z.string(),
    content: z.string(),
  }),
  execute: async ({ path, content }, context) => {
    // Access full context
    console.log(`User ${context.userId} writing to ${path}`);
    console.log(`Session: ${context.sessionId}`); // From SystemContext
    
    // Your file write logic
    await fs.writeFile(path, content);
    return `File written: ${path}`;
  },
});

6. Observability & Monitoring

Braintrust Integration

import { initLogger, traced } from 'braintrust';

// Initialize once at app startup
initLogger({
  apiKey: process.env.BRAINTRUST_API_KEY,
  projectName: 'my-app',
});

// Wrap handlers with tracing
const response = await traced(
  async () => {
    // Your agent logic
  },
  {
    type: 'function',
    name: 'agent-handler',
  }
);

OpenTelemetry Support

experimental_telemetry: {
  isEnabled: !!process.env.OTEL_EXPORTER_OTLP_HEADERS,
  metadata: {
    agentId,
    sessionId,
    userId,
    // Custom metadata for tracing
  },
}

7. Advanced Features

Stream Resumption

Enable clients to resume interrupted streams:

fetchRequestHandler({
  // ... other config
  enableResume: true,
  generateId: randomUUID, // Consistent ID generation (from `node:crypto`)
});

Thinking Models (Claude)

Enable Claude's thinking mode for complex reasoning:

providerOptions: {
  anthropic: {
    thinking: {
      type: 'enabled',
      budgetTokens: 32000, // Token budget for thinking
    },
  },
}

Custom Streaming

Control streaming behavior:

experimental_transform: smoothStream({
  delayInMs: 25,      // Delay between chunks
  chunking: 'word',   // Chunk by word boundaries
})

8. Type Safety

The agent system provides full type safety for tools:

const tools = {
  search: searchTool,
  write: writeTool,
};

const agent = createAgent<AppRuntimeContext, typeof tools>({
  // ... config
  onChunk({ chunk }) {
    if (chunk.type === 'tool-call') {
      // TypeScript knows toolName can only be "search" | "write"
      if (chunk.toolName === 'search') {
        // Handle search tool
      }
      // chunk.toolName === 'unknown' would cause TypeScript error
    }
  },
});

API Reference

`createAgent<TRuntimeContext, TTools>(options)`

Creates a strongly-typed agent with tool support.

Type Parameters

TRuntimeContext - Your app-specific context type
TTools - The tools object type (inferred from typeof tools)

Options

name: string - Agent identifier
system: string - System prompt
tools?: TTools - Tools available to the agent
model: LanguageModel - The AI model to use
cache?: ProviderCache - Caching strategy
createRuntimeContext?: (params) => TRuntimeContext - Context factory
onChunk?: (event) => void - Stream chunk handler
onFinish?: (event) => void - Completion handler
onError?: (error) => void - Error handler
Plus all Vercel AI SDK options

`createTool<TContext>(options)`

Creates a context-aware tool.

Type Parameters

TContext - The runtime context type

Options

description: string - Tool description for the model
inputSchema: ZodSchema - Input validation schema
execute: (input, context) => Promise<string> - Tool implementation

`fetchRequestHandler(options)`

Handles HTTP requests for agent streaming.

Options

agent: Agent - The agent instance
sessionId: string - Session identifier
memory: Memory - Memory adapter
req: Request - The HTTP request
resourceId: string - Resource/user identifier
generateId?: () => string - ID generator
enableResume?: boolean - Enable stream resumption

Best Practices

Always use environment variables for sensitive configuration
Implement proper error handling in tool execute functions
Use caching strategies appropriate for your model provider
Enable observability from day one with Braintrust or OpenTelemetry
Leverage TypeScript for type-safe tool definitions
Set reasonable step limits to prevent infinite loops
Use Redis memory for production deployments
Implement stream resumption for better reliability
Monitor token usage through the onFinish handler
Test tools thoroughly with comprehensive input validation

Contributing

Issues and pull requests are welcome at github.com/lightfastai/ai-sdk. Releases are cut via changesets — open a PR with a pnpm changeset entry describing the change.

License

MIT — see LICENSE.

Name		Name	Last commit message	Last commit date
Latest commit History 71 Commits
.changeset		.changeset
.github/workflows		.github/workflows
src/core		src/core
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
LICENSE		LICENSE
README.md		README.md
package.json		package.json
pnpm-lock.yaml		pnpm-lock.yaml
tsconfig.base.json		tsconfig.base.json
tsconfig.build.json		tsconfig.build.json
tsconfig.json		tsconfig.json
tsup.config.ts		tsup.config.ts
vitest.config.ts		vitest.config.ts

Folders and files

Latest commit

History

Repository files navigation

@lightfastai/ai-sdk

Installation

Quick Start

Production Setup

1. Environment Configuration

2. Complete Production Example

3. Caching Strategies

Anthropic Cache Control (Recommended for Claude)

4. Memory Adapters

Redis Memory (Production)

In-Memory (Development)

5. Tool Development

Type-Safe Tool Creation

6. Observability & Monitoring

Braintrust Integration

OpenTelemetry Support

7. Advanced Features

Stream Resumption

Thinking Models (Claude)

Custom Streaming

8. Type Safety

API Reference

createAgent<TRuntimeContext, TTools>(options)

Type Parameters

Options

createTool<TContext>(options)

Type Parameters

Options

fetchRequestHandler(options)

Options

Best Practices

Contributing

License

About

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

`createAgent<TRuntimeContext, TTools>(options)`

`createTool<TContext>(options)`

`fetchRequestHandler(options)`

Packages