AI-powered programming assistant with local models
Русская версия • Features • Instruments • Tools Reference
Ollama Code is a CLI tool for AI-powered programming assistance using local Ollama models. The project provides full control over code and data, working completely offline.
- 🚀 Fully Local — all models run locally via Ollama
- 💾 Context Caching — KV-cache reuse for 80-90% faster multi-turn conversations
- 💻 CLI Interface — convenient terminal interface based on Ink (React for CLI)
- 🌐 Web UI — full-featured Next.js web interface with chat, file explorer, terminal
- 🔧 Code Tools — read, edit, search files, execute commands
- 🔌 MCP Support — integration with Model Context Protocol servers
- 🌐 Web Search — integration with Tavily and Google Custom Search
- 📦 Extensions — extension system for adding new capabilities
- 🐛 Debugging — built-in VSCode debugging support
- 🧠 Thinking Models — support for reasoning models (DeepSeek R1)
- 📊 Code Analysis — code quality analysis with A-F grading
- 🎨 Diagram Generator — create Mermaid and PlantUML diagrams
- 🔀 Git Workflow — complete git workflow (commit, push, pull, MR/PR creation)
- 🔀 Git Advanced — advanced git operations (stash, cherry-pick, rebase, bisect)
- 🌐 API Tester — REST API endpoint testing
- 🔐 SSH Tools — remote server connectivity with profile management
- 🏷️ Tool Aliases — short names for tools (`run` → `run_shell_command`)
- 🧠 Self-Learning — automatic learning of tool names from errors
- 🔄 Undo/Redo — reversible operations with command pattern
- 🔌 Plugin System — dynamic tool loading and runtime registration
- Node.js 20.x or 22.x LTS (Node.js 23+ / 24+ / 25+ NOT supported due to node-pty compilation issues)
- pnpm >= 10.0.0
- Ollama installed and running (https://ollama.ai)
| Node.js Version | Status | node-pty Compatibility |
|---|---|---|
| 18.x | LTS (Maintenance) | ✅ Works |
| 20.x | LTS (Current) | ✅ Works |
| 22.x | LTS (Latest) | ✅ Works |
| 23.x | Current | ❌ Does not work |
| 24.x | Nightly | ❌ Does not work |
| 25.x | Experimental | ❌ Does not work |
⚠️ Important: Node.js 23+ is not supported. The `node-pty` native module requires Node.js 20.x or 22.x LTS. If you have Node.js 23+, please downgrade:

```bash
# Using asdf
asdf install nodejs 22.14.0
asdf local nodejs 22.14.0
```
Different models require different amounts of VRAM. Below is a guide for NVIDIA GPUs:
| Model Size | Min VRAM | Recommended GPU | Notes |
|---|---|---|---|
| 3B | 4 GB | RTX 3050, GTX 1660, RTX 5060 | Basic models, quantization recommended |
| 7B | 6 GB | RTX 3060, RTX 4060, RTX 5060 | Good balance of speed and quality |
| 8B | 8 GB | RTX 3070, RTX 4060 Ti, RTX 5070 | DeepSeek R1, Llama 3.1 |
| 14B | 12 GB | RTX 3080, RTX 4070, RTX 5070 Ti | Qwen2.5-Coder 14B |
| 30B | 20 GB | RTX 3090, RTX 4090, RTX 5080 | Qwen3-Coder 30B |
| 70B | 32 GB | RTX 5090, 2x RTX 3090 | DeepSeek R1 70B, Llama 3.1 70B |
| 120B+ | 48+ GB | 2x RTX 5090, A100 | Requires multi-GPU or cloud |
Performance tests conducted with standard tasks (code generation, refactoring, debugging):
| GPU | VRAM | Model | Quantization | Speed (tok/s) | Quality Score |
|---|---|---|---|---|---|
| RTX 3060 | 12 GB | llama3.2:3b | Q4_K_M | 45-55 | Good |
| RTX 3060 | 12 GB | qwen2.5-coder:7b | Q4_K_M | 28-35 | Very Good |
| RTX 3060 | 12 GB | deepseek-r1:8b | Q4_K_M | 22-28 | Excellent |
| RTX 3060 | 12 GB | qwen2.5-coder:14b | Q3_K_M | 12-18 | Excellent |
| RTX 3070 | 8 GB | llama3.2:3b | Q4_K_M | 55-65 | Good |
| RTX 3070 | 8 GB | qwen2.5-coder:7b | Q4_K_M | 35-42 | Very Good |
| RTX 3070 | 8 GB | deepseek-r1:8b | Q4_K_M | 28-35 | Excellent |
| RTX 3080 | 10 GB | qwen2.5-coder:7b | Q8_0 | 40-48 | Excellent |
| RTX 3080 | 10 GB | qwen2.5-coder:14b | Q4_K_M | 25-32 | Excellent |
| RTX 3080 | 10 GB | deepseek-r1:8b | Q8_0 | 32-40 | Excellent |
| RTX 3090 | 24 GB | qwen2.5-coder:14b | Q8_0 | 38-45 | Excellent |
| RTX 3090 | 24 GB | qwen3-coder:30b | Q4_K_M | 18-25 | Outstanding |
| RTX 3090 | 24 GB | deepseek-r1:32b | Q4_K_M | 12-18 | Outstanding |
| RTX 4070 | 12 GB | qwen2.5-coder:14b | Q5_K_M | 35-42 | Excellent |
| RTX 4070 Ti | 16 GB | qwen3-coder:30b | Q4_K_M | 22-28 | Outstanding |
| RTX 4080 | 16 GB | qwen3-coder:30b | Q5_K_M | 28-35 | Outstanding |
| RTX 4080 | 16 GB | deepseek-r1:32b | Q4_K_M | 20-28 | Outstanding |
| RTX 4090 | 24 GB | qwen3-coder:30b | Q8_0 | 45-55 | Outstanding |
| RTX 4090 | 24 GB | deepseek-r1:32b | Q5_K_M | 35-45 | Outstanding |
| RTX 4090 | 24 GB | deepseek-r1:70b | Q3_K_M | 8-12 | Exceptional |
| RTX 5070 | 12 GB | qwen2.5-coder:14b | Q6_K | 45-55 | Excellent |
| RTX 5070 Ti | 16 GB | qwen3-coder:30b | Q5_K_M | 35-45 | Outstanding |
| RTX 5080 | 16 GB | qwen3-coder:30b | Q6_K | 45-55 | Outstanding |
| RTX 5080 | 16 GB | deepseek-r1:32b | Q5_K_M | 38-48 | Outstanding |
| RTX 5090 | 32 GB | qwen3-coder:30b | FP16 | 80-100 | Exceptional |
| RTX 5090 | 32 GB | deepseek-r1:70b | Q4_K_M | 25-35 | Exceptional |
| RTX 5090 | 32 GB | llama3.1:70b | Q5_K_M | 30-40 | Exceptional |
Note: Speed varies based on context length, prompt complexity, and system configuration. Quality Score is subjective based on code generation accuracy and coherence. RTX 50 series shows significant performance improvements due to Blackwell architecture and GDDR7 memory.
| Quantization | Size Reduction | Quality Loss | Recommended For |
|---|---|---|---|
| Q4_K_M | ~70% | Minimal | Most use cases |
| Q5_K_M | ~65% | Very Low | Better quality |
| Q6_K | ~60% | Negligible | High quality needs |
| Q8_0 | ~50% | None | Maximum quality |
| FP16 | 0% | None | RTX 5090 with 32GB+ VRAM |
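As a rule of thumb, the VRAM needed for the weights alone can be estimated from the parameter count and the quantization's bits per weight. A minimal TypeScript sketch (the bits-per-weight figures are approximations for llama.cpp-style formats, and real usage adds KV-cache and runtime overhead on top):

```typescript
// Rough VRAM estimate: weights ≈ parameter count × bits per weight / 8.
// Bits-per-weight values below are approximate effective sizes for these formats.
const BITS_PER_WEIGHT: Record<string, number> = {
  Q4_K_M: 4.8,
  Q5_K_M: 5.7,
  Q6_K: 6.6,
  Q8_0: 8.5,
  FP16: 16,
};

/** Approximate weight memory in GiB for a model of `billions` parameters. */
function approxWeightGB(billions: number, quant: string): number {
  const bits = BITS_PER_WEIGHT[quant] ?? 16; // default to FP16 if unknown
  const bytes = (billions * 1e9 * bits) / 8;
  return bytes / 1024 ** 3;
}

// A 7B model at Q4_K_M needs roughly 4 GiB for weights alone,
// which matches the 6 GB minimum in the table once overhead is added.
console.log(approxWeightGB(7, "Q4_K_M").toFixed(1));
```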
```bash
# Clone the repository
git clone <repository-url>
cd ollama-code

# Install dependencies
npm install

# Build the project
npm run build
```

```bash
# Interactive mode
npm run start

# With specific model
npm run start -- --model llama3.2

# One-off query
npm run start -- "Explain how async/await works in JavaScript"

# Debug mode
npm run debug
```

Load and execute predefined task files:
```bash
# Run a task by name (from ~/.ollama-code/tasks/)
npm run cli -- --model qwen2.5-coder:14b --yolo --file tools-demo

# Run with full path
npm run cli -- --yolo --file ~/my-tasks/project-audit.md

# Run with relative path
npm run cli -- --yolo --file ./TASK.md

# Combine with other options
npm run cli -- --model deepseek-r1:8b --yolo --file code-review
```

Available Task Files (in `~/.ollama-code/tasks/`):
| Task File | Description |
|---|---|
| `tools-demo` | Test all 46+ builtin tools |
| `project-audit` | Comprehensive project audit |
| `code-review` | Review code changes |
| `debug-issue` | Debug an issue |
| `refactor-module` | Refactor a module |
| `quick-fix` | Quick functionality test |
Ollama Code now includes a full-featured web interface:
```bash
# Start Web UI (development)
cd packages/web-app
npm run dev

# Start with terminal support
npm run dev:server
```

Web UI Features:
| Tab | Features |
|---|---|
| Chat | Streaming responses, model selection, session management |
| Files | File browser, Monaco editor, syntax highlighting |
| Terminal | Full PTY terminal with xterm.js |
API Endpoints:
| Endpoint | Description |
|---|---|
| `/api/models` | List available Ollama models |
| `/api/chat` | Chat with streaming |
| `/api/generate` | Generate with streaming |
| `/api/fs` | Filesystem operations |
| `/terminal` | WebSocket terminal |
When models output JSON as text instead of making actual tool calls, the system now automatically detects and executes them:
| Before | After |
|---|---|
| Model outputs JSON as text → ignored | JSON detected → executed as tool calls |
| "Model responded (no tool calls)" | "📝 Extracted N tool call(s) from text output" |
Example:

```
✅ Model responded in 323.0s (no tool calls)
📝 Extracted 2 tool call(s) from text output
  → model_storage (from text)
  → model_storage (from text)
🔧 Executing 2 tool call(s)...
```
This makes the system more robust for models that "give up" and output JSON as text instead of making actual tool calls.
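The detection step can be sketched as follows; the `extractToolCalls` helper and the `{ name, arguments }` shape are illustrative assumptions, not the actual implementation:

```typescript
// Sketch: find tool calls a model emitted as plain JSON text instead of
// native tool-call messages. The real parser is more forgiving than this.
interface ExtractedCall {
  name: string;
  arguments: Record<string, unknown>;
}

function extractToolCalls(text: string): ExtractedCall[] {
  const calls: ExtractedCall[] = [];
  // Find JSON-looking blocks; one level of brace nesting is enough for a sketch.
  const candidates = text.match(/\{[^{}]*(?:\{[^{}]*\}[^{}]*)*\}/g) ?? [];
  for (const c of candidates) {
    try {
      const parsed = JSON.parse(c);
      if (
        typeof parsed.name === "string" &&
        parsed.arguments !== null &&
        typeof parsed.arguments === "object"
      ) {
        calls.push({ name: parsed.name, arguments: parsed.arguments });
      }
    } catch {
      // Not valid JSON: ignore this candidate.
    }
  }
  return calls;
}
```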
Bug Fixes:
| Issue | Solution |
|---|---|
| Double "Loaded task file" output | Only print in child process (OLLAMA_CODE_NO_RELAUNCH=true) |
| "fetch failed" in addWithEmbedding | Save data first, embedding is optional |
| Model gives up after error | Added guidance hints in tool results |
Resilient Storage Operations:

```jsonc
// Before: failed completely if Ollama was not running
{ "operation": "addWithEmbedding", ... } // ❌ fetch failed

// After: saves data even without embeddings
{ "operation": "addWithEmbedding", ... } // ✅ Data saved (embedding skipped)
```

Model Guidance in Tool Results:
| Situation | Hint Added |
|---|---|
| Success | "Continue with the next step in your task." |
| Embedding failed | "Data saved successfully. Continue..." |
| Error | "Try using 'set' operation instead..." |
Bug Fixes:
| Issue | Solution |
|---|---|
| "Unknown file extension .md" error | Replaced static imports with runtime fs.readFileSync() |
| Wrong template paths in bundle | Added multiple path detection for bundled/dist modes |
| Missing templates | Fallback templates when files not found |
New --file Option:
```bash
# Load task by name (from ~/.ollama-code/tasks/)
npm run cli -- --file tools-demo

# Load with full path
npm run cli -- --file ~/my-tasks/project-audit.md

# Load with relative path
npm run cli -- --file ./TASK.md
```

Supported Path Formats:
| Format | Example | Description |
|---|---|---|
| Name | `tools-demo` | Looks in `~/.ollama-code/tasks/` |
| Home | `~/tasks/test.md` | Expanded to home directory |
| Relative | `./TASK.md` | Relative to current directory |
| Absolute | `/path/to/task.md` | Full path |
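The resolution order above can be sketched roughly like this (a hypothetical `resolveTaskFile` helper, not the actual CLI code):

```typescript
import * as os from "node:os";
import * as path from "node:path";

// Sketch of --file path resolution, following the formats in the table above.
function resolveTaskFile(input: string): string {
  if (input.startsWith("~")) {
    // "~/tasks/test.md" → expanded against the home directory
    return path.join(os.homedir(), input.slice(1));
  }
  if (input.includes("/") || input.endsWith(".md")) {
    // "./TASK.md" or "/path/to/task.md" → resolved as a normal path
    return path.resolve(input);
  }
  // Bare name → look in ~/.ollama-code/tasks/ and add the .md extension
  return path.join(os.homedir(), ".ollama-code", "tasks", `${input}.md`);
}
```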
Universal key-value storage for AI models to persist data between sessions:
| Operation | Description |
|---|---|
| `set` | Store a value |
| `get` | Retrieve a value |
| `delete` | Remove a key |
| `list` | List all keys in namespace |
| `append` | Append item to array |
| `merge` | Merge object with existing |
| `clear` | Clear all data in namespace |
| Namespace | Purpose | Default Mode |
|---|---|---|
| `roadmap` | Project roadmap, milestones, plans | persistent |
| `session` | Temporary session data | session |
| `knowledge` | Learned facts, patterns, preferences | persistent |
| `context` | Current task context and state | session |
| `learning` | Tool aliases, corrections | persistent |
| `metrics` | Statistics, performance data | persistent |
- global: `~/.ollama-code/storage/` (shared across all projects)
- project: `<project>/storage/` (project-specific data)
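The operation set above behaves like a namespaced map. A minimal in-memory sketch (the real `model_storage` tool persists to disk and honors the namespace modes and scopes):

```typescript
// In-memory sketch of the model_storage operations: set/get/delete/list/append/merge/clear.
type Namespace = Map<string, unknown>;

class ModelStorage {
  private namespaces = new Map<string, Namespace>();

  private ns(name: string): Namespace {
    if (!this.namespaces.has(name)) this.namespaces.set(name, new Map());
    return this.namespaces.get(name)!;
  }

  set(ns: string, key: string, value: unknown): void { this.ns(ns).set(key, value); }
  get(ns: string, key: string): unknown { return this.ns(ns).get(key); }
  delete(ns: string, key: string): boolean { return this.ns(ns).delete(key); }
  list(ns: string): string[] { return [...this.ns(ns).keys()]; }
  append(ns: string, key: string, item: unknown): void {
    const arr = (this.ns(ns).get(key) as unknown[] | undefined) ?? [];
    arr.push(item);
    this.ns(ns).set(key, arr);
  }
  merge(ns: string, key: string, obj: object): void {
    const cur = (this.ns(ns).get(key) as object | undefined) ?? {};
    this.ns(ns).set(key, { ...cur, ...obj });
  }
  clear(ns: string): void { this.ns(ns).clear(); }
}
```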
| Alias | Tool |
|---|---|
| `storage`, `store` | `model_storage` |
| `kv`, `cache` | `model_storage` |
| `roadmap`, `persist` | `model_storage` |
Complete SSH profile management system for storing and reusing SSH credentials:
| Tool | Description |
|---|---|
| `ssh_add_host` | Save SSH profile with credentials |
| `ssh_list_hosts` | List all saved SSH profiles |
| `ssh_remove_host` | Delete a saved SSH profile |
The `ssh_connect` tool now supports using saved profiles:

```jsonc
// Using saved profile
{ "profile": "production", "command": "docker ps" }

// Direct connection (still supported)
{ "host": "192.168.1.100", "user": "admin", "command": "ls -la" }
```

| Alias | Tool |
|---|---|
| `ssh`, `ssh_connect`, `remote` | `ssh_connect` |
| `ssh_add_host`, `add_host` | `ssh_add_host` |
| `ssh_list_hosts`, `list_hosts` | `ssh_list_hosts` |
| `ssh_remove_host`, `remove_host` | `ssh_remove_host` |
- SSH credentials are stored in `~/.ollama-code/ssh_credentials.json`
- Use `identity_file` (SSH key) authentication instead of a password when possible
Full git workflow integration with support for GitHub and GitLab (including self-hosted):
| Operation | Description |
|---|---|
| `status` | Check repository status |
| `add`, `commit` | Stage and commit changes |
| `push`, `pull` | Remote operations |
| `clone` | Clone repository |
| `create_branch` | Create new branch |
| `switch` | Switch to branch |
| `log`, `diff` | History and differences |
| `create_mr` | Create GitLab Merge Request |
| `create_pr` | Create GitHub Pull Request |
| `create_merge` | Auto-detect and create MR/PR |
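The `create_merge` auto-detection presumably inspects the repository's remote URL. A simplified heuristic sketch (hypothetical helper, not the actual implementation, which also handles self-hosted GitLab detection):

```typescript
// Sketch: guess the hosting platform from the "origin" remote URL.
function detectPlatform(remoteUrl: string): "github" | "gitlab" | "unknown" {
  if (remoteUrl.includes("github.com")) return "github";
  // Covers gitlab.com and many self-hosted instances with "gitlab" in the hostname.
  if (remoteUrl.includes("gitlab")) return "gitlab";
  return "unknown";
}

// detectPlatform("git@github.com:me/repo.git") → "github" → create_pr
// detectPlatform("https://gitlab.example.com/me/repo.git") → "gitlab" → create_mr
```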
| Operation | Description |
|---|---|
| `auth_status` | Check authentication status for GitHub/GitLab |
| `auth_login` | Interactive login instructions |
| `auth_logout` | Logout from GitHub/GitLab |
| `auth_token` | Set authentication token |
- GitHub: Full support via `gh` CLI
- GitLab.com: Full support via `glab` CLI
- Self-hosted GitLab: Auto-detection and support
New SSH Tools plugin for remote server connectivity with profile management:
- SSH connection management
- SCP file transfer
- Profile-based configuration
Added 70+ new tool aliases to handle common model hallucinations gracefully:
| Category | New Aliases |
|---|---|
| Docker | docker, docker_dev, container, container_dev, docker_compose, compose |
| Database | database, db, sql, mysql, postgres, mongodb, redis |
| Kubernetes | kubernetes, k8s, kubectl, helm |
| CI/CD | ci, cd, github_actions, gitlab_ci, jenkins |
| Infrastructure | terraform, ansible, aws, azure, gcp |
- Added TypeScript to workspace root for better IDE support
- Fixed "Cannot find lib.es2023.d.ts" and related TypeScript errors
- Fixed global type errors (Promise, Boolean, etc.)
- Replaced `bun` with `pnpm` for the assets build; `pnpm` is already required for monorepo workspace support
All core dependencies updated to their latest stable versions:
| Package | New Version |
|---|---|
| `zod` | 4.3.6 |
| `@modelcontextprotocol/sdk` | 1.27.1 |
| `ajv-formats` | 3.0.1 |
| `msw` | 2.12.10 |
Migrated to Zod v4 for compatibility with latest MCP SDK:
| Change | Description |
|---|---|
| `z.record()` | Now requires explicit key/value type arguments |
| `z.string()` | Simplified error message parameter |
| `ZodObject` | Simplified generic parameters |
| Error structure | Uses `.issues` instead of `.errors` |
- ✅ TypeScript compilation passes
- ✅ Build succeeds
- ✅ Bundle created successfully
All tests migrated from tools/ to plugins/builtin/:
| Category | Tests |
|---|---|
| dev-tools | python, nodejs, golang, rust, java, cpp, swift, php, typescript |
| file-tools | edit, glob, ls, read-file, read-many-files, write-file |
| search-tools | grep, ripGrep, web-fetch |
| database-tools | database, docker, redis |
| mcp-tools | mcp-client, mcp-tool, mcp-client-manager |
| Other | skill, task, todoWrite, exitPlanMode, lsp, shell, git-advanced, etc. |
Fixed all TypeScript strict mode errors with explicit type annotations.
- `zod` upgraded to 3.25.0 for MCP SDK compatibility
- `ajv-formats` downgraded to 2.1.1 for ESM compatibility
Comprehensive codebase cleanup with all ESLint errors resolved:
| Metric | Before | After |
|---|---|---|
| ESLint Errors | 336 | 0 |
| ESLint Warnings | 48 | 48 |
| TypeScript Errors | 0 | 0 |
Key Improvements:
- Fixed 150+ unused variables/imports
- Added default cases to all switch statements
- Fixed optional chaining for type safety
- Relaxed test file rules for better DX
Full-featured web application with three main components:
| Component | Technology | Features |
|---|---|---|
| ChatInterface | React + Zustand | Streaming, model selection, session persistence |
| FileExplorer | Monaco Editor | Syntax highlighting, multi-language support, auto-save |
| TerminalEmulator | xterm.js + node-pty | Full PTY support, resize, 256 colors |
Terminal WebSocket Server:
- Full PTY support via WebSocket
- Session management with IP limits
- Timeout cleanup for inactive sessions
Comprehensive API documentation for all packages:
```typescript
// SDK Usage
import { query, createSdkMcpServer, tool } from '@ollama-code/sdk';

const result = await query({
  prompt: 'Explain async/await',
  model: 'llama3.2',
});

// MCP Server
const myTool = tool({
  name: 'echo',
  description: 'Echo back a message',
  parameters: { message: { type: 'string' } },
  execute: async (params) => ({ echo: params.message }),
});
```

- TypeScript Configuration: Fixed monorepo project references
- HTTP Client: Completed fetch → axios migration
- Terminal Server: WebSocket-based PTY with session management
- Documentation: TSDoc for core and SDK packages
| Component | Description |
|---|---|
| PluginLoader | Discovery from builtin, user, project, npm sources |
| PluginManager | Lifecycle management with enable/disable hooks |
| PluginSandbox | Filesystem, network, command restrictions |
| PluginMarketplace | NPM-based search, install, update, uninstall |
Builtin Plugins (5):
- `core-tools` — echo, timestamp, get_env
- `dev-tools` — python_dev, nodejs_dev, golang_dev, rust_dev, typescript_dev
- `file-tools` — read_file, write_file, edit_file
- `search-tools` — grep, glob, web_fetch
- `shell-tools` — run_shell_command
| Model Size | Template | Prompt Size |
|---|---|---|
| <= 10B | 8b | ~500 tokens |
| <= 30B | 14b | ~800 tokens |
| <= 60B | 32b | ~1200 tokens |
| > 60B | 70b | ~1500 tokens |
Completed migration to axios for all HTTP operations:
| File | Changes |
|---|---|
| `packages/core/src/utils/httpClient.ts` | Axios instance with interceptors |
| `packages/core/src/core/ollamaNativeClient.ts` | Streaming with axios |
| `packages/core/src/tools/web-search/providers/*.ts` | Provider migration |
Features:
- Request/Response logging
- Retry with exponential backoff
- Timeout handling
- Auth header injection
Fixed monorepo TypeScript configuration:
- Added project references for all packages
- Added `composite: true` for referenced packages
- Fixed ESLint configuration for web-app
- Separated server code tsconfig (`tsconfig.server.json`)
- React Hooks Rules Fix: Fixed `LoadingIndicator.tsx` where `useMemo` hooks were called after an early `return`, violating the Rules of Hooks
| Feature | Description |
|---|---|
| Plugin System v2 | PluginLoader, PluginCLI, PluginMarketplace, PluginSandbox |
| HTTP Client | Axios with interceptors, retry logic, timeout handling |
| React Optimization | 6 specialized contexts, 11 memoized components |
| Cancellation | CancellationToken, AbortController cleanup |
| Context Caching | KV-cache reuse for 80-90% faster conversations |
Major architectural enhancements for better performance and extensibility:
| Feature | Description |
|---|---|
| Zustand Migration | Replaced Context API, eliminates unnecessary re-renders |
| Event Bus | Typed pub/sub system for loose component coupling |
| Command Pattern | Full Undo/Redo support for reversible operations |
| Plugin System v1 | Dynamic tool loading, builtin plugins, lifecycle hooks |
| Context Caching | KV-cache reuse for 80-90% faster conversations |
| Prompt Documentation | Complete documentation of prompt formation system |
| Store | Purpose |
|---|---|
| `sessionStore` | Session state and metrics |
| `streamingStore` | Streaming state + AbortController |
| `uiStore` | UI settings with persistence |
| `commandStore` | Command pattern for undo/redo |
| `eventBus` | Event pub/sub system |
Dynamic plugin architecture with lifecycle hooks:
```typescript
const plugin: PluginDefinition = {
  metadata: { id: 'my-plugin', name: 'My Plugin', version: '1.0.0' },
  tools: [
    { id: 'hello', name: 'hello', execute: async () => ({ success: true }) },
  ],
  hooks: {
    onLoad: async (ctx) => ctx.logger.info('Loaded'),
    onBeforeToolExecute: async (id, params) => true,
  },
};
```

Builtin Plugins:

- `core-tools` — echo, timestamp, get_env
- `dev-tools` — python_dev, nodejs_dev, golang_dev, rust_dev, typescript_dev
- `file-tools` — read_file, write_file, edit_file
- `search-tools` — grep, glob, web_fetch
- `shell-tools` — run_shell_command
Typed events for cross-component communication:
```typescript
// Subscribe to events
eventBus.subscribe('stream:finished', (data) => {
  console.log('Tokens:', data.tokenCount);
});

// Emit events
eventBus.emit('command:executed', { commandId: '123', type: 'edit' });
```

New comprehensive documentation in `docs/PROMPT_SYSTEM.md`:

- `getCoreSystemPrompt()` — main system prompt construction
- `getCompressionPrompt()` — history compression to XML
- `getToolCallFormatInstructions()` — for models without native tools
- `getToolLearningContext()` — learning from past mistakes
- `getEnvironmentInfo()` — runtime environment context
Major performance improvement for multi-turn conversations:
| Feature | Description |
|---|---|
| 80-90% Faster | Subsequent messages use cached context tokens |
| KV-cache Reuse | Leverages Ollama's native context caching |
| Auto Endpoint Selection | Switches between /api/generate and /api/chat |
| Session Tracking | Per-session context management |
```typescript
// Enable context caching
const config: ContentGeneratorConfig = {
  model: 'llama3.2',
  enableContextCaching: true, // Key improvement
};

// Performance gains:
// Message 1: 100% (baseline)
// Message 2: ~15% tokens processed (85% cached)
// Message 10: ~7% tokens processed (93% cached)
```

All context caching components are fully tested with 118 tests:
| Component | Tests | Coverage |
|---|---|---|
| ContextCacheManager | 50 | TTL, eviction, concurrency, edge cases |
| OllamaContextClient | 32 | Streaming, errors, session management |
| HybridContentGenerator | 36 | Endpoint selection, token counting |
See docs/CONTEXT_CACHING.md for full API documentation.
| Component | Description |
|---|---|
| Zustand Stores | Replaced Context API for better performance |
| Event Bus | Typed publish/subscribe for loose coupling |
| Command Pattern | Undo/Redo support for reversible operations |
| Plugin System | Dynamic tool loading at runtime |
```typescript
interface ContentGeneratorConfig {
  // Enable context caching for faster conversations
  enableContextCaching?: boolean;
  // Session ID for context tracking
  sessionId?: string;
}
```

The header now provides real-time context usage visualization:
| Feature | Description |
|---|---|
| Token Progress Bar | Visual indicator of context window usage |
| Model Context Size | Shows model's context window (128K, 32K, etc.) |
| Capability Icons | Visual indicators for vision, tools, streaming support |
| Full-Width Display | Progress bar spans full info panel width |
Streamlined CLI by removing unused commands:
- Removed: `/bug`, `/docs`, `/help`, `/setup-github`
- Merged: `/stats` + `/about` → `/info`
- Optimized system prompts for better model performance
- Added tool call format instructions for models without native support
- Fixed progress bar cumulative token tracking
- ES module imports in development tools
The system now automatically learns from tool call errors and creates dynamic aliases:
| Feature | Description |
|---|---|
| Automatic Learning | Records tool call errors and creates aliases automatically |
| Fuzzy Matching | Uses Levenshtein distance to suggest correct tool names |
| Persistence | Learning data saved to ~/.ollama-code/learning/ |
| Dynamic Aliases | Runtime alias creation without code modifications |
How it works:
- Model calls a non-existent tool name → System records the error
- System uses fuzzy matching to find the most similar valid tool
- After threshold reached → Dynamic alias is created
- Future calls with incorrect name → Resolved to correct tool
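The fuzzy-matching step can be sketched with the classic Levenshtein recurrence (illustrative helpers, not the project's actual code):

```typescript
// Classic Levenshtein edit distance via dynamic programming.
function levenshtein(a: string, b: string): number {
  const dp = Array.from({ length: a.length + 1 }, (_, i) =>
    Array.from({ length: b.length + 1 }, (_, j) => (i === 0 ? j : j === 0 ? i : 0)),
  );
  for (let i = 1; i <= a.length; i++) {
    for (let j = 1; j <= b.length; j++) {
      const cost = a[i - 1] === b[j - 1] ? 0 : 1;
      dp[i][j] = Math.min(dp[i - 1][j] + 1, dp[i][j - 1] + 1, dp[i - 1][j - 1] + cost);
    }
  }
  return dp[a.length][b.length];
}

/** Pick the valid tool whose name is closest to the (possibly hallucinated) one. */
function closestTool(requested: string, valid: string[]): string {
  return valid.reduce((best, t) =>
    levenshtein(requested, t) < levenshtein(requested, best) ? t : best,
  );
}

// e.g. "run_shell_cmd" resolves to "run_shell_command" among the valid names.
```

In the real system a match only becomes a dynamic alias after the error threshold is reached, so one-off typos do not pollute the alias table.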
Four new comprehensive development tools have been added:
| Tool | Aliases | Description |
|---|---|---|
| `python_dev` | `py`, `python`, `pip`, `pytest` | Python development (run, test, lint, venv, pip) |
| `nodejs_dev` | `node`, `npm`, `yarn`, `pnpm`, `bun` | Node.js development with auto-detected package manager |
| `golang_dev` | `go`, `golang` | Go development (run, build, test, mod) |
| `php_dev` | `php`, `composer`, `phpunit`, `artisan` | PHP development with Composer and Laravel support |
The model now receives detailed environment information at session start, including:
- Ollama configuration (base URL, model, API key status)
- System information (Node.js version, platform, working directory)
- Debug settings
New comprehensive documentation:
- FEATURES.md - Complete feature reference
- TOOLS.md - Detailed tools reference
- FEATURES.ru.md - Russian feature reference
- TOOLS.ru.md - Russian tools reference
Models can now use short tool names:
| Alias | Tool Name |
|---|---|
| `run`, `shell`, `exec`, `cmd` | `run_shell_command` |
| `read` | `read_file` |
| `write`, `create` | `write_file` |
| `grep`, `search`, `find` | `grep_search` |
| `ls`, `list`, `dir` | `list_directory` |
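Conceptually, alias resolution is just a lookup table from short names to canonical tool names, as the rows above show. A minimal sketch (hypothetical helper, not the actual resolver, which also consults learned aliases):

```typescript
// Static alias table mirroring the rows above.
const TOOL_ALIASES: Record<string, string> = {
  run: "run_shell_command", shell: "run_shell_command", exec: "run_shell_command", cmd: "run_shell_command",
  read: "read_file",
  write: "write_file", create: "write_file",
  grep: "grep_search", search: "grep_search", find: "grep_search",
  ls: "list_directory", list: "list_directory", dir: "list_directory",
};

function resolveToolName(name: string): string {
  return TOOL_ALIASES[name] ?? name; // unknown names pass through unchanged
}
```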
Session ID is now shown in the header for easier debugging and log correlation.
Added startup warning if terminal encoding is not UTF-8.
```tsx
// Progress bar for model downloads
<ProgressBar
  progress={45}
  label="Downloading model"
  speed="5.2 MB/s"
  eta="2m 30s"
/>

// Thinking indicator for reasoning models
<ThinkingIndicator
  message="Analyzing code..."
  elapsedTime={45}
  showContent
/>

// Token usage display
<TokenUsageDisplay
  totalTokens={1500}
  promptTokens={500}
  completionTokens={1000}
  tokensPerSecond={45}
/>
```

```
> Execute SELECT * FROM users LIMIT 10 in SQLite database data.db
> Save database backup to /backup/db.sql
> Show schema of users table
```

```
> Run nginx container on port 8080
> Show logs of my-app container
> Stop all containers
> Build Docker image from current directory
```

```
> Get value of key session:user:123
> Set cache:data with 1 hour expiry
> Publish message to notifications channel
> Show all keys with user: prefix
```

```
ollama-code/
├── packages/
│   ├── core/              # Core: Ollama client, tools, types
│   ├── cli/               # CLI interface based on Ink
│   ├── web-app/           # Web UI: Next.js application (NEW)
│   ├── webui/             # Web components for UI
│   └── sdk-typescript/    # SDK for programmatic use
├── scripts/               # Build and run scripts
├── integration-tests/     # Integration tests
└── docs/                  # Documentation
```
| Guide | Description |
|---|---|
| CLI_GUIDE.md | Complete CLI usage guide |
| CORE_GUIDE.md | Core library developer guide |
| WEB_UI_GUIDE.md | Web UI complete usage guide |
| FEATURES.md | Feature reference |
| TOOLS.md | Tools reference |
| USAGE_GUIDE.md | Usage guide |
| EXAMPLES.md | Usage examples |
| OLLAMA_API.md | API documentation |
| Guide | Description |
|---|---|
| CLI_GUIDE.ru.md | Complete CLI guide (Russian) |
| CORE_GUIDE.ru.md | Core library developer guide (Russian) |
| WEB_UI_GUIDE.ru.md | Complete Web UI guide (Russian) |
| FEATURES.ru.md | Feature reference (Russian) |
| TOOLS.ru.md | Tools reference (Russian) |
| README.ru.md | README in Russian |
| Document | Description |
|---|---|
| WEB_UI.md | Web UI technical docs |
| FEATURES.md | Complete feature reference |
| TOOLS.md | Detailed tools reference |
| USAGE_GUIDE.md | Usage guide |
| EXAMPLES.md | Usage examples |
| OLLAMA_API.md | API documentation |
| Document | Description |
|---|---|
| PROJECT_STRUCTURE.md | Project structure |
| ROADMAP.md | Development roadmap |
| CONTRIBUTING.md | Contribution guidelines |
| Document | Description |
|---|---|
| PLUGIN_SYSTEM.md | Plugin architecture and API |
| PLUGIN_MARKETPLACE.md | Plugin Marketplace usage guide |
| PLUGIN_SANDBOX.md | Plugin security and sandboxing |
| Document | Description |
|---|---|
| PROMPT_SYSTEM_V2.md | Model-size-optimized prompts (NEW) |
| PROMPT_SYSTEM.md | Legacy prompt system docs |
| Command | Description |
|---|---|
| `npm run build` | Build all packages |
| `npm run start` | Run CLI |
| `npm run dev` | Run in development mode |
| `npm run debug` | Run with debugger |
| `npm run test` | Run tests |
| `npm run lint` | Lint code |
| `npm run typecheck` | TypeScript type check |
Options:

```
-d, --debug              Debug mode
-m, --model              Specify model
-s, --sandbox            Run in sandbox
-y, --yolo               Auto-confirm all actions
-f, --file               Load task file to execute
--approval-mode          Approval mode: plan, default, auto-edit, yolo
--experimental-lsp       Enable experimental LSP support
--ollama-base-url        Ollama server URL (default: http://localhost:11434)
--ollama-api-key         API key for remote instances
```
| Variable | Description |
|---|---|
| `OLLAMA_BASE_URL` | Ollama server URL |
| `OLLAMA_API_KEY` | API key (optional) |
| `OLLAMA_MODEL` | Default model |
| `OLLAMA_KEEP_ALIVE` | Model memory retention time (default: 5m) |
| `DEBUG` | Enable debug mode (1 or true) |
| `OLLAMA_CODE_DEBUG_LOG_FILE` | Log to file |
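Reading these variables with their documented defaults might look like this (a hypothetical helper; variable names and defaults are taken from the table above):

```typescript
// Sketch: collect the documented environment variables into a config object.
interface OllamaEnvConfig {
  baseUrl: string;
  apiKey?: string;
  model?: string;
  keepAlive: string;
  debug: boolean;
}

function readEnvConfig(env: Record<string, string | undefined> = process.env): OllamaEnvConfig {
  return {
    baseUrl: env.OLLAMA_BASE_URL ?? "http://localhost:11434",
    apiKey: env.OLLAMA_API_KEY,
    model: env.OLLAMA_MODEL,
    keepAlive: env.OLLAMA_KEEP_ALIVE ?? "5m",
    debug: env.DEBUG === "1" || env.DEBUG === "true",
  };
}
```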
The project includes ready-to-use VSCode debug configurations:
- Open project in VSCode
- Press F5 or select "Run and Debug"
- Choose configuration:
- Debug Ollama Code CLI — basic debugging
- Debug Ollama Code CLI (with args) — with arguments
- Debug Current Test File — debug current test
The project uses native Ollama APIs:
| Endpoint | Method | Description |
|---|---|---|
| `/api/tags` | GET | List local models |
| `/api/show` | POST | Model info |
| `/api/generate` | POST | Text generation |
| `/api/chat` | POST | Chat with model |
| `/api/embed` | POST | Embeddings |
| `/api/create` | POST | Create model |
| `/api/pull` | POST | Download model |
| `/api/ps` | GET | Running models |
| `/api/version` | GET | Ollama version |
Full API docs: OLLAMA_API.md
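For example, a non-streaming call to `/api/generate` can be built like this (the request shape follows the public Ollama API; the helper itself is illustrative):

```typescript
// Sketch: build a request for Ollama's /api/generate endpoint.
function buildGenerateRequest(prompt: string, model = "llama3.2", baseUrl = "http://localhost:11434") {
  return {
    url: `${baseUrl}/api/generate`,
    init: {
      method: "POST",
      headers: { "Content-Type": "application/json" },
      // stream: false returns one JSON object instead of NDJSON chunks
      body: JSON.stringify({ model, prompt, stream: false }),
    },
  };
}

// Usage (requires a running Ollama server):
// const { url, init } = buildGenerateRequest("Explain async/await in one sentence");
// const data = await (await fetch(url, init)).json();
// console.log(data.response); // the generated text
```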
| Model | Purpose | Size |
|---|---|---|
| `llama3.2` | General purpose | 3B |
| `qwen2.5-coder:7b` | Programming | 7B |
| `qwen2.5-coder:14b` | Programming | 14B |
| `qwen3-coder:30b` | Programming | 30B |
| `deepseek-r1:8b` | Reasoning (thinking) | 8B |
| `codellama` | Programming | 7B+ |
| `mistral` | General purpose | 7B |
| `nomic-embed-text` | Embeddings | 274M |
```bash
# Build core
npm run build --workspace=packages/core

# Build cli
npm run build --workspace=packages/cli
```

```bash
# All tests
npm run test

# Core package tests
npm run test --workspace=packages/core

# Integration tests
npm run test:integration:sandbox:none
```

- Create file in `packages/core/src/tools/`
- Implement class extending `BaseDeclarativeTool`
- Export from `index.ts`
- Add alias in `tool-names.ts`
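The four steps can be sketched with a stand-in interface; the real `BaseDeclarativeTool` base class in `packages/core` has a richer API, and the tool below is purely hypothetical:

```typescript
// Stand-in for the tool shape; the actual base class provides schema
// validation, confirmation prompts, and more.
interface Tool {
  name: string;
  description: string;
  execute(params: Record<string, unknown>): Promise<unknown>;
}

// Steps 1-2: new file in packages/core/src/tools/ implementing the tool.
class WordCountTool implements Tool {
  name = "word_count";
  description = "Count words in a string";
  async execute(params: Record<string, unknown>): Promise<unknown> {
    const text = String(params.text ?? "");
    return { count: text.split(/\s+/).filter(Boolean).length };
  }
}

// Step 3: export from index.ts, e.g. export { WordCountTool } from "./wordCount.js";
// Step 4: add an alias in tool-names.ts, e.g. "wc" -> "word_count".
```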
Apache License 2.0
Documentation created with GLM-5 from Z.AI
See CONTRIBUTING.md for contribution guidelines.
