Skip to content

AI4Unow/claude-agents

Repository files navigation

Modal.com Self-Improving Agents

Multi-agent system using the II Framework (Information & Implementation) deployed on Modal.com. Agents read instructions from Modal Volume, execute tasks, and can self-improve by rewriting their instructions based on experience. Now fully migrated to the Claude Agents SDK.

Status

Phase: Production MVP Deploy URL: https://duc-a-nguyen--claude-agents-telegramchatagent-app.modal.run Last Updated: Jan 5, 2026

Key Features

  • 11 circuit breakers (claude, exa, tavily, firebase, qdrant, telegram, gemini, evolution, google_calendar, google_tasks, apple_caldav)
  • Execution tracing with tool-level timing (10% success, 100% error)
  • Self-improvement loop with Telegram admin approval (HITL)
  • 102 skills (14 local, 73 remote, 15 hybrid deployment)
  • Gemini API integration (deep research, grounding, vision, thinking)
  • Smart FAQ system with hybrid keyword+semantic matching
  • Web Dashboard (React + Cloudflare Workers)
  • Smart Task Management (NLP parsing, bidirectional calendar sync, React dashboard)
  • Claude Agents SDK (hooks, tools, checkpointing, autonomous agents)
  • MCP Servers (Firebase, Qdrant, Google Search)
  • Gemini embeddings (gemini-embedding-001, 3072 dimensions)
  • Firebase Storage for research reports + content downloads (24h links)
  • User tier system (guest, user, developer, admin)
  • Command Router pattern (decorator-based registration)
  • PKM Second Brain (capture, organize, semantic search)
  • WhatsApp Evolution API integration
  • Bidirectional sync with Google/Apple Calendar & Tasks
  • Smart timing & behavior-based reminder optimization
  • Auto-scheduler with multi-skill orchestration (DAG validation)
  • Stress test framework (Locust + chaos engineering)

Agents

Agent Purpose Trigger
Telegram Chat Primary user interface via Telegram Webhook (always-on)
GitHub Repository automation Cron (hourly) + webhook
Data Data processing & analytics Scheduled (daily)
Content Content generation & transformation On-demand

Technology Stack

  • Runtime: Modal.com (Python 3.11, serverless)
  • Web Framework: FastAPI
  • AI: Anthropic Claude (kiro-claude-opus-4-5), Google Gemini (2.5-flash-lite)
  • Protocol: Model Context Protocol (MCP)
  • Embeddings: Gemini gemini-embedding-001 (3072 dim)
  • Vector Memory: Qdrant Cloud
  • State Store: Firebase Firestore + Storage
  • Chat Platform: Telegram Bot + WhatsApp Evolution
  • Web Search: Exa (primary) + Gemini Grounding (fallback)

Quick Start

# Install Modal CLI
pip install modal
modal setup

# Deploy
cd agents
modal deploy main.py

# View logs
modal app logs claude-agents

API Endpoints

Endpoint Method Purpose
/health GET Health check with circuit status
/webhook/telegram POST Telegram bot webhook
/webhook/whatsapp POST WhatsApp Evolution API webhook
/webhook/github POST GitHub webhook
/api/skill POST Execute skill (simple/routed/orchestrated)
/api/skills GET List available skills
/api/task/{id} GET Get local task status
/api/reports GET List user research reports
/api/reports/{id} GET Get report download URL
/api/reports/{id}/content GET Get report content
/api/traces GET Execution traces (developer+)
/api/circuits GET Circuit breaker status

Telegram Commands

Command Tier Description
/start All Welcome message
/help All Show available commands
/status All Check status and tier
/skills All Browse skills (menu)
/skill <name> <task> All Execute skill
`/mode <simple routed auto>`
/task <id> User+ Check task status
/traces Developer+ Recent execution traces
/circuits Developer+ Circuit breaker status
/grant <id> <tier> Admin Grant user tier
/revoke <id> Admin Revoke user access

Skill Architecture

102 skills with deployment types:

  • Local (14): canvas-design, docx, xlsx, pptx, pdf, media-processing, image-enhancer, video-downloader, mcp-builder, etc.
  • Remote (73): planning, debugging, research, code-review, gemini-*, backend-dev, frontend-dev, mobile-dev, ui-ux-pro-max, ai-multimodal, automation skills
  • Hybrid (15): repomix, sequential-thinking, better-auth, chrome-devtools, mcp-management, webapp-testing, skill-share, worktree-manager, gemini-vision, etc.

Gemini Skills

Skill Description
gemini-deep-research Multi-step agentic research with citations
gemini-grounding Real-time factual queries via Google Search
gemini-thinking Configurable reasoning depth
gemini-vision Image and document analysis

Project Structure

web-ui/                      # React frontend application
ui-service/                  # Cloudflare Worker API gateway
agents/
├── main.py                    # Modal app entry point (~2,430 lines)
├── api/                       # FastAPI routes (modular)
├── commands/                  # Command Router pattern
├── src/
│   ├── agents/                # Agent implementations (4)
│   ├── sdk/                   # Claude Agents SDK (Hooks, Tools, Agent)
│   ├── mcp/                   # MCP Server implementations
│   ├── services/              # External integrations
│   │   ├── firebase/          # Modular Firebase (14 modules)
│   │   ├── gemini.py          # Gemini API client
│   │   ├── google_calendar.py # Google Calendar integration
│   │   ├── google_tasks.py    # Google Tasks integration
│   │   └── apple_caldav.py    # Apple CalDAV integration
│   ├── tools/                 # Tool system (10+ tools)
│   └── core/                  # II Framework (Legacy Core - Deprecated)
├── skills/                    # 102 skill info.md files
├── scripts/                   # Utility scripts
└── tests/                     # 50+ test files (unit + e2e + stress)
docs/
├── project-overview-pdr.md
├── system-architecture.md
├── codebase-summary.md        # Comprehensive overview
├── code-standards.md
├── project-roadmap.md
└── deployment-guide.md

Secrets Required

modal secret create anthropic-credentials ANTHROPIC_API_KEY=...
modal secret create telegram-credentials TELEGRAM_BOT_TOKEN=...
modal secret create firebase-credentials FIREBASE_PROJECT_ID=... FIREBASE_CREDENTIALS_JSON=...
modal secret create qdrant-credentials QDRANT_URL=... QDRANT_API_KEY=...
modal secret create exa-credentials EXA_API_KEY=...
modal secret create tavily-credentials TAVILY_API_KEY=...
modal secret create admin-credentials ADMIN_TELEGRAM_ID=... ADMIN_API_TOKEN=...
modal secret create gcp-credentials GCP_PROJECT_ID=... GOOGLE_APPLICATION_CREDENTIALS_JSON=...

Documentation

License

Private repository.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors