Nexus Agents

Governance substrate for your AI coding agents — adversarial review, drift-detected rules, immutable audit, closed-loop telemetry

Why Nexus Agents?

Nexus-agents is a governance layer that sits above your AI coding agents — Claude Code, Codex, Gemini, OpenCode (and, via adapters, Devin / Factory). The agents do the engineering; nexus-agents enforces the rules they have to follow, reviews their work adversarially before it ships, audits everything they touch, and routes the next task based on what actually worked.

What it gives you:

Adversarial PR review — pr_review runs 5 voter roles (architect, security, devex, catfish, scope_steward) with a 4-point verification gate that catches false positives. Empirically: 100% bug-catch on diff-readable bugs, 0% strict false-positive rate
Drift-detected charter — CLAUDE.md + governance:check + blocking CI gates fail the build when documented rules drift from registered behavior (model registry, MCP tools, expert types, skills)
Immutable audit trail — every tool call, every voter decision, every routing choice flows through AuditTrail with structured logging and (in flight) hash-chained append-only storage
Closed-loop routing — OutcomeStore feeds production telemetry back into LinUCB + TOPSIS scoring so the system actually learns from what shipped vs what regressed
Multi-voter consensus — 6 strategies (simple/super-majority, unanimous, higher-order Bayesian, opinion-wise, proof-of-learning) for proposals where one agent's word isn't enough

You:               "Review this PR / orchestrate this task / vote on this proposal"
                    ↓
nexus-agents:       enforce rules → route → adversarial review → audit → learn from outcome
                    ↓
Engineering agents: Claude Code · Codex · Gemini · OpenCode · (Devin / Factory adapters)
                    ↓
Code:               actual edits, tests, PRs, issues

What this is NOT:

Not another autonomous coding agent. OpenHands, SWE-agent, AutoGen, Devin, Factory — those are agents. Nexus-agents is the layer above them. Use whichever agents fit; we govern them
Not a chat framework. Nothing here orchestrates conversations. It orchestrates real CLI tool invocations with real file I/O and outcome tracking
Not a model API proxy. The value is the rules, the gates, the audit, and the learning. Routing is a side effect of the governance work, not the product

Where nexus-agents sits in your stack

   Human / IDE / CLI
   (Claude Code, Cursor, VS Code, terminal)
            │ MCP Protocol
            ▼
  ┌─────────────────────────────────────────────────────┐
  │  GOVERNANCE SUBSTRATE — what nexus-agents provides   │
  │                                                       │
  │   Charter (drift-checked)   Adversarial PR review    │
  │   Role registry             Multi-voter consensus    │
  │   Immutable audit trail     Closed-loop telemetry    │
  │                                                       │
  │   34 MCP tools · 9-stage CompositeRouter             │
  └────────────────────────┬────────────────────────────┘
                           │
                           ▼ delegates execution to
  ┌─────────────────────────────────────────────────────┐
  │  ENGINEERING AGENTS — what does the actual work      │
  │                                                       │
  │   Claude Code · Codex · Gemini · OpenCode            │
  │   (Devin / Factory adapters in flight)               │
  └────────────────────────┬────────────────────────────┘
                           │
                           ▼ produces
                   Code, tests, PRs, issues

The governance substrate is the layer that catches the mistakes engineering agents would otherwise make — bad code shipped, rules drifting from intent, audit gaps, telemetry-free routing — and routes the next task based on what actually worked the last time.

Quick Start (2 minutes)

1. Install

npm install -g nexus-agents

Or as a Claude Code plugin (single-command install from the official marketplace):

/plugin install nexus-agents

See docs/getting-started/PLUGIN_INSTALL.md for plugin-specific setup, or llms-install.md for the short install guide an AI agent can follow.

2. Verify

nexus-agents doctor

3. Use

With Claude Code (recommended):

nexus-agents setup   # Auto-configures MCP server

Then in Claude: "orchestrate: Review this PR for issues"

Standalone CLI:

export ANTHROPIC_API_KEY=your-key
nexus-agents orchestrate "Explain the architecture of this codebase"

Security: In default MCP mode, the server communicates only via stdio with the parent process (no network exposure). The REST API (opt-in) auto-generates an API key on first start. For network-exposed deployments, set NEXUS_AUTH_ENABLED=true. See SECURITY.md.

Capabilities

Category	Details
Adversarial PR Review	`pr_review` MCP tool: 5 voter roles (architect, security, devex, catfish, scope_steward) with 4-point verification gate. 100% bug-catch on diff-readable bugs in v5 evaluation
Consensus Voting	6 strategies: simple_majority, supermajority, unanimous, higher_order (Bayesian correlation-aware), opinion_wise, proof_of_learning
Drift-Detected Charter	`CLAUDE.md` + `inject-governance.ts check` enforces single-source registries (model registry, MCP tools, expert types). Blocking CI gate fails build on drift
Audit Trail	Structured logging for every tool call, voter decision, and routing choice. Hash-chained immutable storage in flight (#2281)
Closed-Loop Telemetry	`OutcomeStore` records production results; LinUCB bandit + TOPSIS scoring + adaptive routing bonuses adjust based on what actually worked. No other framework closes this loop
Security Pipeline	Sandboxing (Docker/policy), trust-tiered input handling, SARIF parsing, red-team patterns, ClawGuard access policies (audit/enforce)
Multi-Expert Orchestration	12 built-in expert types coordinated by Orchestrator. Roles bind prompt + tools + memory
Development Pipeline	Research → Plan → Vote → Decompose → Implement → QA → Security. Three modes: autonomous, harness (caller implements), dry-run
Memory & Learning	5 user-facing backends (session, belief, agentic, adaptive, typed). Cross-session persistence feeds routing decisions
Research System	9 discovery sources (arXiv, GitHub, Semantic Scholar, etc). Auto-catalog, quality scoring, synthesis into topic clusters
Graph Workflows	DAG-based workflow execution with checkpoint/resume, state reduction, and event hooks
34 MCP Tools	Agent management, workflow execution, research, memory, codebase intelligence, repo analysis, consensus, operations

Available Experts

Expert	Specialization
Code	Implementation, debugging, optimization
Architecture	System design, patterns, scalability
Security	Vulnerability analysis, secure coding
Testing	Test strategies, coverage, test generation
QA	Acceptance criteria, regression checks
Documentation	Technical writing, API docs
DevOps	CI/CD, deployment, infrastructure
Research	Literature review, state-of-the-art analysis
PM	Product management, requirements, priorities
UX	User experience, usability, accessibility
Infrastructure	Server management, bare metal, networking
Data Viz	Charts, dashboards, visual data presentation

Supported CLIs & Providers

Nexus-agents routes tasks through 5 CLI adapters, each connecting to major AI providers:

CLI	Provider	Best For
claude	Anthropic (Claude)	Complex reasoning, analysis
gemini	Google (Gemini)	Long context, multimodal
codex	OpenAI (Codex CLI)	Code generation, reasoning
codex-mcp	OpenAI (Codex MCP)	MCP-native Codex integration
opencode	Custom OpenAI-compat	Custom endpoints, local models

CLI Commands

nexus-agents                    # Start MCP server (default)
nexus-agents doctor             # Check installation health
nexus-agents setup              # Configure Claude CLI integration
nexus-agents orchestrate "..."  # Run task with experts
nexus-agents vote "proposal"    # Multi-agent consensus voting
nexus-agents review <pr-url>    # Review a GitHub PR
nexus-agents expert list        # List available experts
nexus-agents workflow list      # List workflow templates
nexus-agents config init        # Generate config file
nexus-agents fitness-audit      # Run fitness score audit
nexus-agents research query     # Query research registry
nexus-agents --help             # Full command list

See docs/ENTRYPOINTS.md for the complete CLI reference (28+ commands).

MCP Tools

When running as an MCP server, the following tools are available:

Tool	Description
`orchestrate`	Task orchestration with Orchestrator coordination
`create_expert`	Create a specialized expert agent
`execute_expert`	Execute a task using a created expert
`run_workflow`	Execute a workflow template
`delegate_to_model`	Route task to optimal model
`list_experts`	List available expert types
`list_workflows`	List available workflow templates
`consensus_vote`	Multi-model consensus voting on proposals
`research_query`	Query research registry (status, overlap, stats, search)
`research_add`	Add paper to registry by arXiv ID
`research_add_source`	Add non-paper source (GitHub repo, tool, blog)
`research_discover`	Discover papers/repos from external sources
`research_analyze`	Analyze registry for gaps, trends, coverage
`research_catalog_review`	Review auto-cataloged research references
`research_synthesize`	Synthesize registry into topic clusters with themes
`memory_query`	Query across all memory backends
`memory_stats`	Memory system statistics dashboard
`memory_write`	Write to typed memory backends
`weather_report`	Multi-CLI performance weather report
`issue_triage`	Triage GitHub issues with trust classification
`run_graph_workflow`	Execute graph-based workflows with checkpointing
`execute_spec`	Execute AI software factory spec pipeline
`registry_import`	Generate draft model registry entry
`query_trace`	Query execution traces for observability
`query_task_state`	Query the structured task-state log for a task ID
`verify_audit_chain`	Verify hash chain of a FileAuditStorage audit log directory
`repo_analyze`	Analyze GitHub repository structure
`repo_security_plan`	Generate security scanning pipeline for a repo
`extract_symbols`	Extract code symbols from source files for analysis
`search_codebase`	Search codebase for patterns, symbols, or text
`run_dev_pipeline`	Full dev pipeline: research, plan, vote, implement, QA
`run_pipeline`	Execute a pipeline plugin by name with typed input
`pr_review`	Multi-voter PR review with verification gate (experimental)
`supply_chain_tradeoff_panel`	Per-axis tradeoff vote for build-vs-buy / supply-chain decisions

Configuration

Environment Variables:

Variable	Description
`ANTHROPIC_API_KEY`	Claude API key
`OPENAI_API_KEY`	OpenAI API key
`GOOGLE_AI_API_KEY`	Gemini API key
`NEXUS_LOG_LEVEL`	Log level (debug/info/warn/error)

Generate config file:

nexus-agents config init   # Creates nexus-agents.yaml

Documentation

Topic	Link
Full CLI Reference	docs/ENTRYPOINTS.md
Architecture	docs/architecture/README.md
Contributing	CONTRIBUTING.md
Coding Standards	CODING_STANDARDS.md
Quick Start Guide	QUICK_START.md

Development

git clone https://github.com/williamzujkowski/nexus-agents.git
cd nexus-agents
pnpm install
pnpm build
pnpm test

Requirements: Node.js 22.x LTS, pnpm 9.x

Contributing

Fork the repository
Create a feature branch (git checkout -b feat/amazing-feature)
Commit with conventional commits (feat(scope): add feature)
Open a Pull Request

See CONTRIBUTING.md for details.

License

MIT - See LICENSE

Built with Claude Code

Name		Name	Last commit message	Last commit date
Latest commit History 2,411 Commits
.changeset		.changeset
.claude-plugin		.claude-plugin
.claude		.claude
.github		.github
.husky		.husky
.rules		.rules
agents		agents
artifacts		artifacts
docs		docs
hooks		hooks
logs/run_evaluation		logs/run_evaluation
packages		packages
scripts		scripts
skills		skills
test-fixtures		test-fixtures
testing		testing
website		website
.dockerignore		.dockerignore
.gitattributes		.gitattributes
.gitignore		.gitignore
.gitleaks.toml		.gitleaks.toml
.markdownlint.json		.markdownlint.json
.mcp.json.example		.mcp.json.example
.prettierignore		.prettierignore
.prettierrc		.prettierrc
AGENTS.md		AGENTS.md
ARCHITECTURE.md		ARCHITECTURE.md
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
CODEOWNERS		CODEOWNERS
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CODING_STANDARDS.md		CODING_STANDARDS.md
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile.npm-verify		Dockerfile.npm-verify
Dockerfile.opencode		Dockerfile.opencode
Dockerfile.sandbox		Dockerfile.sandbox
ECOSYSTEM.md		ECOSYSTEM.md
LICENSE		LICENSE
Makefile		Makefile
NOTICE		NOTICE
QUICK_START.md		QUICK_START.md
README.md		README.md
SECURITY.md		SECURITY.md
commitlint.config.ts		commitlint.config.ts
cspell.json		cspell.json
eslint.config.js		eslint.config.js
llms-install.md		llms-install.md
lychee.toml		lychee.toml
package.json		package.json
pnpm-lock.yaml		pnpm-lock.yaml
pnpm-workspace.yaml		pnpm-workspace.yaml
tsconfig.json		tsconfig.json
turbo.json		turbo.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Nexus Agents

Why Nexus Agents?

Where nexus-agents sits in your stack

Quick Start (2 minutes)

1. Install

2. Verify

3. Use

Capabilities

Available Experts

Supported CLIs & Providers

CLI Commands

MCP Tools

Configuration

Documentation

Development

Contributing

License

About

Uh oh!

Releases 60

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Nexus Agents

Why Nexus Agents?

Where nexus-agents sits in your stack

Quick Start (2 minutes)

1. Install

2. Verify

3. Use

Capabilities

Available Experts

Supported CLIs & Providers

CLI Commands

MCP Tools

Configuration

Documentation

Development

Contributing

License

About

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 60

Uh oh!

Contributors

Uh oh!

Languages