Applied Epistemic Engineering toolkit for AI-assisted development.
Intelligence proposes. Constraints decide. The ledger remembers.
specsmith treats belief systems like code: codable, testable, and deployable. It scaffolds epistemically-governed projects, stress-tests requirements as BeliefArtifacts, runs cryptographically-sealed trace vaults, and orchestrates AI agents under formal AEE governance.
0.10.1 — Governance REST API, machine-state sync, and least-privilege permissions.
Specsmith now serves as the governance backend for Kairos (the epistemically-governed
Rust terminal) via specsmith governance-serve, keeps .specsmith/ JSON in sync
with docs/ Markdown via specsmith sync, and gates every agent tool call via
specsmith agent permissions-check. Multi-agent profiles, BYOE endpoints, and the
AEE phase lifecycle are all fully wired.
specsmith governance-serve --port 7700 # Kairos governance REST API
specsmith sync # sync .specsmith/ from docs/ markdown
specsmith agent permissions-check git_push # check tool permission (REG-012)It also co-installs the standalone epistemic Python library for direct use in any project:
from epistemic import AEESession # works in any Python 3.10+ project
from epistemic import BeliefArtifact, StressTester, CertaintyEngineAEE treats requirements, decisions, and assumptions — the beliefs your project depends on — as engineering artifacts subject to the same discipline as code: version control, testing, and refactoring.
The 4-step core method: Frame → Disassemble → Stress-Test → Reconstruct
The 5 foundational axioms:
- Observability — every belief must be inspectable
- Falsifiability — every belief must be challengeable
- Irreducibility — beliefs decompose to atomic primitives
- Reconstructability — every failed belief can be rebuilt
- Convergence — stress-test + recovery always reaches Equilibrium
specsmith tracks your project through the full AEE development cycle:
🌱 Inception → 🏗 Architecture → 📋 Requirements → ✅ Test Spec
→ ⚙ Implementation → 🔬 Verification → 🚀 Release
specsmith phase # show current phase + readiness checklist
specsmith phase next # advance to the next phase (runs checks first)
specsmith phase set requirements # jump to a specific phase
specsmith phase list # list all phasesThe current phase is persisted in scaffold.yml as aee_phase and displayed in the VS Code
Settings Panel. Each phase has a checklist of file/command criteria, recommended commands,
and a readiness percentage.
Recommended — via the VS Code extension (creates a project-isolated environment):
- Install the specsmith AEE Workbench VS Code extension
- Open
Ctrl+Shift+,(⚙ specsmith Settings) - Click 🔒 Create Environment — creates
~/.specsmith/venv/with specsmith + your provider packages
The extension uses this environment for all agent sessions and terminal commands.
Or via pipx (system-wide):
pipx install specsmith # core CLI + epistemic library
pipx inject specsmith anthropic # + Claude support
pipx inject specsmith openai # + GPT / O-series support
pipx inject specsmith google-generativeai # + Gemini supportOr with pip:
pip install specsmith # core
pip install "specsmith[anthropic]" # + Claude
pip install "specsmith[openai]" # + GPT/O-series
pip install "specsmith[gemini]" # + GeminiUpdate:
pipx upgrade specsmith
specsmith self-update# New project (interactive)
specsmith init
# Adopt an existing project
specsmith import --project-dir ./my-project
# Check governance health
specsmith audit --project-dir ./my-project
# Run AEE stress-test on requirements
specsmith stress-test --project-dir ./my-project
# Full epistemic audit (certainty + logic knots + recovery proposals)
specsmith epistemic-audit --project-dir ./my-project
# Start the agentic REPL
specsmith run --project-dir ./my-project
# AG2 agent shell — Planner/Builder/Verifier over Ollama
specsmith agent status # check agent config + Ollama
specsmith agent plan "add logging" # plan only (no execution)
specsmith agent run "fix lint errors" # full Plan → Build → Verify
specsmith agent improve "add tests" # self-improvement with reports
specsmith agent verify # run Verifier on current state
specsmith agent reports # list improvement reports
# Check current AEE workflow phase
specsmith phase --project-dir ./my-project.specsmith/ always mirrors the human-readable docs/ governance files.
Run specsmith sync after any change to docs/REQUIREMENTS.md or docs/TESTS.md:
specsmith sync # regenerate .specsmith/requirements.json + testcases.json
specsmith sync --check # CI mode: exits 1 if out of sync without writing
specsmith sync --json # emit sync result as JSONspecsmith agent permissions # show active permission profile
specsmith agent permissions-check git_push # check if git_push is allowed
specsmith agent permissions-check git_push --no-log # dry-run (no ledger write)Configure in docs/SPECSMITH.yml:
agent:
permissions:
preset: standard # read_only | standard | extended | admin
# Or custom:
allow: [read_file, write_file, run_shell, git_status]
deny: [git_push, git_create_pr]Kairos is the companion Rust terminal runtime (BitConcepts/kairos). specsmith
acts as the governance backend: Kairos spawns specsmith governance-serve at startup
and routes all preflight and verify calls through it.
# Start the governance REST API (Kairos calls this automatically)
specsmith governance-serve --port 7700 --project-dir .
# Classify a natural-language utterance under Specsmith governance
specsmith preflight "fix the cleanup dry-run regression" --json
# Start the agentic REPL
specsmith run
> what does the cleanup module do? # read-only ask -> answered
> fix the cleanup dry-run regression # change -> Specsmith approves, runs
> delete the entire dist directory # destructive -> needs clarificationHow it works. A natural-language broker classifies intent, infers scope from
your requirements, and asks Specsmith to preflight the request. Only when the
preflight decision is accepted does Nexus drive the AG2 orchestrator — and it does so
through a bounded-retry harness so you can never accidentally run away. By default,
Nexus speaks plain English; toggle /why in the REPL to surface the underlying
requirement, test, and work-item identifiers Specsmith assigned.
Pieces in this repo.
specsmith preflight— CLI subcommand emitting a deterministic governance JSON payload (decision,requirement_ids,test_case_ids,confidence_target,instruction).src/specsmith/agent/broker.py— natural-language broker (intent + scope + narration).src/specsmith/agent/repl.py— Nexus REPL with the/whytoggle and execution gate.docker-compose.yml— pinned vLLMl1-nexusmodel server with the Hermes tool-call parser.scripts/nexus_smoke.py— opt-in live smoke test (NEXUS_LIVE=1to run against a running container).
The specsmith AEE Workbench VS Code extension is the flagship client:
# VS Code: Ctrl+Shift+P → specsmith: New Agent Session
# Settings: Ctrl+Shift+, (⚙ specsmith Settings — global)
# Project Settings: Ctrl+Shift+G (⚙ Project Settings — per-project)
Key features:
- Dual-panel architecture — ⚙ specsmith Settings (global: venv, version, Ollama, system) and ⚙ Project Settings (per-project: scaffold, tools, files, actions, execution)
- Global environment management —
~/.specsmith/venv/with Create / Update / Rebuild / Delete; persistent restart banner; Remove System Installs cleanup button - VCS context at session start — git status + recent commits shown in chat and in system prompt
- Execution profiles — safe / standard / open / admin; custom allow/block command lists
- AEE phase indicator — shows current phase with readiness %, Next Phase button, phase selector
- AI agent sessions — independent process per project, JSONL bridge, chat with file injection
- AG2 agent shell — Planner/Builder/Verifier agents over Ollama in Actions tab
- Agent tab — per-project provider/model/context/iteration config (overrides global defaults)
- Live model listing — Anthropic, OpenAI, Gemini, Mistral, local Ollama (GPU-aware)
- Ollama model catalog — 16 models, 4 tiers, GPU-aware recommendations, filter by installed/available
- Ollama integration — model manager (update/remove/update-all), version check, upgrade
- FPGA/HDL tool support — vivado, gtkwave, vsg, ghdl, verilator, yosys, nextpnr, and 15 more
- Tool installer — scan installed tools; one-click install via winget/brew/apt for missing tools
- API key management — stored in OS credential store (Windows Credential Manager / macOS Keychain)
- Update checker — PyPI version check, auto-checks on panel open, release channel selector
specsmith is open source and built by a small team. Every bit of support helps:
- ⭐ Star specsmith and specsmith-vscode on GitHub
- 📣 Tell your friends and colleagues — word of mouth is our best marketing
- 🐛 Report bugs via GitHub Issues — even small ones help
- 💡 Suggest features via GitHub Discussions — we read every suggestion
- 🔧 Fix bugs and contribute — see CONTRIBUTING.md; PRs welcome
- 📝 Write about specsmith — blog posts, tutorials, and talks help the community grow
- ❤️ Sponsor BitConcepts — directly funds development
specsmith has first-class Ollama support, including:
specsmith ollama gpu # detect GPU and VRAM tier
specsmith ollama available # show catalog filtered by VRAM budget
specsmith ollama available --task code # filter by task type
specsmith ollama pull qwen2.5:14b # download a model
specsmith ollama suggest requirements # task-based recommendations
specsmith ollama list # show installed modelsGPU-aware context sizing in the VS Code extension: 4K/8K/16K/32K tokens based on detected VRAM.
Override with specsmith.ollamaContextLength in VS Code settings.
specsmith supports FPGA-specific project types with full governance:
# scaffold.yml
type: fpga-rtl-amd # or fpga-rtl-intel / fpga-rtl-lattice / fpga-rtl
fpga_tools:
- vivado
- gtkwave
- vsg
- ghdl
- verilatorSupported tools: Synthesis: vivado, quartus, radiant, diamond, gowin. Simulation: ghdl, iverilog, verilator, modelsim, questasim, xsim. Waveform: gtkwave, surfer. Linting: vsg, verible, svlint. Formal: symbiyosys. OSS flow: yosys, nextpnr, openFPGALoader.
Governance: init import audit validate diff upgrade compress doctor export architect
AEE Epistemic: stress-test epistemic-audit belief-graph trace seal/verify/log integrate
Workflow: phase show/set/next/list ledger add/list req list/add/gaps/trace
Agent: run agent run/plan/status/verify/improve/reports agent providers/tools/skills
Ollama: ollama list/available/gpu/pull/suggest
Workspace: workspace init/audit/export
VCS: commit push sync branch pr status
Tools: tools scan [--fpga] tools install <tool> tools rules [--tool] [--list]
Tools: exec ps abort watch optimize credits self-update
Auth: auth set/list/remove/check
Patent: patent search/prior-art
Software: Python CLI/lib/web, Rust, Go, C/C++, .NET, Node.js/TypeScript, mobile, microservices, data/ML.
Hardware/Embedded: FPGA/RTL (Xilinx, Intel, Lattice, generic), Yocto BSP, embedded C/C++.
Documents: Technical specs, research papers, API specs, requirements management.
Business/Legal: Business plans, patent applications, compliance frameworks.
The standalone epistemic Python library works in any Python 3.10+ project — no specsmith coupling:
from epistemic import AEESession, BeliefArtifact, StressTester
session = AEESession("my-project", threshold=0.70)
session.add_belief(
artifact_id="HYP-001",
propositions=["The API always returns valid JSON"],
epistemic_boundary=["Valid auth token required"],
)
session.accept("HYP-001")
result = session.run()
print(result.summary())
# certainty=0.55, failures=2, equilibrium=FalseUse cases: linguistics research, compliance pipelines, AI alignment, patent prosecution.
13 hard rules enforced by specsmith validate:
- H11 — Every loop or blocking wait must have a timeout, fallback exit, and diagnostic message.
- H12 — Windows multi-step automation goes into
.cmdfiles, not inline shell invocations. - H13 — Agent tools must declare epistemic contracts (what they claim and what they cannot detect).
specsmith governs itself — the specsmith repo is a specsmith-managed project. Run specsmith audit
in this repo to check its governance health. This means every feature we add to specsmith is
immediately dogfooded on specsmith itself. The VS Code extension
is developed alongside it as the flagship client.
specsmith.readthedocs.io — Full manual: AEE primer, command reference, project types, tool registry, governance model, Ollama guide, VS Code extension.
MIT — Copyright (c) 2026 BitConcepts, LLC.