specsmith

Applied Epistemic Engineering toolkit for AI-assisted development.

Intelligence proposes. Constraints decide. The ledger remembers.

specsmith treats belief systems like code: codable, testable, and deployable. It scaffolds epistemically-governed projects, stress-tests requirements as BeliefArtifacts, runs cryptographically-sealed trace vaults, and orchestrates AI agents under formal AEE governance.

0.10.1 — Governance REST API, machine-state sync, and least-privilege permissions. Specsmith now serves as the governance backend for Kairos (the epistemically-governed Rust terminal) via specsmith governance-serve, keeps .specsmith/ JSON in sync with docs/ Markdown via specsmith sync, and gates every agent tool call via specsmith agent permissions-check. Multi-agent profiles, BYOE endpoints, and the AEE phase lifecycle are all fully wired.

specsmith governance-serve --port 7700     # Kairos governance REST API
specsmith sync                              # sync .specsmith/ from docs/ markdown
specsmith agent permissions-check git_push # check tool permission (REG-012)

It also co-installs the standalone epistemic Python library for direct use in any project:

from epistemic import AEESession         # works in any Python 3.10+ project
from epistemic import BeliefArtifact, StressTester, CertaintyEngine

What is Applied Epistemic Engineering?

AEE treats requirements, decisions, and assumptions — the beliefs your project depends on — as engineering artifacts subject to the same discipline as code: version control, testing, and refactoring.

The 4-step core method: Frame → Disassemble → Stress-Test → Reconstruct

The 5 foundational axioms:

Observability — every belief must be inspectable
Falsifiability — every belief must be challengeable
Irreducibility — beliefs decompose to atomic primitives
Reconstructability — every failed belief can be rebuilt
Convergence — stress-test + recovery always reaches Equilibrium

The AEE Workflow — 7 Phases

specsmith tracks your project through the full AEE development cycle:

🌱 Inception → 🏗 Architecture → 📋 Requirements → ✅ Test Spec
    → ⚙ Implementation → 🔬 Verification → 🚀 Release

specsmith phase          # show current phase + readiness checklist
specsmith phase next     # advance to the next phase (runs checks first)
specsmith phase set requirements  # jump to a specific phase
specsmith phase list     # list all phases

The current phase is persisted in scaffold.yml as aee_phase and displayed in the VS Code Settings Panel. Each phase has a checklist of file/command criteria, recommended commands, and a readiness percentage.

Install

Recommended — via the VS Code extension (creates a project-isolated environment):

Install the specsmith AEE Workbench VS Code extension
Open Ctrl+Shift+, (⚙ specsmith Settings)
Click 🔒 Create Environment — creates ~/.specsmith/venv/ with specsmith + your provider packages

The extension uses this environment for all agent sessions and terminal commands.

Or via pipx (system-wide):

pipx install specsmith                    # core CLI + epistemic library
pipx inject specsmith anthropic           # + Claude support
pipx inject specsmith openai              # + GPT / O-series support
pipx inject specsmith google-generativeai # + Gemini support

Or with pip:

pip install specsmith                     # core
pip install "specsmith[anthropic]"       # + Claude
pip install "specsmith[openai]"          # + GPT/O-series
pip install "specsmith[gemini]"          # + Gemini

Update:

pipx upgrade specsmith
specsmith self-update

Quick Start

# New project (interactive)
specsmith init

# Adopt an existing project
specsmith import --project-dir ./my-project

# Check governance health
specsmith audit --project-dir ./my-project

# Run AEE stress-test on requirements
specsmith stress-test --project-dir ./my-project

# Full epistemic audit (certainty + logic knots + recovery proposals)
specsmith epistemic-audit --project-dir ./my-project

# Start the agentic REPL
specsmith run --project-dir ./my-project

# AG2 agent shell — Planner/Builder/Verifier over Ollama
specsmith agent status                    # check agent config + Ollama
specsmith agent plan "add logging"        # plan only (no execution)
specsmith agent run "fix lint errors"     # full Plan → Build → Verify
specsmith agent improve "add tests"       # self-improvement with reports
specsmith agent verify                    # run Verifier on current state
specsmith agent reports                   # list improvement reports

# Check current AEE workflow phase
specsmith phase --project-dir ./my-project

Machine State Sync

.specsmith/ always mirrors the human-readable docs/ governance files. Run specsmith sync after any change to docs/REQUIREMENTS.md or docs/TESTS.md:

specsmith sync                     # regenerate .specsmith/requirements.json + testcases.json
specsmith sync --check             # CI mode: exits 1 if out of sync without writing
specsmith sync --json              # emit sync result as JSON

Least-Privilege Agent Permissions (REG-012)

specsmith agent permissions                      # show active permission profile
specsmith agent permissions-check git_push       # check if git_push is allowed
specsmith agent permissions-check git_push --no-log  # dry-run (no ledger write)

Configure in docs/SPECSMITH.yml:

agent:
  permissions:
    preset: standard       # read_only | standard | extended | admin
    # Or custom:
    allow: [read_file, write_file, run_shell, git_status]
    deny:  [git_push, git_create_pr]

Kairos + Governance REST API

Kairos is the companion Rust terminal runtime (BitConcepts/kairos). specsmith acts as the governance backend: Kairos spawns specsmith governance-serve at startup and routes all preflight and verify calls through it.

# Start the governance REST API (Kairos calls this automatically)
specsmith governance-serve --port 7700 --project-dir .

# Classify a natural-language utterance under Specsmith governance
specsmith preflight "fix the cleanup dry-run regression" --json

# Start the agentic REPL
specsmith run
> what does the cleanup module do?           # read-only ask -> answered
> fix the cleanup dry-run regression          # change -> Specsmith approves, runs
> delete the entire dist directory            # destructive -> needs clarification

How it works. A natural-language broker classifies intent, infers scope from your requirements, and asks Specsmith to preflight the request. Only when the preflight decision is accepted does Nexus drive the AG2 orchestrator — and it does so through a bounded-retry harness so you can never accidentally run away. By default, Nexus speaks plain English; toggle /why in the REPL to surface the underlying requirement, test, and work-item identifiers Specsmith assigned.

Pieces in this repo.

specsmith preflight — CLI subcommand emitting a deterministic governance JSON payload (decision, requirement_ids, test_case_ids, confidence_target, instruction).
src/specsmith/agent/broker.py — natural-language broker (intent + scope + narration).
src/specsmith/agent/repl.py — Nexus REPL with the /why toggle and execution gate.
docker-compose.yml — pinned vLLM l1-nexus model server with the Hermes tool-call parser.
scripts/nexus_smoke.py — opt-in live smoke test (NEXUS_LIVE=1 to run against a running container).

VS Code Extension

The specsmith AEE Workbench VS Code extension is the flagship client:

# VS Code: Ctrl+Shift+P → specsmith: New Agent Session
# Settings:         Ctrl+Shift+,     (⚙ specsmith Settings — global)
# Project Settings: Ctrl+Shift+G     (⚙ Project Settings — per-project)

Key features:

Dual-panel architecture — ⚙ specsmith Settings (global: venv, version, Ollama, system) and ⚙ Project Settings (per-project: scaffold, tools, files, actions, execution)
Global environment management — ~/.specsmith/venv/ with Create / Update / Rebuild / Delete; persistent restart banner; Remove System Installs cleanup button
VCS context at session start — git status + recent commits shown in chat and in system prompt
Execution profiles — safe / standard / open / admin; custom allow/block command lists
AEE phase indicator — shows current phase with readiness %, Next Phase button, phase selector
AI agent sessions — independent process per project, JSONL bridge, chat with file injection
AG2 agent shell — Planner/Builder/Verifier agents over Ollama in Actions tab
Agent tab — per-project provider/model/context/iteration config (overrides global defaults)
Live model listing — Anthropic, OpenAI, Gemini, Mistral, local Ollama (GPU-aware)
Ollama model catalog — 16 models, 4 tiers, GPU-aware recommendations, filter by installed/available
Ollama integration — model manager (update/remove/update-all), version check, upgrade
FPGA/HDL tool support — vivado, gtkwave, vsg, ghdl, verilator, yosys, nextpnr, and 15 more
Tool installer — scan installed tools; one-click install via winget/brew/apt for missing tools
API key management — stored in OS credential store (Windows Credential Manager / macOS Keychain)
Update checker — PyPI version check, auto-checks on panel open, release channel selector

→ specsmith-vscode on GitHub

Supporting specsmith

specsmith is open source and built by a small team. Every bit of support helps:

⭐ Star specsmith and specsmith-vscode on GitHub
📣 Tell your friends and colleagues — word of mouth is our best marketing
🐛 Report bugs via GitHub Issues — even small ones help
💡 Suggest features via GitHub Discussions — we read every suggestion
🔧 Fix bugs and contribute — see CONTRIBUTING.md; PRs welcome
📝 Write about specsmith — blog posts, tutorials, and talks help the community grow
❤️ Sponsor BitConcepts — directly funds development

Ollama — Local LLMs (Zero API Cost)

specsmith has first-class Ollama support, including:

specsmith ollama gpu                    # detect GPU and VRAM tier
specsmith ollama available              # show catalog filtered by VRAM budget
specsmith ollama available --task code  # filter by task type
specsmith ollama pull qwen2.5:14b      # download a model
specsmith ollama suggest requirements  # task-based recommendations
specsmith ollama list                  # show installed models

GPU-aware context sizing in the VS Code extension: 4K/8K/16K/32K tokens based on detected VRAM. Override with specsmith.ollamaContextLength in VS Code settings.

FPGA / HDL Projects

specsmith supports FPGA-specific project types with full governance:

# scaffold.yml
type: fpga-rtl-amd          # or fpga-rtl-intel / fpga-rtl-lattice / fpga-rtl
fpga_tools:
  - vivado
  - gtkwave
  - vsg
  - ghdl
  - verilator

Supported tools: Synthesis: vivado, quartus, radiant, diamond, gowin. Simulation: ghdl, iverilog, verilator, modelsim, questasim, xsim. Waveform: gtkwave, surfer. Linting: vsg, verible, svlint. Formal: symbiyosys. OSS flow: yosys, nextpnr, openFPGALoader.

50+ CLI Commands

Governance: init import audit validate diff upgrade compress doctor export architect

AEE Epistemic: stress-test epistemic-audit belief-graph trace seal/verify/log integrate

Workflow: phase show/set/next/list ledger add/list req list/add/gaps/trace

Agent: run agent run/plan/status/verify/improve/reports agent providers/tools/skills

Ollama: ollama list/available/gpu/pull/suggest

Workspace: workspace init/audit/export

VCS: commit push sync branch pr status

Tools: tools scan [--fpga] tools install <tool> tools rules [--tool] [--list]

Tools: exec ps abort watch optimize credits self-update

Auth: auth set/list/remove/check

Patent: patent search/prior-art

35 Project Types

Software: Python CLI/lib/web, Rust, Go, C/C++, .NET, Node.js/TypeScript, mobile, microservices, data/ML.

Hardware/Embedded: FPGA/RTL (Xilinx, Intel, Lattice, generic), Yocto BSP, embedded C/C++.

Documents: Technical specs, research papers, API specs, requirements management.

Business/Legal: Business plans, patent applications, compliance frameworks.

epistemic Library

The standalone epistemic Python library works in any Python 3.10+ project — no specsmith coupling:

from epistemic import AEESession, BeliefArtifact, StressTester

session = AEESession("my-project", threshold=0.70)
session.add_belief(
    artifact_id="HYP-001",
    propositions=["The API always returns valid JSON"],
    epistemic_boundary=["Valid auth token required"],
)
session.accept("HYP-001")
result = session.run()
print(result.summary())
# certainty=0.55, failures=2, equilibrium=False

Use cases: linguistics research, compliance pipelines, AI alignment, patent prosecution.

Governance Rules (H1–H13)

13 hard rules enforced by specsmith validate:

H11 — Every loop or blocking wait must have a timeout, fallback exit, and diagnostic message.
H12 — Windows multi-step automation goes into .cmd files, not inline shell invocations.
H13 — Agent tools must declare epistemic contracts (what they claim and what they cannot detect).

The specsmith Bootstrap

specsmith governs itself — the specsmith repo is a specsmith-managed project. Run specsmith audit in this repo to check its governance health. This means every feature we add to specsmith is immediately dogfooded on specsmith itself. The VS Code extension is developed alongside it as the flagship client.

Documentation

specsmith.readthedocs.io — Full manual: AEE primer, command reference, project types, tool registry, governance model, Ollama guide, VS Code extension.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

specsmith

What is Applied Epistemic Engineering?

The AEE Workflow — 7 Phases

Install

Quick Start

Machine State Sync

Least-Privilege Agent Permissions (REG-012)

Kairos + Governance REST API

VS Code Extension

Supporting specsmith

Ollama — Local LLMs (Zero API Cost)

FPGA / HDL Projects

50+ CLI Commands

35 Project Types

epistemic Library

Governance Rules (H1–H13)

The specsmith Bootstrap

Documentation

Links

License

Uh oh!

FilesExpand file tree

README.md

Latest commit

History

README.md

File metadata and controls

specsmith

What is Applied Epistemic Engineering?

The AEE Workflow — 7 Phases

Install

Quick Start

Machine State Sync

Least-Privilege Agent Permissions (REG-012)

Kairos + Governance REST API

VS Code Extension

Supporting specsmith

Ollama — Local LLMs (Zero API Cost)

FPGA / HDL Projects

50+ CLI Commands

35 Project Types

epistemic Library

Governance Rules (H1–H13)

The specsmith Bootstrap

Documentation

Links

License