LLM Council Audit & Consensus Platform

From idea → plan → implementation.

📋 Documentation

Essential Reading:

VISION.md — Project vision and current status
PRD_MVP.md — MVP requirements and implementation status
ARCHITECTURE.md — Technical architecture and design

Validation & Status:

VALIDATION_SUMMARY.md — Current validation results (13/13 tests passing)
VALIDATION_README.md — How to run system validation

Complete Documentation:

docs/README.md — Full documentation index and navigation

🚪 Quality Gates

Each document must pass consensus from our LLM council (PM, Infrastructure, Data/Eval, Security, UX, Cost auditors) before proceeding. Strategic documents require human review, while technical documents can auto-approve with high consensus:

Research Brief → Market Scan: Problem validated, research plan approved (human review)
Market Scan → Vision: BUILD decision justified, market opportunity clear (human review)
Vision → PRD: Success metrics quantified, MVP scope realistic (human review)
PRD → Architecture: All requirements mapped, no critical gaps (human review)
Architecture → Implementation: Design complete, tasks traceable (auto-approve if consensus)

Human Review Triggers:

Strategic documents (Research, Market, Vision, PRD)
Low consensus between auditors (stdev > 1.0)
Blocking issues detected
Consensus deadlock after 3 attempts

🛠️ Setup

Install dependencies:
```
pip install -r requirements.txt
```

Set up environment:

export OPENAI_API_KEY="your-key-here"
# Optional: export ANTHROPIC_API_KEY="your-key-here"
# Or configure API keys in the Web UI Settings page

Initialize your project documents:

python scripts/init_docs.py --project "Your Project" --owner "Your Name"
# Or run interactively: python scripts/init_docs.py

Run the LLM council audit:

# Required: choose a stage to audit (e.g., vision, prd, architecture)
python audit.py ./docs --stage vision --interactive

# Enable research context (Vision stage):
python audit.py ./docs --stage vision --interactive --research-context

# Enable council debate mode (multi-round discussion between roles):
python audit.py ./docs --stage prd --council-debate

# Use a specific template and model:
python audit.py ./docs --stage architecture \
  --template config/templates/software_mvp.yaml \
  --model gpt-4o

Keep each file brief. If a section is not applicable, write N/A (why) rather than leaving it blank. Use markdown features (tables, task lists, code blocks) for clarity.

🌐 Web UI

Two ways to run the UI:

Development (recommended):
- Backend: python audit.py ui --debug
- Frontend: cd frontend && npm install && npm run dev (Vite dev server at http://localhost:3000, proxying /api and /ws to http://localhost:8000)
Production-like:
- Build: cd frontend && npm install && npm run build
- Start backend: python audit.py ui
- The server serves frontend/dist automatically at / and assets at /assets.

If frontend/dist is missing, the server shows a short message with setup instructions at /.

Note: Ensure Tailwind CSS is configured with tailwind.config.js and postcss.config.js in the frontend directory for proper styling.

🔌 HTTP API (Projects & Runs)

Start a run (preferred resource model):

curl -X POST http://localhost:8000/api/projects/my-project/runs
-H 'Content-Type: application/json'
-d '{"stage":"vision","model":"gpt-4o"}'

Response: { "success": true, "data": { "runId": "...", "startedAt": 172495... }, "timestamp": ... }
Get run snapshot:

curl http://localhost:8000/api/projects/my-project/runs/

Response data matches shared schema (camelCase): { "pipeline": { "documents": [...], "overallStatus": "running", ... }, "metrics": { ... } }
Legacy aliases (still supported during migration):
- POST /api/audits (body includes docsPath)
- GET /api/audits/{auditId}
Config:
- GET /api/templates → list available templates
- GET /api/quality-gates → returns config/quality_gates.yaml (if present)

✅ Testing & Coverage

Backend (pytest): 90 tests passing
- PYTHONPATH=src pytest --maxfail=1 --disable-warnings -q
- With coverage: PYTHONPATH=src pytest --cov=src/llm_council --cov-report=term-missing
Frontend (Vitest):
- cd frontend && npm run test
- With coverage/UI: npm run test:coverage or npm run test:ui

Current status: Core modules (schemas, consensus, alignment, templates, cache, research, human review) have 100% test coverage. CLI and integration tests have some remaining import issues being resolved.

Targets: backend ≥85%, frontend ≥80%. See docs/TEST_PLAN.md and docs/TRACEABILITY_MATRIX.md.

E2E (Playwright)

Prereqs: backend running on :8000 and Playwright browsers installed.
One-time install: cd frontend && npx playwright install --with-deps
Run tests: npm run e2e (or npm run e2e:headed)
- The config in frontend/playwright.config.ts auto-starts the Vite dev server on :3000 and points tests at that URL.
- Backend stays a separate process (see Web UI section above).
Tip for fast, offline-friendly runs: before starting the backend, export AUDIT_MAX_SECONDS=2 to cap each audit and avoid long model calls.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LLM Council Audit & Consensus Platform

📋 Documentation

🚪 Quality Gates

🛠️ Setup

🌐 Web UI

🔌 HTTP API (Projects & Runs)

✅ Testing & Coverage

E2E (Playwright)

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
.github/workflows		.github/workflows
config		config
docs		docs
frontend		frontend
scripts		scripts
shared		shared
src		src
tests		tests
.editorconfig		.editorconfig
.gitignore		.gitignore
README.md		README.md
VALIDATION_README.md		VALIDATION_README.md
VALIDATION_SUMMARY.md		VALIDATION_SUMMARY.md
audit.py		audit.py
pytest.ini		pytest.ini
requirements.txt		requirements.txt
run-validation.sh		run-validation.sh
verify_implementation.py		verify_implementation.py

Folders and files

Latest commit

History

Repository files navigation

LLM Council Audit & Consensus Platform

📋 Documentation

🚪 Quality Gates

🛠️ Setup

🌐 Web UI

🔌 HTTP API (Projects & Runs)

✅ Testing & Coverage

E2E (Playwright)

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages