Spec uncertainty engine + repoint ledger to @rafters by ssilvius · Pull Request #104 · rafters-studio/platform

ssilvius · 2026-04-27T02:59:38Z

Summary

Spec for the uncertainty engine -- the AI honesty layer for the studio. Generic input -> prediction -> outcome ledger; per-cohort Brier scores and reliability diagrams; five-state lifecycle (emitted, witnessed, calibrated, orphaned, retired). See docs/designs/uncertainty-engine.md.
Repoint @ezmode-games/drizzle-ledger -> @rafters/ledger@0.2.0. The CLAUDE.md rewrite (3c6e23b) missed apps/web/src/api/auth.ts and the package.json declarations, leaving main with a broken typecheck. Restored to green.
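
To make the calibration side of the summary concrete, here is a minimal sketch of a per-cohort Brier score and the bins behind a reliability diagram. It is not taken from docs/designs/uncertainty-engine.md: the `WitnessedPrediction` shape, the `cohort` key, and the ten-bin default are assumptions for illustration only.

```typescript
// Hypothetical row shape; the real schema is defined by the spec, not here.
interface WitnessedPrediction {
  cohort: string;     // assumed grouping key, e.g. a model or feature name
  confidence: number; // predicted probability, assumed to lie in [0, 1]
  outcome: 0 | 1;     // 1 if the prediction was borne out, 0 otherwise
}

// Brier score: mean squared gap between confidence and outcome (lower is better).
function brierScore(rows: WitnessedPrediction[]): number {
  if (rows.length === 0) return NaN;
  const sum = rows.reduce((acc, r) => acc + (r.confidence - r.outcome) ** 2, 0);
  return sum / rows.length;
}

// Group rows by cohort and score each group independently.
function brierByCohort(rows: WitnessedPrediction[]): Map<string, number> {
  const groups = new Map<string, WitnessedPrediction[]>();
  for (const r of rows) {
    const group = groups.get(r.cohort) ?? [];
    group.push(r);
    groups.set(r.cohort, group);
  }
  const scores = new Map<string, number>();
  for (const [cohort, group] of groups) scores.set(cohort, brierScore(group));
  return scores;
}

interface ReliabilityBin {
  lower: number;          // inclusive lower edge of the confidence bin
  upper: number;          // upper edge of the confidence bin
  count: number;          // predictions that fell in this bin
  meanConfidence: number; // average stated confidence in the bin
  observedRate: number;   // fraction of those predictions that came true
}

// Equal-width bins; empty bins are dropped so the caller only plots real data.
function reliabilityBins(rows: WitnessedPrediction[], binCount = 10): ReliabilityBin[] {
  const acc = Array.from({ length: binCount }, () => ({ count: 0, confSum: 0, hits: 0 }));
  for (const r of rows) {
    // Clamp so that confidence === 1 lands in the last bin.
    const i = Math.min(binCount - 1, Math.floor(r.confidence * binCount));
    acc[i].count += 1;
    acc[i].confSum += r.confidence;
    acc[i].hits += r.outcome;
  }
  return acc
    .map((b, i) => ({
      lower: i / binCount,
      upper: (i + 1) / binCount,
      count: b.count,
      meanConfidence: b.count ? b.confSum / b.count : NaN,
      observedRate: b.count ? b.hits / b.count : NaN,
    }))
    .filter((b) => b.count > 0);
}
```

A well-calibrated cohort has `observedRate` close to `meanConfidence` in every bin. The Brier score compresses the same information into a single number per cohort.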

The institutional read

Other agents build models. Platform audits them. Every confidence number Rafters publishes is grounded in a published reliability record. Same ethic as publishing the revenue split.

The orphan state is load-bearing: silence is its own state, scored separately, surfaced as a metric. Treating silence as agreement poisons calibration upward; treating it as disagreement poisons it downward. Naming it removes the choice.
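
The claim about silence can be checked with a small worked example. The sketch below uses hypothetical outcome labels (`confirmed`, `refuted`) and a hypothetical `SilencePolicy` type, none of which come from the spec. It only illustrates how each treatment of orphans moves the observed hit rate.

```typescript
// Hypothetical labels: "confirmed" and "refuted" stand in for witnessed outcomes.
type Outcome = "confirmed" | "refuted" | "orphaned";

// How to treat a prediction nobody ever witnessed.
type SilencePolicy = "agree" | "disagree" | "separate";

// Observed hit rate under a given policy for orphaned predictions.
function observedRate(outcomes: Outcome[], policy: SilencePolicy): number {
  let hits = 0;
  let total = 0;
  for (const o of outcomes) {
    if (o === "orphaned") {
      if (policy === "separate") continue; // excluded from the calibration denominator
      total += 1;
      if (policy === "agree") hits += 1;   // silence counted as a hit
    } else {
      total += 1;
      if (o === "confirmed") hits += 1;
    }
  }
  return total === 0 ? NaN : hits / total;
}

// Share of predictions that were never witnessed, reported as its own metric.
function orphanRate(outcomes: Outcome[]): number {
  if (outcomes.length === 0) return NaN;
  return outcomes.filter((o) => o === "orphaned").length / outcomes.length;
}

// Ten predictions: 5 confirmed, 1 refuted, 4 never witnessed.
const sample: Outcome[] = [
  "confirmed", "confirmed", "confirmed", "confirmed", "confirmed",
  "refuted",
  "orphaned", "orphaned", "orphaned", "orphaned",
];

console.log(observedRate(sample, "agree"));    // 9 of 10 counted as hits
console.log(observedRate(sample, "disagree")); // 5 of 10 counted as hits
console.log(observedRate(sample, "separate")); // 5 of 6 witnessed predictions
console.log(orphanRate(sample));               // 4 of 10 orphaned
```

With the same ten predictions, the hit rate ranges from 0.5 to 0.9 depending only on how silence is counted. Scoring orphans separately keeps the calibration figure tied to witnessed outcomes and leaves the orphan rate visible beside it.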

Phase 1 work breakdown

Issues #97-#103:

#97 D1 migration (uncertainty_prediction, uncertainty_calibration_snapshot)
#98 Drizzle schema + zod companions
#99 Hono routes (emit, witness, calibration, orphans)
#100 Orphan sweep cron (hourly)
#101 Calibration roll cron + Brier/reliability SQL (nightly)
#102 rafters integration (color naming) + service binding flip from ezmode-api to platform
#103 ctrl debug page for calibration snapshots
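
As a rough illustration of what the hourly orphan sweep (#100) might do, the sketch below marks stale `emitted` predictions as `orphaned`. The five state names come from this PR. Everything else is assumed: the `PredictionRow` shape, the `ttlMs` cutoff, and the rule that only `emitted` rows can become orphans. The real job would run against D1 rather than an in-memory array.

```typescript
// The five lifecycle states named in the spec summary.
type LifecycleState = "emitted" | "witnessed" | "calibrated" | "orphaned" | "retired";

// Hypothetical row shape for illustration; not the actual uncertainty_prediction schema.
interface PredictionRow {
  id: string;
  state: LifecycleState;
  emittedAt: number; // epoch milliseconds
}

// Return a new array in which every prediction still in "emitted" and older
// than ttlMs is moved to "orphaned". Rows in any other state are left untouched.
function sweepOrphans(rows: PredictionRow[], now: number, ttlMs: number): PredictionRow[] {
  return rows.map((row): PredictionRow =>
    row.state === "emitted" && now - row.emittedAt > ttlMs
      ? { ...row, state: "orphaned" }
      : row,
  );
}

const HOUR_MS = 60 * 60 * 1000;
const now = 100 * HOUR_MS;

const swept = sweepOrphans(
  [
    { id: "old-emitted", state: "emitted", emittedAt: now - 48 * HOUR_MS },
    { id: "fresh-emitted", state: "emitted", emittedAt: now - 1 * HOUR_MS },
    { id: "old-witnessed", state: "witnessed", emittedAt: now - 48 * HOUR_MS },
  ],
  now,
  24 * HOUR_MS, // assumed 24-hour window before silence counts as an orphan
);

console.log(swept.map((r) => `${r.id}:${r.state}`));
```

Keeping the sweep a pure function of `(rows, now, ttlMs)` makes it easy to unit test without a database. The cron handler would then only need to load candidates and persist the changed rows.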

Test plan

Spec reads cleanly; tables render in GitHub
pnpm typecheck passes (verified locally)
pnpm install clean against @rafters/ledger@0.2.0
better-auth ledgerPlugin import resolves at runtime (smoke test on next deploy)

…rtainty engine

The CLAUDE.md rewrite (3c6e23b) updated some ledger imports but missed apps/web/src/api/auth.ts and the package.json declarations, leaving the workspace in a broken typecheck state -- the old package no longer resolves on the registry. Swap to @rafters/ledger across auth.ts, apps/web/package.json, and the root onlyBuiltDependencies list. Typecheck restored to green.

Add docs/designs/uncertainty-engine.md -- design for the AI honesty layer. Generic input -> prediction -> outcome ledger, per-cohort Brier and reliability diagrams, five-state lifecycle (emitted, witnessed, calibrated, orphaned, retired). Phase 1 unblocks rafters by replacing the ezmode CORE_API ghost binding with a real platform endpoint.

Issues #97-103 break out the Phase 1 work.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

Local lint, format:check, and frozen-lockfile install all pass clean on this commit. CI failure on the prior run looks transient. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

ssilvius and others added 2 commits April 26, 2026 19:52

Re-run CI

ff969a5

ssilvius merged commit beff9d7 into main Apr 27, 2026
1 check failed

ssilvius merged 2 commits into main from feat/uncertainty-engine-spec
