feat: autoresearch harness for SNAG optimization by realityinspector · Pull Request #26 · timepointai/timepoint-pro

realityinspector · 2026-03-16T14:43:27Z

Summary

Adds autoresearch/ directory with Karpathy-style optimization loop for Pro simulation parameters.

pro_autoresearch.py — main loop: mutate config → run → evaluate → keep/discard
config_space.py — 27 mutable dimensions across 6 mechanism clusters
metrics.py — Causal Resolution metric, dry-run synthetic metrics
pareto.py — Pareto frontier analysis

Purely additive — no existing files modified. Safe to merge.

Merge intent

This is the base branch. Once merged, the 8 cluster branches below can PR their result files in:

autoresearch/pro/fidelity → results + issue Autoresearch Pro-1: Fidelity optimization findings (M1/M2/M5/M6) #18
autoresearch/pro/temporal → results + issue Autoresearch Pro-2: Temporal mode optimization findings (M17) #21
autoresearch/pro/knowledge → results + issue Autoresearch Pro-3: Knowledge provenance findings (M3/M4/M19) #22
autoresearch/pro/entities → results + issue Autoresearch Pro-4: Entity simulation findings (M9-M16) #19
autoresearch/pro/models → results + issue Autoresearch Pro-5: Model routing optimization findings (M18) #20
autoresearch/pro/dialog → results + issue Autoresearch Pro-6: Dialog quality findings (M10/M11) #23
autoresearch/pro/generalize → results + issue Autoresearch Pro-7: Cross-template generalization findings #25
autoresearch/pro/tdf-training → results + issue Autoresearch Pro-8: TDF + training data quality findings #24

DO NOT MERGE: feat/pro/pytorch-backend — modifies tensors.py, needs venv testing first.

Test plan

python3 -m autoresearch.pro_autoresearch --dry-run --iterations 10 completes <1s
Verify no import conflicts with Pro's existing modules

Karpathy-style autoresearch loop that mutates Hydra config parameters across 27 dimensions (6 mechanism clusters), runs simulations, extracts quality metrics (Causal Resolution, coherence, plausibility), and identifies Pareto-optimal configs via quality vs cost tradeoffs. Supports --dry-run mode with deterministic synthetic metrics for testing the mutation/selection loop without API calls.

seanfromthepast approved these changes Mar 16, 2026

View reviewed changes

seanfromthepast merged commit a927d6e into main Mar 16, 2026
1 of 5 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: autoresearch harness for SNAG optimization#26

feat: autoresearch harness for SNAG optimization#26
seanfromthepast merged 1 commit intomainfrom
autoresearch/pro/harness

realityinspector commented Mar 16, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

realityinspector commented Mar 16, 2026

Summary

Merge intent

Test plan

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants