autoresearch

Autonomous experiment loop skill for Claude Code. Try ideas, keep what works, discard what doesn't, never stop.

Inspired by karpathy/autoresearch and davebcn87/pi-autoresearch.

Give it an optimization target (test speed, bundle size, training loss, etc.) and it will:

Write benchmark scripts
Loop autonomously — edit code, run benchmark, measure
Keep improvements, revert regressions
Track everything in append-only JSONL

Installation

git clone https://github.com/anthropics/claude-autoresearch.git ~/.claude/skills/autoresearch

Or clone elsewhere and symlink:

git clone https://github.com/anthropics/claude-autoresearch.git ~/projects/claude-autoresearch
ln -sf ~/projects/claude-autoresearch ~/.claude/skills/autoresearch

Usage

In Claude Code, invoke the skill:

/autoresearch optimize test suite speed

Or use natural language:

set up autoresearch for bundle size reduction
run experiments to find the fastest build configuration

The skill will ask clarifying questions (or infer from context), create benchmark scripts, initialize tracking, and start looping.

Architecture

Single CLI at scripts/cli.py with subcommands — inspired by pi-autoresearch's single-file approach:

Command	Purpose
`cli.py init`	Initialize experiment session (writes config to JSONL)
`cli.py run`	Run benchmark with timing, timeout, and optional checks
`cli.py baseline`	Run N baselines, compute variance and significance threshold
`cli.py log`	Record result, git commit/revert, auto-print dashboard
`cli.py state`	Reconstruct experiment state as JSON
`cli.py dashboard`	Print ASCII dashboard with strategy column
`cli.py analyze`	Strategy effectiveness analysis with recommendations
`cli.py history`	Full experiment history (all runs, not truncated)
`cli.py recover`	Diagnose and fix inconsistent state (corrupt JSONL, dirty git)

Running Tests

python3 -m pytest tests/ -v

Requirements

python3 (3.8+)
git

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
evals		evals
references		references
scripts		scripts
tests		tests
.gitignore		.gitignore
README.md		README.md
SKILL.md		SKILL.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

autoresearch

Installation

Usage

Architecture

Running Tests

Requirements

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

autoresearch

Installation

Usage

Architecture

Running Tests

Requirements

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages