volkan/claude-autoresearch

autoresearch

Autonomous experiment loop skill for Claude Code. Try ideas, keep what works, discard what doesn't, never stop.

Inspired by karpathy/autoresearch and davebcn87/pi-autoresearch.

Give it an optimization target (test speed, bundle size, training loss, etc.) and it will:

  1. Write benchmark scripts
  2. Loop autonomously: edit code, run the benchmark, measure the result
  3. Keep improvements, revert regressions
  4. Track everything in append-only JSONL
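The keep-or-revert rule and the append-only log in the steps above can be sketched roughly as follows. This is an illustration only, not the skill's actual code: the function names (`append_record`, `load_records`, `decide`, `run_loop`) and the record fields are hypothetical, and the sketch assumes a lower-is-better metric such as run time.

```python
import json
from typing import Dict, List, Optional


def append_record(path: str, record: Dict) -> None:
    """Append one experiment record as a single JSON line.

    Earlier lines are never rewritten, which is what makes the log append-only.
    """
    with open(path, "a", encoding="utf-8") as f:
        f.write(json.dumps(record) + "\n")


def load_records(path: str) -> List[Dict]:
    """Read every record back from the log, skipping blank lines."""
    with open(path, encoding="utf-8") as f:
        return [json.loads(line) for line in f if line.strip()]


def decide(best: Optional[float], value: float) -> str:
    """Return 'keep' if value beats the best so far (lower is better), else 'revert'."""
    return "keep" if best is None or value < best else "revert"


def run_loop(path: str, measurements: List[float]) -> Optional[float]:
    """Apply the keep/revert rule to a sequence of measurements, logging each one.

    In a real session each measurement would come from running the benchmark
    after a code edit, and 'keep'/'revert' would map to a git commit or revert.
    """
    best: Optional[float] = None
    for i, value in enumerate(measurements):
        status = decide(best, value)
        if status == "keep":
            best = value
        append_record(path, {"run": i, "value": value, "status": status})
    return best
```

Because every run is logged, including the reverted ones, the full history can be replayed later from the file alone.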

Installation

git clone https://github.com/anthropics/claude-autoresearch.git ~/.claude/skills/autoresearch

Or clone elsewhere and symlink:

git clone https://github.com/anthropics/claude-autoresearch.git ~/projects/claude-autoresearch
ln -sf ~/projects/claude-autoresearch ~/.claude/skills/autoresearch

Usage

In Claude Code, invoke the skill:

/autoresearch optimize test suite speed

Or use natural language:

set up autoresearch for bundle size reduction
run experiments to find the fastest build configuration

The skill will ask clarifying questions (or infer from context), create benchmark scripts, initialize tracking, and start looping.

Architecture

Single CLI at scripts/cli.py with subcommands — inspired by pi-autoresearch's single-file approach:

Command            Purpose
cli.py init        Initialize experiment session (writes config to JSONL)
cli.py run         Run benchmark with timing, timeout, and optional checks
cli.py baseline    Run N baselines, compute variance and significance threshold
cli.py log         Record result, git commit/revert, auto-print dashboard
cli.py state       Reconstruct experiment state as JSON
cli.py dashboard   Print ASCII dashboard with strategy column
cli.py analyze     Strategy effectiveness analysis with recommendations
cli.py history     Full experiment history (all runs, not truncated)
cli.py recover     Diagnose and fix inconsistent state (corrupt JSONL, dirty git)
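
As a rough illustration of what the baseline step might compute, the sketch below derives a variance and a significance threshold from N baseline measurements. The README does not document the formula cli.py uses, so the rule here (an improvement counts only if it exceeds `k` sample standard deviations) and the names `baseline_stats`, `is_significant`, and `k` are assumptions. A lower-is-better metric is assumed.

```python
import statistics
from typing import Dict, List


def baseline_stats(samples: List[float], k: float = 2.0) -> Dict[str, float]:
    """Summarize repeated baseline runs.

    The threshold of k standard deviations is an assumed rule, not the
    documented behavior of cli.py baseline.
    """
    if len(samples) < 2:
        raise ValueError("need at least two baseline runs to estimate variance")
    mean = statistics.mean(samples)
    stdev = statistics.stdev(samples)  # sample standard deviation (n - 1)
    return {
        "mean": mean,
        "variance": stdev ** 2,
        "stdev": stdev,
        "threshold": k * stdev,
    }


def is_significant(stats: Dict[str, float], value: float) -> bool:
    """True if value improves on the baseline mean by more than the threshold."""
    return (stats["mean"] - value) > stats["threshold"]


if __name__ == "__main__":
    stats = baseline_stats([10.0, 10.2, 9.8])
    print(stats)
    print(is_significant(stats, 9.5))
```

Measuring noise first guards against keeping a change whose apparent gain is within run-to-run variation.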

Running Tests

python3 -m pytest tests/ -v

Requirements

  • python3 (3.8+)
  • git
