Forge

Forge is an autonomous AI software-engineering agent. Point it at any GitHub issue and it will clone the repository, write the fix, and produce a verified git diff patch — all inside an isolated Docker sandbox, driven by any OpenAI-compatible model API.

Tested with TestSprite — see testsprite_tests/ and landing/testsprite_tests for generated test cases and reports.

It integrates with ElizaOS as a first-class action handler and can be deployed on decentralised compute via Nosana.

Quick start — no Rust required

The only prerequisite is Docker.

1. Get the config files

git clone <your-repo-url>
cd forge

2. Configure credentials

cp .env.example .env

Open .env and fill in three required values:

FORGE_MODEL=your-model-name        # e.g. gpt-4o, models/gemini-2.0-flash-001
FORGE_BASE_URL=https://...         # OpenAI-compatible endpoint
FORGE_API_KEY=your-api-key

3. See what issues are open on a repo

docker compose run --rm list-issues
# or specify a different repo:
docker compose run --rm list-issues --repo owner/repo

Output:

Open issues on owner/repo (5 shown):

#      Title
------------------------------------------------------------------------
#3     Add input validation to signup form  [bug]
#7     Dark mode flicker on page load       [enhancement]
#12    Upgrade to Node 20

4. Fix an issue

Set FORGE_REPO and FORGE_ISSUE in your .env, then:

docker compose run --rm forge

Or pass them directly:

docker compose run --rm forge run --repo owner/repo --issue 12

Forge will clone the repo, work autonomously, and print a patch when done.

5. Always-on watch mode

Set it up once — then just label issues on GitHub and Forge handles the rest.

Add to .env:

FORGE_WATCH_REPO=owner/repo
FORGE_WATCH_LABEL=forge        # label to watch for
FORGE_WATCH_INTERVAL=60        # seconds between polls
GITHUB_TOKEN=ghp_...           # required to push the fix branch back

Start in the background:

docker compose up watch -d

That's it. Go to GitHub, open any issue in your repo, and add the label forge. Within 60 seconds Forge picks it up, fixes it, and pushes the result to a branch named forge/issue-{N}. Review the branch and merge when you're happy.

Already-processed issues are tracked in trajectories/watch_state.json — Forge never double-processes the same issue.
The service has restart: unless-stopped — it survives crashes and Docker restarts automatically.
Stop it any time with docker compose down.

How Forge works

GitHub issue  ──►  add label "forge"
                          │
                    Forge detects it
                    (within 60 s)
                          │
               ┌──────────────────────┐
               │  Docker sandbox      │
               │  1. Clone repo       │
               │  2. Agent loop       │
               │     think → act      │
               │     → observe        │
               │  3. submit           │
               │     git diff patch   │
               └──────────────────────┘
                          │
                    Push branch
                    forge/issue-{N}
                          │
                   You review & merge

CLI reference

`forge list-issues`

Scan a GitHub repo and show open issues.

forge list-issues --repo owner/repo
forge list-issues --repo owner/repo --label bug
forge list-issues --repo owner/repo --limit 50

Flag	Default	Description
`--repo`	—	GitHub repository (owner/repo)
`--label`	—	Filter by label
`--limit`	30	Maximum issues to show

`forge run`

Run the agent on a single issue or problem.

# Recommended — pick from list-issues output
forge run --repo owner/repo --issue 42

# Or with a full URL
forge run --github-url https://github.com/owner/repo/issues/42

# Inline problem text (no GitHub needed)
forge run --problem-text "Add rate-limiting to the /api/login endpoint"

Flag	Env var	Default	Description
`--repo`	—	—	GitHub repository (use with `--issue`)
`--issue`	—	—	Issue number
`--github-url`	—	—	Full GitHub issue URL
`--problem-text`	—	—	Inline problem statement
`--problem-file`	—	—	Path to plain-text problem file
`--model`	`FORGE_MODEL`	—	Model identifier
`--base-url`	`FORGE_BASE_URL`	—	OpenAI-compatible API URL
`--api-key`	`FORGE_API_KEY`	—	API key
`--image`	—	`forge-sandbox:latest`	Docker sandbox image
`--output-dir`	—	`trajectories`	Where to save `.traj` files
`--max-steps`	—	`100`	Step limit
`--config`	—	—	YAML config file (see Advanced)

`forge watch`

Poll a repo and automatically fix every issue that carries a given label. When GITHUB_TOKEN is set, the fix is pushed to branch forge/issue-{N} automatically.

forge watch --repo owner/repo --label forge --interval 60

Flag	Env var	Default	Description
`--repo`	`FORGE_WATCH_REPO`	—	GitHub repository
`--label`	`FORGE_WATCH_LABEL`	`forge`	Label to watch for
`--interval`	`FORGE_WATCH_INTERVAL`	`60`	Poll interval (seconds)
`--model`	`FORGE_MODEL`	—	Model identifier
`--base-url`	`FORGE_BASE_URL`	—	API base URL
`--api-key`	`FORGE_API_KEY`	—	API key
`--image`	`FORGE_SANDBOX_IMAGE`	`forge-sandbox:latest`	Sandbox image
`--output-dir`	—	`trajectories`	Trajectory output dir

Branch push: set GITHUB_TOKEN in .env and every completed fix is pushed to forge/issue-{N} on GitHub automatically.

`forge quick-stats`

Summarise trajectory results.

forge quick-stats                   # scans ./trajectories
forge quick-stats /path/to/trajs

Environment variables

Variable	Required	Description
`FORGE_MODEL`	Yes	Model identifier passed to the API
`FORGE_BASE_URL`	Yes	Base URL of an OpenAI-compatible endpoint
`FORGE_API_KEY`	Yes	API key for the endpoint
`FORGE_REPO`	For one-shot	GitHub repo (owner/repo)
`FORGE_ISSUE`	For one-shot	Issue number
`FORGE_WATCH_REPO`	For watch	Repo to monitor
`FORGE_WATCH_LABEL`	No	Label to watch (default: `forge`)
`FORGE_WATCH_INTERVAL`	No	Poll interval in seconds (default: `60`)
`GITHUB_TOKEN`	For branch push	PAT — required to push fix branches; also raises API rate limit and enables private repos
`RUST_LOG`	No	Log filter — e.g. `forge=debug` (default: `forge=warn`)
`DOCKER_GID`	No	Docker group GID on host (docker-compose socket mount)

Deploying to Nosana

Forge runs as a standard Docker container on Nosana's decentralised compute network.

1. Configure the job definition

Edit nos_job_def/forge_job_definition.json — replace the $FORGE_* placeholders with your actual values, or pass them as environment variables via the Nosana CLI.

2. Deploy

npm install -g @nosana/cli

nosana job post \
  --file ./nos_job_def/forge_job_definition.json \
  --market nvidia-4090 \
  --timeout 120 \
  --api <YOUR_NOSANA_API_KEY>

Or paste the job definition JSON into the Nosana Dashboard.

Running the HTTP API (forge-api)

TestSprite and other integration tools expect an HTTP surface to test against. This repository includes a thin Axum wrapper crate at crates/forge-api that exposes the core Forge functionality over HTTP (port 8080 by default). The service maps CLI commands to REST endpoints such as:

GET /health — liveness/readiness probe
POST /api/run — trigger a single RunSingle::run() agent execution
POST /api/run/batch — run multiple items in parallel
GET /api/issues — list GitHub issues (wraps existing GitHub client logic)
GET /api/stats — aggregate trajectory statistics
POST/GET/DELETE /api/watch — start/inspect/stop the label watcher

Quick start (compose-managed):

# start forge-api and dependent services
docker compose up forge-api -d
# or run the full stack:
docker compose up -d

Important notes:

If you run forge-api inside Docker, the service needs access to the Docker socket to spawn sandbox containers. Ensure the forge-api service in docker-compose.yml includes the host socket mount:

services:
  forge-api:
    # ...
    volumes:
      - /var/run/docker.sock:/var/run/docker.sock

Provide required environment variables (see the Environment variables section below) to allow the server and agent to authenticate with your model provider and GitHub.

TestSprite integration & automated tests

This project includes TestSprite-generated tests and reports under testsprite_tests/ and landing/testsprite_tests/. The test set covers:

Backend API tests (HTTP) for crates/forge-api (bootstrap expects the API on localhost:8080)
Frontend UI tests for the Next.js landing site (visual/DOM checks, scroll/hash navigation)

To run TestSprite tests locally:

Ensure forge-api is running on port 8080 (see previous section).
Start the Next.js landing server if you want to run frontend checks: cd landing && pnpm dev (or build + start for a stable environment).
Use your TestSprite client / MCP dashboard to bootstrap the project pointing at the appropriate local ports. In TestSprite configuration, point the backend tests to localPort: 8080 and the frontend tests to your landing server port (usually 3000).

Notes and troubleshooting:

Runner environment: TestSprite executes tests in an environment that may differ from your local shell. Make sure required env vars (FORGE_MODEL, FORGE_BASE_URL, FORGE_API_KEY, GITHUB_TOKEN) are available to the services under test.
Docker-in-Docker: If the server runs inside Docker, do not forget the socket mount described above. Missing socket access commonly causes 500 errors in POST /api/run.
DOM assertions for external links: Frontend tests validate link attributes on the landing page itself (look for data-testid on GitHub anchors). External site navigation may be blocked or lose DOM context; prefer DOM attribute checks over full external navigation for reliability.

Output — trajectories

Every run produces a <instance-id>.traj JSON file in output_dir with the complete agent history: every command, every output, every model response, and timing data.

forge quick-stats          # summary counts
cat trajectories/*.traj    # full detail

Advanced — YAML config

Most users only need environment variables and CLI flags. For advanced use (custom system prompts, extended timeouts, batch runs), use a YAML config:

forge run --config example.yaml

See example.yaml in this repository for all available fields with comments.

ElizaOS integration

forge-plugin exposes a SolveIssueAction that plugs into any ElizaOS agent:

use forge_plugin::action::{SolveIssueAction, SolveIssueParams};

let result = SolveIssueAction::new()
    .handle(SolveIssueParams {
        github_url: Some("https://github.com/owner/repo/issues/42".into()),
        model_name: Some("your-model".into()),
        base_url:   Some("https://your-provider.example.com/v1".into()),
        api_key:    Some("your-api-key".into()),
        ..Default::default()
    })
    .await?;

println!("status: {:?}", result.exit_status);
println!("patch:  {:?}", result.submission);

Building from source

Rust 1.82+ required. Docker still needed at runtime.

# Build the sandbox image once
docker build -f Dockerfile.sandbox -t forge-sandbox:latest .

# Build the forge binary
cargo build --release -p forge

# Run
./target/release/forge list-issues --repo owner/repo

Run all tests:

cargo test --workspace
# Include Docker-gated integration tests:
cargo test --workspace -- --include-ignored

Crate architecture

forge/crates/
├── forge-types      Shared data types
├── forge-tools      Response parsers
├── forge-model      Model backends (OpenAI-compat, Anthropic, Replay, Human)
├── forge-env        Docker runtime, bash sessions
├── forge-agent      Agent loop, history processors
├── forge-run        RunSingle, RunBatch, YAML config
├── forge-plugin     ElizaOS integration
└── forge            CLI binary (list-issues, run, watch, quick-stats)

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
.github/workflows		.github/workflows
crates		crates
docs/superpowers		docs/superpowers
examples		examples
landing		landing
nos_job_def		nos_job_def
testsprite_tests		testsprite_tests
trajectories		trajectories
.dockerignore		.dockerignore
.env.example		.env.example
.gitignore		.gitignore
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
Dockerfile		Dockerfile
Dockerfile.release		Dockerfile.release
Dockerfile.sandbox		Dockerfile.sandbox
LICENSE		LICENSE
README.md		README.md
TESTSPRITE_PRD.md		TESTSPRITE_PRD.md
docker-compose.yml		docker-compose.yml
example.yaml		example.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Forge

Quick start — no Rust required

1. Get the config files

2. Configure credentials

3. See what issues are open on a repo

4. Fix an issue

5. Always-on watch mode

How Forge works

CLI reference

`forge list-issues`

`forge run`

`forge watch`

`forge quick-stats`

Environment variables

Deploying to Nosana

1. Configure the job definition

2. Deploy

Running the HTTP API (forge-api)

TestSprite integration & automated tests

Output — trajectories

Advanced — YAML config

ElizaOS integration

Building from source

Crate architecture

License

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Forge

Quick start — no Rust required

1. Get the config files

2. Configure credentials

3. See what issues are open on a repo

4. Fix an issue

5. Always-on watch mode

How Forge works

CLI reference

forge list-issues

forge run

forge watch

forge quick-stats

Environment variables

Deploying to Nosana

1. Configure the job definition

2. Deploy

Running the HTTP API (forge-api)

TestSprite integration & automated tests

Output — trajectories

Advanced — YAML config

ElizaOS integration

Building from source

Crate architecture

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

`forge list-issues`

`forge run`

`forge watch`

`forge quick-stats`

Packages