GitHub - collinear-ai/simlab: SimLab is the data layer for creating RL environments to evaluate, hillclimb, and refine agents.

A self-serve simulation lab, enabling you to build and refine long-horizon AI agents.

SimLab is the data layer for adaptively composing RL simulations and evaluating and refining agents. SimLab is toolset, agent harness and sandbox agnostic. Browse pre-built scenario templates or bring your own CLI/MCP toolset.

Browse & compose environments from a catalog of tool servers and scenario templates
Run agents against tasks using any LLM provider (OpenAI, Fireworks, custom endpoints)
Generate custom tasks with built-in task generation pipelines
Evaluate automatically with verifiers and reward model scoring
Scale to the cloud with Daytona for remote sandbox execution (if you want to experiment with large-scale parallel rollouts, reach out to us or join the Discord!)

Quickstart

Install

uv tool install simulationlab
# or, if you want Daytona support:
uv tool install "simulationlab[daytona]"

# pipx works too:
pipx install simulationlab

The PyPI package is named simulationlab. The installed CLI command is simlab. SimLab currently supports Python 3.13.

Get your API keys and export them:

export SIMLAB_COLLINEAR_API_KEY="col_..."   # from platform.collinear.ai (Developers > API Keys)
export DAYTONA_API_KEY="dtn_..."            # from app.daytona.io
export OPENAI_API_KEY="sk-..."              # from platform.openai.com/api-keys
export SIMLAB_VERIFIER_MODEL="gpt-5.2"      # reward model (tasks also come with a programmatic verifier so can skip this)
export SIMLAB_VERIFIER_PROVIDER="openai"    # litellm compatible provider name
export SIMLAB_VERIFIER_API_KEY="$OPENAI_API_KEY" # corresponding key

Create an environment:

# Create and start an environment on Daytona
simlab env init my-env --template hr
simlab env up my-env --daytona

To list templates run simlab templates list

Create and list tasks in directory ./generated-tasks

simlab tasks-gen init --preset recruiting # Can go to the config.toml to setup number of tasks etc.
simlab tasks list --tasks-dir ./generated-tasks # takes 5-10 mins with the default setting, choose haiku and 2 tasks for a faster generation.

Run a task with task id `task_id`.

simlab tasks run --env my-env \
  --task task_id \
  --tasks-dir ./generated-tasks \
  --agent-model gpt-5.2 \
  --agent-api-key "$OPENAI_API_KEY"

Tear down

simlab env down my-env --daytona

Running locally with Docker

If you have Docker + Docker Compose installed, you can run environments on your machine instead of Daytona:

simlab env init my-env --template hr
simlab env up my-env
simlab env down my-env

No DAYTONA_API_KEY required. First run may take several minutes while images are pulled/built.

For the full walkthrough — configuration, custom agents, task generation, verifiers, and more — see the Quickstart Guide.

API Keys

Key	Required	How to get it
Collinear API key	Yes	platform.collinear.ai (Developers > API Keys)
OpenAI API key	For running agents	platform.openai.com/api-keys
Daytona API key	Yes (default runtime)	app.daytona.io

Configuration

SimLab resolves configuration in this order: config file < environment variables < CLI flags.

Config file: ~/.config/simlab/config.toml (override with --config-file or SIMLAB_CONFIG)

collinear_api_key = "col_..."

[agent]
model = "gpt-5-mini"
provider = "openai"
api_key = "sk-..."

[daytona]
api_key = "dtn_..."

[verifier]
model = "claude-sonnet-4-6"
api_key = "sk-ant-..."

Environment Variables

Variable	Description
`SIMLAB_COLLINEAR_API_KEY`	Collinear API key
`SIMLAB_AGENT_API_KEY`	Agent API key
`OPENAI_API_KEY`	Fallback agent API key (when provider is `openai`)
`DAYTONA_API_KEY`	Daytona API key
`SIMLAB_DAYTONA_API_KEY`	Daytona API key (alternative)
`SIMLAB_SCENARIO_MANAGER_API_URL`	Override Scenario Manager API URL
`SIMLAB_VERIFIER_MODEL`	Verifier model
`SIMLAB_VERIFIER_API_KEY`	Verifier API key
`SIMLAB_ENVIRONMENTS_DIR`	Root directory for environments
`SIMLAB_DISABLE_TELEMETRY`	Set to `1` to disable CLI telemetry

CLI Reference

Command	Description
`simlab env init <name>`	Create a new environment (from template or interactive)
`simlab env up <name>`	Start environment containers (local Docker or `--daytona`)
`simlab env down <name>`	Stop and remove environment containers
`simlab env seed <name>`	Seed initial data into a running environment
`simlab tasks list`	List available tasks for an environment
`simlab tasks run`	Run an agent against a task and evaluate results
`simlab tasks-gen init`	Initialize task generation config (with presets)
`simlab tasks-gen validate`	Validate a task generation config
`simlab tasks-gen run`	Generate custom tasks via the API
`simlab templates list`	List available scenario templates
`simlab templates info <name>`	Show details for a specific template
`simlab tools list`	List available tool servers
`simlab tools info <name>`	Show details for a specific tool server

Run simlab --help for full usage details.

Documentation

Quickstart Guide — full setup and usage walkthrough
Docs — complete documentation
Collinear Platform — get your API key

License

This project is licensed under the Apache 2.0 License.

Contact

Questions or feedback? Reach out to us:

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
.github/workflows		.github/workflows
assets		assets
examples/custom-cli-tools		examples/custom-cli-tools
output/agent_run_generated-task_20260319_170544		output/agent_run_generated-task_20260319_170544
src/simlab		src/simlab
taskgen		taskgen
tests		tests
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
LICENSE		LICENSE
QUICKSTART.md		QUICKSTART.md
README.md		README.md
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Quickstart

Install

Create an environment:

Run a task with task id `task_id`.

Tear down

API Keys

Configuration

Environment Variables

CLI Reference

Documentation

License

Contact

About

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Quickstart

Install

Create an environment:

Run a task with task id task_id.

Tear down

API Keys

Configuration

Environment Variables

CLI Reference

Documentation

License

Contact

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Contributors

Uh oh!

Languages

Run a task with task id `task_id`.