Agent trajectory extractors

This repo includes trajectory extractors for different coding agents, exposed via a unified API.

Supported agents

1. MiniSWE-agent

Format: .traj.json files
Location: contextbench/agents/minisweagent/extract.py
Notes:
- Extracts file views from bash commands in messages
- Supports cat, sed -n, head, grep, nl | sed commands
- Parses patch_context_data.patch_context for final context
- Returns model patch from info.submission
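
As a rough illustration of how a file view might be recovered from a bash command, the sketch below handles only two of the forms listed above (`cat FILE` and `sed -n 'A,Bp' FILE`). The function name `parse_view_command` and its return shape are hypothetical and are not taken from `extract.py`; the real extractor may tokenize and match commands differently.

```python
import re
import shlex

# Hypothetical helper for illustration; not the repo's implementation.
SED_RANGE = re.compile(r"^(\d+),(\d+)p$")


def parse_view_command(command):
    """Return (path, start, end) for a recognized view command, else None.

    start and end are 1-based inclusive line numbers; None means the whole file.
    """
    try:
        tokens = shlex.split(command)
    except ValueError:
        return None  # e.g. unbalanced quotes
    if not tokens:
        return None
    # `cat FILE` views the entire file.
    if tokens[0] == "cat" and len(tokens) == 2:
        return (tokens[1], None, None)
    # `sed -n 'A,Bp' FILE` views lines A through B.
    if tokens[0] == "sed" and len(tokens) == 4 and tokens[1] == "-n":
        match = SED_RANGE.match(tokens[2])
        if match:
            return (tokens[3], int(match.group(1)), int(match.group(2)))
    return None
```

Pipelines such as `nl | sed`, and the `head` and `grep` forms, would need additional handling that this sketch leaves out.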

2. SWE-agent

Format: .checkpoints.jsonl files
Location: contextbench/agents/sweagent/extract.py
Notes:
- Extracts from str_replace_editor view commands with --view_range
- Only includes steps with explicit line ranges
- Parses patch_context string format (File:/Lines:)
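
The rule "only steps with explicit line ranges" can be sketched as follows. This assumes the action string looks like `str_replace_editor view PATH --view_range START END`; the helper name `parse_view_range` is hypothetical, and the actual extractor may read the range from structured fields rather than from the raw command string.

```python
import shlex


def parse_view_range(action):
    """Return (path, start, end) for a view action with --view_range, else None."""
    try:
        tokens = shlex.split(action)
    except ValueError:
        return None  # malformed quoting
    if len(tokens) < 3 or tokens[:2] != ["str_replace_editor", "view"]:
        return None
    if "--view_range" not in tokens:
        return None  # no explicit line range, so the step is skipped
    idx = tokens.index("--view_range")
    bounds = tokens[idx + 1: idx + 3]
    # This sketch rejects missing, non-numeric, or negative bounds.
    if len(bounds) != 2 or not all(b.isdigit() for b in bounds):
        return None
    return (tokens[2], int(bounds[0]), int(bounds[1]))
```

A view of a whole file (no `--view_range`) returns `None` here, which mirrors the note that such steps are not included.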

Unified interface

from contextbench.agents import extract_trajectory

# MiniSWE-agent trajectory (.traj.json)
result = extract_trajectory("path/to/trajectory.traj.json")

# SWE-agent trajectory (.checkpoints.jsonl)
result = extract_trajectory("path/to/trajectory.checkpoints.jsonl")
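
One plausible way for a single entry point to serve both formats is to dispatch on the file suffix. The sketch below is an assumption about how that could work, not a description of `extract_trajectory` itself; `SUFFIX_TO_AGENT` and `detect_agent` are hypothetical names, and the agent labels simply reuse the directory names under `contextbench/agents/`.

```python
from pathlib import Path

# Suffixes come from the formats listed above; the mapping itself is illustrative.
SUFFIX_TO_AGENT = {
    ".traj.json": "minisweagent",
    ".checkpoints.jsonl": "sweagent",
}


def detect_agent(path):
    """Guess which agent produced a trajectory file from its file name."""
    name = Path(path).name
    for suffix, agent in SUFFIX_TO_AGENT.items():
        if name.endswith(suffix):
            return agent
    raise ValueError(f"Unrecognized trajectory format: {name}")
```

Matching on the full compound suffix (rather than `Path.suffix`, which would yield only `.json` or `.jsonl`) keeps the two formats distinguishable.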

The extractor returns a unified structure:

{
    "pred_steps": [{"files": [...], "spans": {...}}, ...],
    "pred_files": [...],
    "pred_spans": {...},
}
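
To show how the returned structure might be consumed, here is a minimal sketch that counts steps, files, and spans. It assumes each `spans` value maps a file path to a list of `[start, end]` line ranges; the source elides that detail, so treat the shape, the `summarize` helper, and the sample data as hypothetical.

```python
def summarize(result):
    """Count steps, files, and spans in an extractor result.

    Assumes pred_spans maps file path -> list of [start, end] ranges.
    """
    span_count = sum(len(ranges) for ranges in result["pred_spans"].values())
    return {
        "steps": len(result["pred_steps"]),
        "files": len(result["pred_files"]),
        "spans": span_count,
    }


# Made-up example data in the assumed shape.
example = {
    "pred_steps": [
        {"files": ["a.py"], "spans": {"a.py": [[1, 10]]}},
        {"files": ["b.py"], "spans": {"b.py": [[5, 8], [20, 30]]}},
    ],
    "pred_files": ["a.py", "b.py"],
    "pred_spans": {"a.py": [[1, 10]], "b.py": [[5, 8], [20, 30]]},
}
summary = summarize(example)
```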

Testing

python -m contextbench.evaluate \
    --gold Context-dataset/Verified/annots_pass \
    --pred traj_verified-mini/instance/instance.traj.json \
    --out results.jsonl
