mev-kit

Status: Alpha (v0.1.0) — research and paper-trading toolkit. Live execution is scaffolded but uses placeholder transactions. Do not use with real funds without replacing the execution stubs.

Open-source Solana MEV research and paper-trading kit with experimental live-execution scaffolding.

Build, backtest, and paper-trade MEV strategies on Solana. The architecture is designed so the same detector code can eventually run across backtest, paper, and live modes — but the live execution path is not production-ready yet (see Current Limitations).

What works today

Strategy research pipeline: Source → Detector → Simulator → Sink → Monitor
Web UI: Backtesting, strategy editor (Monaco), data management, risk analytics
Multi-venue data: Binance, Coinbase, Birdeye, Helius — auto-merge with lookahead bias prevention
Live CEX WebSockets: Binance and Coinbase real-time trade feeds, no API key needed
Realistic fill simulation: Adverse selection model, volume-based order book depth, spread-proportional staleness decay
Risk analytics: Sharpe ratio, max drawdown, equity curves, hourly heatmaps, survivorship bias warnings
Strategy optimization: Parameter sweeps, walk-forward validation, result versioning
7 detectors: CEX-DEX arb, price momentum, spread tracker, multi-pool arb, liquidation, statistical arb, volatility regime spread

What's experimental

Live execution: JitoBundleSink builds real signed Solana transactions via solders for known pools (SOL/USDC Raydium). Falls back to placeholder for unknown pools. Use with caution — tested against transaction structure, not against mainnet execution.
RPC simulation: RPCSimulator builds real Raydium swap transactions and submits to simulateTransaction. Works for known pool addresses. Unknown pools return a graceful error.
Paper trading: Requires live WebSocket connections (Helius + Binance or Coinbase). Works when API keys are configured but may fail silently on connection drops.
Pool coverage: Transaction construction is currently hardcoded for SOL/USDC Raydium AMM v4. Other pools need their account addresses added to the registry.
Fill simulation accuracy: Models adverse selection (~55% of fills experience spread reversion), volume-based order book depth, and spread-proportional staleness. Realism rating: 6/10 — directionally correct but still uses statistical models rather than tick-level replay.

Quick start

# Install
pip install -e ".[dev]"

# Launch the web UI
mev-kit ui

# Opens at http://localhost:8080

First backtest (no API keys needed)

Go to Data tab → Select SOL/USDC → Check Coinbase + Birdeye → Click Fetch & Prepare
Go to Backtest tab → Select the merged dataset → Pick a strategy → Run Backtest
View results: P&L, Sharpe ratio, equity curve, hourly heatmap, cost breakdown

API keys (optional, for more features)

# Add to .env file:
HELIUS_API_KEY=your-key          # Free at helius.dev — enables paper trading
BIRDEYE_API_KEY=your-key         # Free at birdeye.so — enables historical DEX prices
TARDIS_API_KEY=your-key          # Optional — real L2 order book data
MEV_KIT_WEBHOOK_URL=your-url     # Optional — Slack/Discord alerts

Architecture

Source (IngestAdapter) → Detector → Simulator → Sink → Monitor

Every layer is a pluggable interface. Free-tier defaults ship with the package. Pro-tier implementations swap in without changing strategy code.

Layer	Free tier	Pro tier
Source	Helius WS, Binance WS, Coinbase WS, Birdeye, Parquet replay	Geyser, Yellowstone gRPC, ShredStream
Detector	CEX-DEX arb, momentum, spread tracker	Custom strategies via Strategy Editor
Simulator	Passthrough, FillSimulator (adverse selection + volume depth)	RPC simulator, forked validator
Sink	Paper trade (SQLite), Backtest (Parquet)	Jito bundle, multi-path
Monitor	Prometheus metrics, structured logging	Custom dashboards

Web UI

Launch with mev-kit ui — a professional trading terminal interface:

Dashboard: Real-time pipeline monitoring with draggable panels
Strategies: Monaco code editor with Python syntax highlighting, validation, fork examples
Backtest: Run strategies against historical data with fill simulation
Config: TOML profile management, API key status
Analysis: Results explorer with charts, trade table, CSV export
Data: Multi-venue data fetching with auto-merge and lag correction
Learn: 6 educational guides on Solana MEV

Data pipeline

Select market → Pick venues → Fetch → Auto-merge → Lag-correct → Backtest-ready

Supported venues:

Binance — CEX historical candles + live WebSocket trades (free, no key)
Coinbase — CEX historical candles + live WebSocket trades (free, no key)
Bybit — CEX historical candles (free, no key)
Birdeye — DEX historical prices aggregated across Raydium, Orca, Meteora, Phoenix
Helius — Live on-chain pool state polling
Tardis.dev — L2 order book snapshots for realistic CEX slippage (optional)

Fill simulation

Backtests model realistic execution per venue with three layers of realism:

Venue	Fee	Slippage model	Landing rate
Raydium AMM	25 bps	Constant product `dx/(R+dx)`	40% Jito
Orca Whirlpool	30 bps	CLMM (3x efficiency)	40% Jito
Jupiter	0 + venue	Aggregated routing (4x eff)	45% Jito
Binance	10 bps taker	Order book (Almgren-Chriss)	98% fill
Coinbase	18 bps taker	Order book	97% fill

Realism features:

Adverse selection: ~55-80% of fills experience spread reversion before execution (breaks the artificial 100% win rate)
Volume-based depth: Order book depth estimated from CEX candle volume (~1% of daily volume within 10 bps), not hardcoded
Spread-proportional staleness: Large spreads (30+ bps) decay 60-80%/sec as competition closes them; small spreads persist
Two-leg arb modeling: Separate cost simulation for DEX and CEX legs
Dynamic landing rates: Competition-adjusted Jito bundle landing probability

Strategy development

from mev_kit.strategies.base import Detector
from mev_kit.models import Opportunity, Source, StateUpdate

class MyDetector(Detector):
    required_sources = {Source.BINANCE_WS, Source.HELIUS_WS}

    async def process(self, update: StateUpdate) -> Opportunity | None:
        # Your detection logic here
        return None

    def hyperparameters(self):
        return {"min_spread_bps": (5.0, 50.0, 5.0)}

Enhanced Detector API (inspired by Artemis, Hummingbot, Jesse.trade):

required_sources — declare data feed requirements
sync_state() — one-time initialization
before() / after() — per-update lifecycle hooks
filters() — post-detection validation
process_batch() — multi-opportunity per update
hyperparameters() — optimizer-friendly parameter ranges

CLI

mev-kit ui                    # Launch web dashboard
mev-kit backtest --config config/free.toml --data ./data/file.parquet
mev-kit paper --config config/free.toml
mev-kit live --config config/free.toml --size 0.01
mev-kit analyze --db ./data/results.db

Tech stack

Backend: Python 3.11+, FastAPI, asyncio, Pydantic v2, aiosqlite, Polars, httpx

Frontend: React 18, TypeScript, Vite, Tailwind CSS, TradingView Lightweight Charts, Recharts, Monaco Editor

Solana: solders, solana-py, websockets (Helius/Binance), Jito bundles

Project structure

src/mev_kit/
├── pipeline/          # Pipeline orchestrator
├── models/            # Pydantic data models
├── adapters/
│   ├── ingest/        # Data source adapters (Helius, Binance, Birdeye, Parquet replay)
│   ├── simulators/    # Fill simulation (passthrough, RPC, venue-specific)
│   └── sinks/         # Execution output (paper trade, backtest, Jito bundle)
├── strategies/        # Detector base class + examples
│   └── examples/      # 6 example detectors
├── utils/             # Precision math, risk metrics, alerts, monitoring
└── ui/                # FastAPI server + React dashboard
    ├── routers/       # API endpoints
    ├── guides/        # Educational markdown content
    └── static/        # Built React app

Testing

pytest tests/ -v          # 256 tests
ruff check src/ tests/    # Lint
cd ui && npx tsc --noEmit # TypeScript

Current Limitations

This is an alpha research toolkit. Be aware of these gaps:

Area	Status	Gap
Backtesting	Working	Fill simulation uses estimated pool depth, not real on-chain reserves. P&L is directionally correct but not exact.
Paper trading	Working (with keys)	Requires Helius + Binance WebSocket connections. May fail silently on connection drops.
Live execution	Working for SOL/USDC	Real `solders`-built, keypair-signed transactions for Raydium AMM v4. Falls back to placeholder for unknown pools. Not yet mainnet-tested.
RPC simulation	Working for known pools	Builds real swap transactions, submits to `simulateTransaction`. Unknown pools return graceful error.
Strategy generality	Working	CLI and web UI both use dynamic `_load_detector()`. Any registered strategy works.
Fill accuracy	~1.5-3x of reality	Adverse selection and volume-based depth are directionally correct. Still uses statistical models, not tick-level replay.

Roadmap to production

~~Replace Jito bundle placeholders with real solders-built, keypair-signed transactions~~ ✅
~~Replace RPC simulator stub with real transaction construction + simulateTransaction~~ ✅
~~Calibrate fill simulation against real order book data (Tardis free tier)~~ ✅
~~Make CLI strategy selection dynamic (not hardcoded to one detector)~~ ✅
~~Expand pool registry beyond SOL/USDC (dynamic on-chain account lookup)~~ ✅
~~End-to-end mainnet test script (scripts/test_mainnet_bundle.py --dry-run)~~ ✅
~~Realistic fill simulation: adverse selection, volume-based depth, spread-proportional decay~~ ✅
~~Coinbase live WebSocket adapter~~ ✅
Wire Tardis L2 snapshots into backtest flow for data-driven adverse selection
Tick-level replay with actual order book state per timestamp

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 85 Commits
config		config
docs		docs
examples		examples
scripts		scripts
src/mev_kit		src/mev_kit
tests		tests
ui		ui
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
Makefile		Makefile
README.md		README.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

mev-kit

What works today

What's experimental

Quick start

First backtest (no API keys needed)

API keys (optional, for more features)

Architecture

Web UI

Data pipeline

Fill simulation

Strategy development

CLI

Tech stack

Project structure

Testing

Current Limitations

Roadmap to production

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

mev-kit

What works today

What's experimental

Quick start

First backtest (no API keys needed)

API keys (optional, for more features)

Architecture

Web UI

Data pipeline

Fill simulation

Strategy development

CLI

Tech stack

Project structure

Testing

Current Limitations

Roadmap to production

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages