
🚀 ZeroCostLLM

The Smartest Free AI Infrastructure on the Planet.

ZeroCostLLM is a self-maintaining, autonomous OpenAI-compatible proxy server. It mathematically identifies the highest-IQ free models available on the market (OpenRouter Free Tier), monitors their live health, and routes your traffic to the best available provider with automatic failover.

✨ Key Features

  • 🧠 Mathematical IQ Ranking: Uses a verified capability formula (Params × log(Context)) cross-referenced with the models.dev database to rank models by intelligence, not hype.
  • 📡 Live Health Monitoring: Authenticates against OpenRouter endpoints to pull real-time Uptime (%) and authenticated Throughput (TPS).
  • 🔄 Autonomous Failover: Integrated APScheduler refreshes rankings every 60 minutes. If the #1 model goes down, the server silently jumps to the next best in the pool.
  • 🏎️ Nitro-Optimized: Uses OpenRouter's sort: throughput routing to ensure you are always using the fastest provider for any given model ID.
  • 🛠️ Dynamic Tuning: Hot-swap weights (IQ vs. Speed vs. Reliability) via a config.json or on-the-fly via the /v1/refresh endpoint.
  • 🔌 OpenAI Compatible: Drop-in replacement for any app that supports OpenAI. Just point your base URL to http://localhost:8000/v1.
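The ranking formula above can be sketched in a few lines of Python (a minimal illustration — the actual field names, weight handling, and normalization in the codebase may differ):

```python
import math

def capability_score(params_billions: float, context_tokens: int) -> float:
    """Raw capability estimate: Params x log(Context)."""
    return params_billions * math.log(context_tokens)

def composite_score(params_billions: float, context_tokens: int,
                    tps: float, uptime_pct: float, weights: dict) -> float:
    """Blend intelligence, speed, and reliability (hypothetical weighting)."""
    iq = capability_score(params_billions, context_tokens)
    return (weights.get("intelligence", 1.0) * iq
            + weights.get("speed", 0.0) * tps
            + weights.get("reliability", 0.0) * uptime_pct)

# A 70B model with 128k context outranks an 8B model with the same context.
print(composite_score(70, 131072, tps=50.0, uptime_pct=99.0,
                      weights={"intelligence": 1.0, "speed": 0.2}))
```

Models are then sorted by this composite score, and the top `model_pool_size` entries form the failover pool.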

🚀 Quick Start

1. Install Dependencies

```shell
# Recommended: use uv for blazing-fast installs
uv pip install fastapi uvicorn litellm requests pydantic python-dotenv apscheduler rich typer
```

2. Configure Environment

Create a .env file and add your OpenRouter API Key:

```shell
OPENROUTER_API_KEY=sk-or-v1-your-key-here
```

3. Start the Server

```shell
python main.py
```

The server is now live at http://localhost:8000.
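Because the proxy is OpenAI-compatible, any OpenAI-style client can talk to it. Here is a minimal request built with the standard library (the model name `"auto"` is illustrative — the proxy routes to its current top-ranked free model regardless):

```python
import json
import urllib.request

# Build an OpenAI-style chat completion request against the local proxy.
payload = {
    "model": "auto",  # illustrative; the proxy picks the best-ranked model
    "messages": [{"role": "user", "content": "Hello!"}],
}
req = urllib.request.Request(
    "http://localhost:8000/v1/chat/completions",
    data=json.dumps(payload).encode(),
    headers={"Content-Type": "application/json"},
)
# With the server running, send it:
# resp = json.load(urllib.request.urlopen(req))
# print(resp["choices"][0]["message"]["content"])
```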


📊 Management & Tooling

The Master Ranker CLI

Visualize the entire free-tier market with our rich CLI tool:

```shell
# Default balanced view
python live_health_ranker.py

# Rank by reasoning score, then by throughput
python live_health_ranker.py --sort brain --sort tps
```

On-the-Fly Overrides

Change the server's brain without a restart:

```shell
curl -X POST http://localhost:8000/v1/refresh \
  -H "Content-Type: application/json" \
  -d '{"model_pool_size": 3, "weights": {"intelligence": 1.0}}'
```

🛠️ Configuration (config.json)

| Key | Description |
| --- | --- |
| `update_interval_minutes` | How often to re-scrape model health. |
| `model_pool_size` | Number of top-tier models to include in the router. |
| `weights.intelligence` | Weight (0.0–1.0) for the raw parameter/context score. |
| `weights.speed` | Weight (0.0–1.0) for real-time TPS. |
| `overrides.blacklist` | Model IDs to never use. |
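Putting those keys together, a `config.json` might look like this (values and the blacklisted model ID are illustrative, not required defaults):

```json
{
  "update_interval_minutes": 60,
  "model_pool_size": 3,
  "weights": {
    "intelligence": 1.0,
    "speed": 0.5
  },
  "overrides": {
    "blacklist": ["some-provider/unreliable-model"]
  }
}
```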

🛡️ License

MIT - Built with ❤️ for the community.
