Tri-Model LLM Arena 🥊

title

Tri-Model LLM Arena

emoji

🥊

colorFrom

blue

colorTo

indigo

sdk

gradio

sdk_version

4.44.0

app_file

app.py

pinned

false

license

mit

Tri-Model LLM Arena 🥊

A unified, asynchronous chat interface designed to compare the "Big Three" AI providers—Anthropic, Google, and OpenAI—simultaneously within a single, responsive dashboard.

🚀 Overview

Tri-Model LLM Arena allows developers and researchers to send a single prompt and receive three parallel responses. It provides a "Replit-inspired" UI that adapts between desktop and mobile, offering deep insights into the performance and economics of each model through real-time telemetry.

✨ Key Features

Asynchronous Execution: Queries Claude, Gemini, and GPT APIs in parallel to minimize wait times and ensure simultaneous streaming.
Flexible UI (Replit-Style):
- Desktop: 3-pane side-by-side layout with an option to stack windows vertically.
- Mobile: Vertical stack with two panes closed by default (toggleable).
Model Tier Presets: Quickly switch between comparison sets:
- Flash: GPT-4o-mini, Gemini 1.5 Flash, Claude 3.5 Haiku.
- Pro/Mid: GPT-4o, Gemini 1.5 Pro, Claude 3.5 Sonnet.
- SOTA: o1-preview, Gemini 2.0 Ultra, Claude 3 Opus.
Real-time Analytics (Hovering Report):
- Latency: Average response time in seconds.
- Tokens: Total cumulative prompt and completion tokens.
- Cost: Estimated USD spend based on current provider pricing.
Session Reporting: Option to download a full session report (PDF/HTML) before closing, featuring series charts and an AI-generated performance summary.

🛠️ Tech Stack

Framework: Gradio
Backend: Python (with asyncio for non-blocking API calls)
Hosting: Hugging Face Spaces
Version Control: GitHub

⚙️ Installation & Setup

Clone the repository:

git clone [https://github.com/jaswanth-surabattula/tri-model-arena.git](https://github.com/jaswanth-surabattula/tri-model-arena.git)
cd tri-model-arena

Install dependencies:
```
pip install -r requirements.txt
```

Set up environment variables: Create a .env file in the root directory:

OPENAI_API_KEY=your_key_here
ANTHROPIC_API_KEY=your_key_here
GEMINI_API_KEY=your_key_here

Run the app locally:
```
python app.py
```

📊 Analytics Logic

The application calculates session metrics using the following logic:

Latency ($L$): $L = T_{response} - T_{request}$
Cost Calculation: $$Cost = \sum (Tokens_{in} \cdot Rate_{in}) + (Tokens_{out} \cdot Rate_{out})$$

🔄 Keeping Models Up to Date

Model IDs and pricing are hardcoded in src/three_way/core/config.py. When providers release new models or retire old ones, update that file manually.

See docs/how_to_update_models.md for step-by-step instructions, including where to find model IDs, how to convert pricing to the format the code expects, and which deprecation pages to watch.

📝 License

This project is licensed under the MIT License. This allows for open contribution and modification while keeping the tool accessible to the community. See the LICENSE file for more details.

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
docs		docs
src/three_way		src/three_way
tests		tests
.env.example		.env.example
.gitignore		.gitignore
.python-version		.python-version
AI.md		AI.md
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Tri-Model LLM Arena 🥊

🚀 Overview

✨ Key Features

🛠️ Tech Stack

⚙️ Installation & Setup

📊 Analytics Logic

🔄 Keeping Models Up to Date

📝 License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Tri-Model LLM Arena 🥊

🚀 Overview

✨ Key Features

🛠️ Tech Stack

⚙️ Installation & Setup

📊 Analytics Logic

🔄 Keeping Models Up to Date

📝 License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages