Genai-bench is a powerful benchmarking tool designed for comprehensive, token-level performance evaluation of large language model (LLM) serving systems.
It provides detailed insights into model serving performance, offering both a user-friendly CLI and a live UI for real-time progress monitoring.
- 🛠️ CLI Tool: Validates user inputs and initiates benchmarks seamlessly.
- 📊 Live UI Dashboard: Displays current progress, logs, and real-time metrics.
- 📝 Rich Logs: Automatically flushed to both terminal and file upon experiment completion.
- 📈 Experiment Analyzer: Generates comprehensive Excel reports with pricing and raw metrics data, plus flexible plot configurations (a 2x4 grid by default) that visualize key performance metrics such as throughput, latency (TTFT, E2E, TPOT), error rates, and RPS across traffic scenarios and concurrency levels; custom plot layouts and multi-line comparisons are also supported.
Quick Start: install genai-bench from PyPI:
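```bash
pip install genai-bench
```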
Alternatively, see the Installation Guide for other options.
Run a benchmark against your model:
```bash
genai-bench benchmark --api-backend openai \
  --api-base "http://localhost:8080" \
  --api-key "your-api-key" \
  --api-model-name "your-model" \
  --task text-to-text \
  --max-time-per-run 5 \
  --max-requests-per-run 100
```
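Before launching a long run, it can help to confirm the endpoint is reachable. A minimal pre-flight sketch, assuming an OpenAI-compatible server that exposes the standard `/v1/models` route (the URL and key below simply mirror the flags above):

```bash
# Hypothetical sanity check: list the models the server advertises.
# Assumes an OpenAI-compatible API at the base URL used in the benchmark command.
curl -s http://localhost:8080/v1/models \
  -H "Authorization: Bearer your-api-key"
```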
Generate Excel reports from your results:
```bash
genai-bench excel --experiment-folder ./experiments/your_experiment \
  --excel-name results \
  --metric-percentile mean
```
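The workbook name comes from `--excel-name`. Assuming the report is written into the experiment folder (a common layout for this kind of tool, not confirmed here), a quick check would be:

```bash
# Expect results.xlsx if the report lands in the experiment folder.
ls ./experiments/your_experiment/*.xlsx
```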
Create visualizations:
```bash
genai-bench plot --experiments-folder ./experiments \
  --group-key traffic_scenario \
  --preset 2x4_default
```
If you're new to genai-bench, check out the Getting Started page.
For detailed instructions, advanced configuration options, and comprehensive examples, see the User Guide.
If you're interested in contributing to genai-bench, see the Development Guide.
