Skip to content

docs: update Performance section with two-host benchmark numbers#49

Merged
levleontiev merged 3 commits intomainfrom
docs/issue-48-benchmark-numbers
Mar 18, 2026
Merged

docs: update Performance section with two-host benchmark numbers#49
levleontiev merged 3 commits intomainfrom
docs/issue-48-benchmark-numbers

Conversation

@levleontiev
Copy link
Contributor

Closes #48

Summary

  • Hardware updated: 2 × c7i.xlarge, cluster placement group, eu-central-1 (two-host setup)
  • Latency table: new numbers (p50 304µs decision, 302µs proxy, 235µs nginx)
  • Enforcement overhead framing: +69 µs p50 / +134 µs p90 over raw nginx
  • Throughput: 195k RPS simple and complex (up from 110k/67k single-host)
  • Removed tiktoken/token estimation row (not supported)
  • Inline tagline updated to < 70 µs enforcement overhead · 195k RPS

Test plan

  • Verify all old numbers replaced (112, 241, 110,500, 67,600)
  • Verify overhead framing in tagline and section

 #48)

- Hardware: 2 × c7i.xlarge, cluster placement group, eu-central-1
- New latency table: p50 304µs / p90 543µs decision service
- Enforcement overhead framing: +69µs p50 / +134µs p90 over raw nginx
- New throughput: 195k RPS (simple and complex)
- Removed tiktoken/token estimation row (not supported)
- Updated inline tagline to overhead framing
 #48)

- Hardware: 2 × c7i.xlarge, cluster placement group, eu-central-1
- New latency table: p50 304µs decision service
- Enforcement overhead framing: +69µs p50 / +134µs p90 over raw nginx
- New throughput: 195k RPS (simple and complex)
- Removed tiktoken row, updated tagline to overhead framing
@levleontiev levleontiev merged commit 95d1160 into main Mar 18, 2026
11 checks passed
@levleontiev levleontiev deleted the docs/issue-48-benchmark-numbers branch March 18, 2026 18:55
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

docs: update Performance section with two-host c7i.xlarge benchmark numbers

1 participant