dynamicheart/llm-eval-viewer

LLM Eval Viewer

Chinese documentation

This project was built entirely with AI assistance (Baidu Comate IDE + Claude Opus 4.6).

LLM Eval Viewer is a lightweight web tool for visualizing LLM evaluation results.
It currently supports result formats generated by evalscope.

Live Demo:
https://dynamicheart.github.io/llm-eval-viewer/


Features

  • Multi-format support: Evalscope Predictions / Reviews evaluation results
  • Directory browsing: Select a directory to auto-scan its structure, quickly switch between experiments and datasets (Chrome/Edge)
  • Statistics & distribution: Token histogram, result/finish-reason distribution, and per-dataset accuracy, all interactive with click-to-filter
  • Reasoning support: Displays reasoning content marked as [R]; Text and Reasoning can be viewed separately
  • Dark mode: Light / Dark / Auto theme with system preference detection
  • cURL export: Generate cURL commands from request details for quick API replay
  • i18n: English and Chinese language support
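
As an illustration of the cURL export feature listed above, here is a minimal sketch of how a cURL command could be assembled from request details. The `RequestDetails` shape and the `shellQuote` and `toCurl` names are hypothetical and chosen only for this example; the viewer's actual data model and implementation may differ.

```typescript
// Hypothetical shape of the request details; the viewer's real data model may differ.
interface RequestDetails {
  url: string;
  method?: string;                  // assumed to default to POST for chat-style APIs
  headers?: Record<string, string>;
  body?: unknown;                   // string bodies are sent as-is, others are JSON-encoded
}

// Wrap a value in single quotes for a POSIX shell, escaping embedded single quotes.
function shellQuote(value: string): string {
  return "'" + value.replace(/'/g, "'\\''") + "'";
}

// Build a multi-line cURL command from the request details.
function toCurl(req: RequestDetails): string {
  const method = req.method ?? "POST";
  const parts: string[] = ["curl -X " + method + " " + shellQuote(req.url)];

  const headers = req.headers ?? {};
  for (const name of Object.keys(headers)) {
    parts.push("-H " + shellQuote(name + ": " + headers[name]));
  }

  if (req.body !== undefined) {
    const data = typeof req.body === "string" ? req.body : JSON.stringify(req.body);
    parts.push("--data-raw " + shellQuote(data));
  }

  // Join with a backslash-newline so each option sits on its own line.
  return parts.join(" \\\n  ");
}

// Example call with a placeholder endpoint and payload.
const exampleCommand = toCurl({
  url: "http://localhost:8000/v1/chat/completions",
  headers: { "Content-Type": "application/json" },
  body: { model: "example-model", messages: [{ role: "user", content: "Hello" }] },
});
```

Single-quoting every argument keeps JSON payloads intact when pasted into a POSIX shell; a different quoting scheme would be needed for Windows `cmd`.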

Example Files (evalscope)

The following example files can be used to try the viewer locally or in the live demo:


Screenshots

Reviews View (Dark Mode)

[Screenshot: Reviews view in dark mode]

Predictions View (Light Mode)

[Screenshot: Predictions view in light mode]


Development

cd llm-eval-viewer
npm install
npm run dev

Build

npm run build

Built with AI

This project was developed entirely through AI-assisted programming using Baidu Comate IDE with Claude Opus 4.6 as the agent model. From architecture design to implementation, all code was generated and iterated via human-AI collaboration.

License

MIT
