An interactive, web-based leaderboard for evaluating Large Language Models (LLMs) on the FABLE benchmark, a comprehensive data-flow analysis benchmark built on procedural text.
FABLE measures LLMs' ability to perform data-flow analysis across multiple procedural domains: travel routes, plans, and recipes. This leaderboard provides an interactive interface for exploring model performance across analysis types and domains.
This leaderboard is based on the research presented in:
FABLE: A Novel Data-Flow Analysis Benchmark on Procedural Text for Large Language Model Evaluation
This project is licensed under the MIT License - see the LICENSE file for details.
If you use this leaderboard or the FABLE benchmark in your research, please cite:
@article{pallagani2025fable,
  title={FABLE: A Novel Data-Flow Analysis Benchmark on Procedural Text for Large Language Model Evaluation},
  author={Pallagani, Vishal and Gupta, Nitin and Aydin, John and Srivastava, Biplav},
  journal={arXiv preprint arXiv:2505.24258},
  year={2025}
}