Skip to content

ai4society/FABLE-Leaderboard

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

FABLE Leaderboard

An interactive web-based leaderboard for evaluating Large Language Models (LLMs) on the FABLE benchmark - a comprehensive data-flow analysis benchmark using procedural text.

🌐 View Live Leaderboard

Overview

FABLE measures LLMs' ability to perform data flow analysis tasks across multiple domains: travel routes, plans, and recipes. This leaderboard provides an interactive interface to explore model performance across different analysis types and domains.

Research Paper

This leaderboard is based on the research presented in:

FABLE: A Novel Data-Flow Analysis Benchmark on Procedural Text for Large Language Model Evaluation

License

This project is licensed under the MIT License - see the LICENSE file for details.

Citation

If you use this leaderboard or the FABLE benchmark in your research, please cite:

@article{pallagani2025fable,
  title={FABLE: A Novel Data-Flow Analysis Benchmark on Procedural Text for Large Language Model Evaluation},
  author={Pallagani, Vishal and Gupta, Nitin and Aydin, John and Srivastava, Biplav},
  journal={arXiv preprint arXiv:2505.24258},
  year={2025}
}

FABLE • Built with ❤️ by AI4Society