🌍 AppWorld: A Controllable World of Apps and People for Benchmarking Function Calling and Interactive Coding Agent, ACL'24 Best Resource Paper.
-
Updated
Oct 19, 2025 - Python
🌍 AppWorld: A Controllable World of Apps and People for Benchmarking Function Calling and Interactive Coding Agent, ACL'24 Best Resource Paper.
Separation of planning concerns in ReAct-style LLM agents. Planner fine-tuning on synthetic trajectories.
a suite of finetuned LLMs for atomically precise function calling 🧪
[ICML DMLR 2024] Repo that contains code for the paper titled: "Class-Level Code Generation from Natural Language Using Iterative, Tool-Enhanced Reasoning over Repository".
Benchmarking Large Language Models for Materials Science Tools
🌍 Leaderboard Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL2024
Code Base for Multi-Tool Usage for black box api access
A vision-centric multimodal agent framework that learns robust, step-wise tool reasoning through trajectory supervision and preference optimization
An autonomous research agent that synthesizes detailed reports by combining dynamic planning, unsupervised topic discovery (PCA/KMeans) HyDe (hypothetical doc embeddings), and a self-correcting reflexion loop.
An advanced conversational AI system with multi-model support, RAG capabilities, dynamic tool usage, and persistent memory.
A comprehensive framework for developing AI systems that automatically utilize MCP (Model Context Protocol) tools while maintaining authenticity and preventing behavioral drift
Enterprise agent/LLM platform with layered governance (RBAC, audit, policy-as-code); Azure OpenAI and RAG ready.
Add a description, image, and links to the tool-usage topic page so that developers can more easily learn about it.
To associate your repository with the tool-usage topic, visit your repo's landing page and select "manage topics."