Skip to content
Closed
No due date
Closed Apr 3, 2026

Model Capability Tiering — probe model size, hardware EP, and inference latency at startup to classify into Basic/Moderate/Strong tiers. Agent adapts strategy per tier: skip LLM for Basic, use LLM for relevance filtering in Moderate, enable multi-turn reasoning in Strong.

100% complete

List view

    There are no open issues in this milestone

    Add issues to milestones to help organize your work for a particular release or project. Find and add issues with no milestones in this repo.