Skip to content

Pull requests: AISBench/benchmark

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[feature] [sub feature 2] Dependency for qwen image edit run feature
#151 opened Feb 13, 2026 by SJTUyh Loading…
1 of 15 tasks
[feature] [sub feature 1] Support model judge in evaluation feature
#149 opened Feb 13, 2026 by SJTUyh Loading…
1 of 15 tasks
【TEST】补充math和agieval数据集的冒烟用例 test-cases
#145 opened Feb 11, 2026 by GaoHuaZhang Loading…
1 of 15 tasks
local eval add mindformers model
#110 opened Jan 15, 2026 by muqing-li Loading…
1 of 15 tasks
ProTip! What’s not been updated in a month: updated:<2026-01-18.