Commit 3cd31fd
authored
Move tasks to individual files (#1016)
* use inspect-ai to evaluate aime25 and gsm8k
* revert file
* working for 3 tasks
* parallel evals of tasks
* adds gpqa diamond to inspect
* move tasks to individual files
* move tasks to individual files
* enable extended tasks as well
* run precomit hook
* fix mkqa
* chaange extended suite to lighteval
* chaange extended suite to lighteval
* add metdata to tasks
* add metdata to tasks
* remove license notice and put docstring on top of file
* homogenize tags
* add docstring for all multilingual tasks
* add docstring for all multilingual tasks
* add name and dataset to metadata
* use TASKS_TABLE for multilingual tasks
* use TASKS_TABLE for default tasks
* use TASKS_TABLE for default tasks
* loads all tasks correclty
* move community tasks to default tasks and update doc
* move community tasks to default tasks and update doc
* revert uneeded changes
* fix doc build
* fix doc build
* remove custom tasks and let user decide if loading multilingual tasks
* load-tasks multilingual fix
* update doc
* remove uneeded file
* update readme
* update readme
* update readme
* fix test
* add back the custom tasks
* add back the custom tasks
* fix tasks
* fix tasks
* fix tasks
* fix tests
* fix tests1 parent bf8b547 commit 3cd31fd
File tree
331 files changed
+24603
-28548
lines changed- community_tasks
- docs/source
- examples
- custom_models
- src/lighteval
- metrics
- imports
- utils
- tasks
- extended
- multilingual
- tasks
- tasks
- hle
- ifbench
- ifeval
- lcb
- mix_eval
- mt_bench
- olympiade_bench
- tiny_benchmarks
- utils
- tests
- reference_details
- SmolLM2-1.7B-Instruct-transformers
- SmolLM2-1.7B-Instruct-vllm
- reference_scores
- slow_tests
- unit
- metrics
- pipeline
- tasks
Some content is hidden
Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.
331 files changed
+24603
-28548
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
25 | 25 | | |
26 | 26 | | |
27 | 27 | | |
| 28 | + | |
| 29 | + | |
| 30 | + | |
28 | 31 | | |
29 | 32 | | |
30 | 33 | | |
| |||
39 | 42 | | |
40 | 43 | | |
41 | 44 | | |
42 | | - | |
| 45 | + | |
| 46 | + | |
| 47 | + | |
| 48 | + | |
43 | 49 | | |
44 | 50 | | |
45 | 51 | | |
| |||
62 | 68 | | |
63 | 69 | | |
64 | 70 | | |
65 | | - | |
| 71 | + | |
66 | 72 | | |
67 | 73 | | |
68 | 74 | | |
| |||
This file was deleted.
This file was deleted.
This file was deleted.
This file was deleted.
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
2 | 2 | | |
3 | 3 | | |
4 | 4 | | |
5 | | - | |
6 | | - | |
7 | | - | |
8 | | - | |
9 | | - | |
10 | | - | |
11 | | - | |
12 | | - | |
13 | | - | |
14 | | - | |
15 | | - | |
16 | | - | |
17 | | - | |
18 | | - | |
19 | | - | |
20 | | - | |
21 | | - | |
22 | | - | |
23 | | - | |
24 | | - | |
25 | | - | |
26 | | - | |
| 5 | + | |
27 | 6 | | |
28 | 7 | | |
29 | | - | |
| 8 | + | |
30 | 9 | | |
31 | 10 | | |
32 | 11 | | |
33 | 12 | | |
34 | 13 | | |
35 | | - | |
| 14 | + | |
| 15 | + | |
36 | 16 | | |
37 | 17 | | |
38 | 18 | | |
| |||
135 | 115 | | |
136 | 116 | | |
137 | 117 | | |
138 | | - | |
| 118 | + | |
139 | 119 | | |
140 | 120 | | |
141 | 121 | | |
142 | 122 | | |
143 | | - | |
| 123 | + | |
144 | 124 | | |
145 | 125 | | |
146 | 126 | | |
| |||
169 | 149 | | |
170 | 150 | | |
171 | 151 | | |
172 | | - | |
| 152 | + | |
173 | 153 | | |
174 | 154 | | |
175 | 155 | | |
| |||
179 | 159 | | |
180 | 160 | | |
181 | 161 | | |
182 | | - | |
| 162 | + | |
183 | 163 | | |
184 | 164 | | |
185 | 165 | | |
186 | 166 | | |
187 | 167 | | |
188 | | - | |
| 168 | + | |
189 | 169 | | |
190 | 170 | | |
0 commit comments