Commit 078ff68
🚨 Move rotary_partial_emb to RopeParams and delete unnecessary code 🔪 (#42255)
* tmp
* batch push
* maybe better pop and break, and we'll have one theta per config in the rope dict
* update a few models?
* fix tests that are easy first
* don't overwrite if already present!!!
* partial rotary factor
* more fixes to the god of fixes
* setdefault
* fix copies
* Update src/transformers/modeling_rope_utils.py
Co-authored-by: Anton Vlasjuk <73884904+vasqu@users.noreply.github.com>
* Update src/transformers/models/efficientloftr/configuration_efficientloftr.py
Co-authored-by: Anton Vlasjuk <73884904+vasqu@users.noreply.github.com>
* attempt one
* update all models
* fix tests
* fix tests
* oops
* fix slow tests with nested rope models
* fix copies
* deal with circular import and move the mixin to base config class
* fix copies
* fix a few tests
* update the migration guide
---------
Co-authored-by: Anton Vlasjuk <73884904+vasqu@users.noreply.github.com>
1 parent 554fb40 commit 078ff68
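The "pop", "setdefault", and "don't overwrite if already present" notes in the commit message suggest a migration pattern along the following lines. This is only a minimal sketch under stated assumptions, not the code from this commit: the helper name `migrate_rope_kwargs` is hypothetical, and the dictionary keys `rope_parameters`, `rope_theta`, `rope_type`, and `partial_rotary_factor` are assumed for illustration.

```python
from typing import Any, Dict


def migrate_rope_kwargs(kwargs: Dict[str, Any], default_theta: float = 10000.0) -> Dict[str, Any]:
    """Hypothetical helper: fold legacy top-level RoPE kwargs into a single dict.

    The key names used here are assumptions made for illustration only.
    """
    # Copy any rope dict the caller already supplied so it is not mutated in place.
    rope_parameters = dict(kwargs.pop("rope_parameters", None) or {})

    # Pop the legacy top-level keys so they no longer live outside the rope dict.
    legacy_theta = kwargs.pop("rope_theta", default_theta)
    legacy_factor = kwargs.pop("partial_rotary_factor", None)

    # setdefault only writes when the key is missing, so values that are
    # already present in the rope dict are never overwritten.
    rope_parameters.setdefault("rope_type", "default")
    rope_parameters.setdefault("rope_theta", legacy_theta)
    if legacy_factor is not None:
        rope_parameters.setdefault("partial_rotary_factor", legacy_factor)

    kwargs["rope_parameters"] = rope_parameters
    return kwargs
```

With this sketch, a legacy call such as `migrate_rope_kwargs({"rope_theta": 500000.0, "partial_rotary_factor": 0.5})` ends up with both values inside the rope dict, while a value already set in the rope dict takes precedence over a conflicting top-level one.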
File tree
148 files changed (+1509 -2013 lines changed)
- src/transformers
- models
- apertus
- arcee
- aria
- bamba
- bitnet
- blt
- chameleon
- cohere2
- cohere
- csm
- cwm
- dbrx
- deepseek_v2
- deepseek_v3
- dia
- diffllama
- doge
- dots1
- efficientloftr
- emu3
- ernie4_5_moe
- ernie4_5
- evolla
- exaone4
- falcon_h1
- falcon
- flex_olmo
- fuyu
- gemma2
- gemma3n
- gemma3
- gemma
- glm4_moe
- glm4v_moe
- glm4v
- glm4
- glm
- gpt_neox_japanese
- gpt_neox
- gpt_oss
- granitemoehybrid
- granitemoe
- granite
- helium
- hunyuan_v1_dense
- hunyuan_v1_moe
- jetmoe
- kyutai_speech_to_text
- lfm2_moe
- lfm2
- llama4
- llama
- longcat_flash
- mimi
- minimax
- ministral
- mistral
- mixtral
- mllama
- modernbert_decoder
- modernbert
- moonshine
- moshi
- nanochat
- nemotron
- olmo2
- olmo3
- olmoe
- olmo
- persimmon
- phi3
- phi4_multimodal
- phimoe
- phi
- pixtral
- qwen2_5_omni
- qwen2_5_vl
- qwen2_moe
- qwen2_vl
- qwen2
- qwen3_moe
- qwen3_next
- qwen3_omni_moe
- qwen3_vl_moe
- qwen3_vl
- qwen3
- recurrent_gemma
- seed_oss
- smollm3
- stablelm
- starcoder2
- t5gemma
- vaultgemma
- zamba2
- tests
- utils