Optimize rope_deltas propagation logic in Qwen2.5-VL #41176

Xqle · 2025-09-26T10:18:15Z

What does this PR do?

This PR fixes the propagation of rope_deltas in the Qwen2.5-VL model during generation.

Currently, the forward() method of Qwen2_5_VLForConditionalGeneration accepts rope_deltas as an argument, but the value is never passed to the underlying Qwen2_5_VLModel. As a result, users providing rope_deltas directly to forward() would see no effect.

Modifications

Qwen2_5_VLForConditionalGeneration.forward() now passes rope_deltas to Qwen2_5_VLModel.forward().
Qwen2_5_VLModel.forward() now accepts rope_deltas and updates its internal state accordingly.
Updates prepare_inputs_for_generation() to store calculated rope_deltas
in model_inputs, aligning its handling with position_ids.

Impact

Users can now explicitly provide rope_deltas during generation and have them correctly applied.
Ensures consistency between prefill and decoding phases in multi-modal generation.
Aligns the API behavior with documentation.

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline,
Pull Request section?
Was this discussed/approved via a Github issue or the forum? Please add a link
to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

…econd_per_grid_ts

- Forward rope_deltas from Qwen2_5_VLForConditionalGeneration to Qwen2_5_VLModel - Update Qwen2_5_VLModel to accept rope_deltas and store internally - Refactor prepare_inputs_for_generation to unify rope_deltas handling - Ensure that passing rope_deltas in forward() now correctly affects position_ids calculation This fixes an issue where passing rope_deltas directly to the model's forward() had no effect, which could lead to inconsistencies between pre-fill generation and manual forward calls.

Rocketknight1 · 2025-09-26T11:46:02Z

cc @zucchini-nlp for VLMs, @ArthurZucker for rope

…5-vl

github-actions · 2025-09-26T16:05:59Z

[For maintainers] Suggested jobs to run (before merge)

run-slow: qwen2_5_vl

Xqle added 2 commits September 25, 2025 08:01

Fix: align Qwen2.5-VL inference rope index with training by passing s…

6810342

…econd_per_grid_ts

Xqle changed the title ~~Optimize rope_deltas propagation logic in Qwen2.5-VL ✅~~ Optimize rope_deltas propagation logic in Qwen2.5-VL Sep 26, 2025

Remove white space from blank line.

1dc3002

Merge branch 'huggingface:main' into fix-propagate-rope-deltas-qwen2.…

5db8725

…5-vl

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Optimize rope_deltas propagation logic in Qwen2.5-VL #41176

Optimize rope_deltas propagation logic in Qwen2.5-VL #41176

Xqle commented Sep 26, 2025

Uh oh!

Rocketknight1 commented Sep 26, 2025

Uh oh!

github-actions bot commented Sep 26, 2025

Uh oh!

Uh oh!

Optimize rope_deltas propagation logic in Qwen2.5-VL #41176

Are you sure you want to change the base?

Optimize rope_deltas propagation logic in Qwen2.5-VL #41176

Conversation

Xqle commented Sep 26, 2025

What does this PR do?

Before submitting

Who can review?

Uh oh!

Rocketknight1 commented Sep 26, 2025

Uh oh!

github-actions bot commented Sep 26, 2025

Uh oh!

Uh oh!