Skip to content

Conversation

@MohammedTaherMcW
Copy link

Ticket

Link to JIRA Issue

Problem description

Enabled Support for Qwen2.5-VL-7B-Instruct Model.

What's changed

  • Added support for Qwen2.5-VL-7B-Instruct
  • Updated model_config.py to support Qwen2.5-VL-7B-Instruct.
  • Updated load_checkpoints.py to support Qwen2.5-VL-7B-Instruct weight loading.
  • Added new submodules to support the Qwen-VL 2.5 vision model.
  • Modified the text MLP to avoid L1 buffer issue for Qwen

Note

No code from tt-transformers has been re-used in experimental; all files in experimental are specifically tailored for the Qwen 2.5 VL model.

Checklist

@MohammedTaherMcW MohammedTaherMcW self-assigned this Aug 7, 2025
@MohammedTaherMcW MohammedTaherMcW force-pushed the mcw/qwen_vl_7b/branch_1_experimental branch from ec319c2 to 35ca804 Compare August 12, 2025 10:05
@MohammedTaherMcW MohammedTaherMcW force-pushed the mcw/qwen_vl_7b/branch_1_experimental branch from 3fd6ff9 to 1156718 Compare August 12, 2025 17:30
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants