Skip to content

Conversation

@jennychristopher
Copy link

Ticket

tenstorrent#25398

Problem description

TT-Transformers doesn't have support for DeepSeek-R1-Distill-Qwen-14B model

What's changed

Describe the approach used to solve the problem.
Summarize the changes made and its impact.

Checklist

@jennychristopher jennychristopher force-pushed the deepseek_distill_qwen14b_pr1 branch from 3e34e01 to aa63d55 Compare August 1, 2025 12:09
@willwray
Copy link

willwray commented Aug 5, 2025

This is good to go, thanks; I'll open a PR to TT, once a new conflict is resolved.

The conflict was highlighted in the web UI, blocking merge.
I tried the web UI conflict editor to fix the file-level conflicts, but it creates a merge commit that includes all changes to main.
I'll fixup manually, just the file, rebase then force push and open the PR.

@willwray
Copy link

willwray commented Aug 5, 2025

I fixed the conflict and submitted the TT PR from a new branch

tenstorrent#26282

@MohammedTaherMcW MohammedTaherMcW force-pushed the deepseek_distill_qwen14b_pr1 branch from 6b28ac1 to 5a061e3 Compare August 25, 2025 02:40
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants