Conversation

@MohammedTaherMcW MohammedTaherMcW commented Jul 29, 2025

Ticket

Link to GitHub Issue

Problem description

Enable support for the Gemma-3-1b-it model.

What's changed

  • Added support for the gemma-3-1b-it model
  • Updated model_config.py to support the BF16 data type for gemma-3-1b-it, including end-of-sequence (EoS) token handling and loading via Gemma3CausalLM.
  • Updated load_checkpoints.py to support gemma-3-1b-it weight loading.
  • Modified the apply_scaling logic to handle both LLaMA and gemma-3-1b-it models.
  • Added compute kernel config with HiFi4 and FP32 support.
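The apply_scaling change above could look roughly like the following minimal sketch. It assumes the LLaMA-3.1-style piecewise RoPE frequency scaling and a simple pass-through for gemma-3-1b-it (which ships without RoPE scaling); the function names, branch condition, and default constants are illustrative, not taken from the PR.

```python
import math

# Hypothetical sketch; constants follow the published LLaMA-3.1 scheme,
# the dispatch-by-model-name shape is an assumption about this PR.

def apply_scaling_llama(
    freq: float,
    scale_factor: float = 8.0,
    low_freq_factor: float = 1.0,
    high_freq_factor: float = 4.0,
    original_context_len: int = 8192,
) -> float:
    """LLaMA-3.1-style piecewise RoPE frequency scaling."""
    low_freq_wavelen = original_context_len / low_freq_factor
    high_freq_wavelen = original_context_len / high_freq_factor
    wavelen = 2 * math.pi / freq
    if wavelen < high_freq_wavelen:
        return freq                  # high-frequency band: unscaled
    if wavelen > low_freq_wavelen:
        return freq / scale_factor   # low-frequency band: fully scaled
    # smooth interpolation between the two bands
    smooth = (original_context_len / wavelen - low_freq_factor) / (
        high_freq_factor - low_freq_factor
    )
    return (1 - smooth) * freq / scale_factor + smooth * freq

def apply_scaling(freq: float, model_name: str) -> float:
    """Dispatch on model family: gemma-3-1b-it uses unscaled RoPE."""
    if model_name.startswith("gemma-3"):
        return freq
    return apply_scaling_llama(freq)
```

The point of the dispatch is that one code path can serve both model families without duplicating the rotary-embedding setup.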

Checklist

@MohammedTaherMcW MohammedTaherMcW changed the title google/gemma-3-1b-it Bringup Add experimental model bringup for google/gemma-3-1b-it Jul 31, 2025
@willwray
Thanks, reviewing.
To review the first commit, it would be useful to know the origin or 'provenance' of the new files, in order to compare them for TT-specific changes. We discussed this in last week's call, and it sounded like there is no clean-slate origin; but if you can say where the work started, that will help.

@jennychristopher jennychristopher force-pushed the mcw/gemma_3_1b/pr_1_experimental branch from d114559 to ade214f on August 4, 2025 15:48

willwray commented Aug 7, 2025

Thanks Mohammed,

I rebased on a new branch and submitted to TT with no changes:

tenstorrent#26438
tenstorrent#26439
tenstorrent#26440
tenstorrent#26441
