Add Experimental Support for Gemma-3-4b-it #12

MohammedTaherMcW · 2025-08-06T09:00:44Z

Ticket

Problem description

Enable Support for Gemma-3-4b-it Model.

What's changed

Added support for gemma-3-4b-it model
Updated model_config.py to support gemma-3-4b-it , including end-of-sequence (EoS) token handling.
Updated load_checkpoints.py to support gemma-3-4b-it weight loading.
Modified apply_scaling logic to handle both LLaMA and gemma-3-4b-it model.

Checklist

arginugaTT · 2025-08-19T15:50:57Z

Hi @MohammedTaherMcW

Please add your accuracy tests symlinked to tests/nightly/single_card/model_name and add the entry in .github/workflows/fast-dispatch-full-regressions-and-models-impl.yaml

create an entry for code owners of this model as pavle petrovic: .github/CODEOWNERS

then run the frequent model test workflow and attach to the ticket.

For reference: https://github.com/tenstorrent/tt-metal/pull/20690/files

jschuhmacher · 2025-08-26T15:03:12Z

For visibility, this was merged as tenstorrent#26924

MohammedTaherMcW force-pushed the mcw/gemma_3_4b/pr_1_experimental branch 3 times, most recently from c079a67 to 56513b8 Compare August 7, 2025 19:13

jennychristopher force-pushed the mcw/gemma_3_4b/pr_1_experimental branch from fcd2325 to fdd2f1b Compare August 11, 2025 06:16

MohammedTaherMcW force-pushed the mcw/gemma_3_4b/pr_1_experimental branch from f8ca67a to 3fcf34b Compare August 11, 2025 09:21

jennychristopher requested a review from willwray August 12, 2025 17:59

jennychristopher mentioned this pull request Aug 15, 2025

Gemma3-4B - asterisk * token unexpectedly appears once around 1200 tokens are generated properly tenstorrent/tt-metal#26908

Closed

MohammedTaherMcW and others added 5 commits August 19, 2025 17:13

Add Base commit for Gemma3-4b-it

143eb8c

Add Support for Gemma-3-4b-it

9d2c39c

Fix Vision Load checkpoints for Gemma-3-4b-it

70e66bd

Fix Sliding Window logic

5075a3a

Rebase Gemma3-4B experimental setup

da70e34

MohammedTaherMcW force-pushed the mcw/gemma_3_4b/pr_1_experimental branch from fc60390 to da70e34 Compare August 19, 2025 13:11

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add Experimental Support for Gemma-3-4b-it #12

Add Experimental Support for Gemma-3-4b-it #12

Uh oh!

MohammedTaherMcW commented Aug 6, 2025

Uh oh!

arginugaTT commented Aug 19, 2025

Uh oh!

jschuhmacher commented Aug 26, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Add Experimental Support for Gemma-3-4b-it #12

Are you sure you want to change the base?

Add Experimental Support for Gemma-3-4b-it #12

Uh oh!

Conversation

MohammedTaherMcW commented Aug 6, 2025

Ticket

Problem description

What's changed

Checklist

Uh oh!

arginugaTT commented Aug 19, 2025

Uh oh!

jschuhmacher commented Aug 26, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants