
lfm2: strip lm_head.weight for tied embeddings#1055

Draft
ykhrustalev wants to merge 1 commit into ml-explore:main from ykhrustalev:main

Conversation

@ykhrustalev
Contributor

No description provided.

The lfm2 model always uses tied embeddings (embed_tokens.as_linear),
but some checkpoints (e.g. LiquidAI/LFM2.5-350M) ship a separate
lm_head.weight in their safetensors. This causes load_weights to
raise "Received 1 parameters not in model: lm_head.weight".

Strip lm_head.weight in sanitize, matching the pattern already used
by llama, mixtral, and qwen3_moe.
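The proposed fix can be sketched as follows (a minimal stand-alone illustration: in mlx-lm, sanitize is a method on the model class, and the exact lfm2 implementation may differ in details):

```python
def sanitize(weights: dict) -> dict:
    # lfm2 always ties the output projection to the input embeddings
    # (embed_tokens.as_linear), so a checkpoint's separate lm_head.weight
    # is redundant and would trip load_weights' strict parameter check
    # ("Received 1 parameters not in model: lm_head.weight").
    weights.pop("lm_head.weight", None)
    return weights

# A checkpoint that ships the redundant tensor:
checkpoint = {"model.embed_tokens.weight": "<tensor>", "lm_head.weight": "<tensor>"}
cleaned = sanitize(checkpoint)
```

Dropping the key in sanitize is safe because the model never reads it: the output logits are always computed from the embedding matrix.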
@ykhrustalev
Contributor Author

cc @Blaizzy

Contributor

@Blaizzy left a comment


Could you share what model weights have lm_head?

Because this model shouldn't have an lm_head by default, since it uses tied word embeddings.
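For context, tied word embeddings mean one weight matrix serves both directions: token-id lookup on the way in, and projection to vocabulary logits on the way out. A framework-free sketch (the helper names `embed` and `output_logits` are hypothetical; mlx-lm expresses the output side as `embed_tokens.as_linear`):

```python
# One shared weight matrix W of shape (vocab, dim).
vocab, dim = 4, 3
W = [[float(i * dim + j) for j in range(dim)] for i in range(vocab)]

def embed(token_id: int) -> list[float]:
    # Input side: embedding lookup is a row of W.
    return W[token_id]

def output_logits(hidden: list[float]) -> list[float]:
    # Output side: W doubles as the lm_head, so logits are
    # dot products of the hidden state with every row of W.
    return [sum(h * w for h, w in zip(hidden, row)) for row in W]

h = embed(2)              # [6.0, 7.0, 8.0]
scores = output_logits(h) # one logit per vocabulary entry
```

Because no separate `lm_head` parameter exists in such a model, a checkpoint that still serializes one carries a tensor the model has no slot for.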

@ykhrustalev
Contributor Author

@Blaizzy you are right, the checkpoint I am using is a little different; I will check and come back.

@ykhrustalev ykhrustalev marked this pull request as draft March 25, 2026 21:57
