Fix gptoss_from_pretrained to correctly load HuggingFace weights by dfalbel · Pull Request #4 · mlverse/minhub

dfalbel · 2026-02-03T17:08:38Z

Summary

Update gptoss_normalize_config to map HuggingFace config keys (num_local_experts → num_experts, num_experts_per_tok → experts_per_token, nested rope_scaling) to internal names
Rewrite gptoss_hf_weights_remap to use underscore suffix (_blocks/_scales) for MXFP4 weight detection, remap HF parameter names to model parameter names, and concatenate separate q/k/v projections into combined qkv tensors

🤖 Generated with Claude Code

- Update gptoss_normalize_config to map HF config keys (num_local_experts, num_experts_per_tok, nested rope_scaling) to internal names - Rewrite gptoss_hf_weights_remap to: - Use underscore suffix (_blocks/_scales) for MXFP4 weight detection - Remap HF parameter names to model parameter names - Concatenate separate q/k/v projections into combined qkv tensors Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

dfalbel and others added 3 commits February 3, 2026 14:07

Add dotty to Imports

195213a

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

++

12faf36

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix gptoss_from_pretrained to correctly load HuggingFace weights#4

Fix gptoss_from_pretrained to correctly load HuggingFace weights#4
dfalbel wants to merge 3 commits intomainfrom
fix-gptoss-from-pretrained

dfalbel commented Feb 3, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

dfalbel commented Feb 3, 2026

Summary

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant