
Fix A_log precision in mamba.py #1028

Closed

eyupcanakman wants to merge 1 commit into ml-explore:main from eyupcanakman:fix/mamba-alog-float32-565


Conversation

@eyupcanakman
Contributor

Fixes #565.

`mamba.py` computes `mx.exp(self.A_log)` without first casting to float32. When the model is loaded in bf16, the exponential loses precision and log-probs diverge from HuggingFace. `mamba2.py`, `plamo2.py`, and `gated_delta.py` all cast `A_log` to float32 at the usage site; this PR applies the same pattern here.
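The effect described above can be sketched outside MLX. The snippet below is illustrative only (it is not the PR's code): it uses NumPy with `float16` standing in for bf16, and shows that computing the exponential in the low-precision dtype adds rounding error on top of the storage loss, while casting to float32 at the usage site avoids that extra loss.

```python
import numpy as np

# Hypothetical stand-in for the mamba.py situation: a log-parameter
# stored in low precision (float16 here, bf16 in the real model).
true_A = 8.0
A_log = np.float16(np.log(true_A))          # parameter as stored

A_lowp = np.exp(A_log)                      # exp computed/rounded in float16
A_cast = np.exp(A_log.astype(np.float32))   # cast to float32 first (the fix)

print(float(A_lowp), float(A_cast))
```

The cast cannot undo the error already baked into the stored `A_log`, but the cast-first result stays strictly closer to the true value because no additional rounding happens after the exponential.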

@angeloskath
Member

`A_log` is actually stored in float32, so there is no need to cast. Is there a saved model with this issue that you can point to? Given that #565 is also stale and potentially unrelated, I will close this; if you do encounter a problem, file an issue and we can reopen this if needed.


Successfully merging this pull request may close these issues.

Numerical instability with BF16 models
