
Pull requests: ml-explore/mlx-lm


Pull requests list

Add TurboQuant KV cache compression (3-bit, 4.6x)
#1067 opened Mar 28, 2026 by arozanov
4 of 6 tasks
Add LongCat Next
#1057 opened Mar 26, 2026 by kernelpool
Fix IndexError in CacheOrder.pop() on empty cache
#1054 opened Mar 25, 2026 by lyonsno Draft
2 tasks done
fix tokenizer regex issue with Mistral-based models
#1049 opened Mar 23, 2026 by amanning3390
Fix prompt cache leak between conversations
#1039 opened Mar 22, 2026 by kernelpool
feat: configurable KVCache step size and pre-allocation
#1038 opened Mar 22, 2026 by Thump604
5 tasks
Add Mistral Small 4 (119B MoE) support via mistral4.py
#1037 opened Mar 21, 2026 by ProducerGuy
5 tasks done
Fuse gate/up expert projections in SwitchGLU
#1032 opened Mar 21, 2026 by Thump604
4 tasks
Fix CacheDataset.itemlen returning wrong length
#1029 opened Mar 20, 2026 by eyupcanakman
Fix A_log precision in mamba.py
#1028 opened Mar 20, 2026 by eyupcanakman
Fix SSM dt clamp default for Nemotron-H
#1026 opened Mar 20, 2026 by kernelpool
Feature/slem with context aware
#1025 opened Mar 19, 2026 by krzysiekfonal