Use model's generation_config.json for default sampling parameters by eyupcanakman · Pull Request #1031 · ml-explore/mlx-lm

eyupcanakman · 2026-03-20T15:37:34Z

Fixes #140.

Models like Phi-4 ship a generation_config.json with sampling defaults (temperature, top_p, etc.), but mlx-lm previously read only eos_token_id from it. With this change, generate, chat, and server read these values and use them whenever the user does not specify an explicit override.

The priority chain is: user CLI arg > generation_config.json > hardcoded default.
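A minimal sketch of that priority chain, not the code in this PR. It assumes CLI arguments default to None so "not specified" can be told apart from an explicit value such as 0.0. The names `HARDCODED_DEFAULTS`, `load_generation_config`, and `resolve_sampling_param` are hypothetical, and the fallback values are placeholders rather than mlx-lm's actual defaults.

```python
import json
from pathlib import Path

# Placeholder fallbacks for illustration; mlx-lm's real defaults may differ.
HARDCODED_DEFAULTS = {"temperature": 0.0, "top_p": 1.0, "top_k": 0, "min_p": 0.0}


def load_generation_config(model_path):
    """Return the parsed generation_config.json, or {} if the file is absent."""
    path = Path(model_path) / "generation_config.json"
    if not path.is_file():
        return {}
    with open(path) as f:
        return json.load(f)


def resolve_sampling_param(name, cli_value, generation_config):
    """Apply: user CLI arg > generation_config.json > hardcoded default."""
    if cli_value is not None:
        # An explicit user value always wins, including 0.0.
        return cli_value
    config_value = generation_config.get(name)
    if config_value is not None:
        return config_value
    return HARDCODED_DEFAULTS[name]
```

Comparing against None rather than relying on truthiness matters here: an explicit `--temp 0.0` should still override a config value of 0.8.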

The server resolves defaults lazily per request so model hot-swapping picks up the new model's config correctly.
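To illustrate why lazy resolution matters for hot-swapping, here is a rough sketch. `ModelProviderSketch`, `handle_request`, and the `loader` callable are hypothetical stand-ins, and the `resolve_default` signature is an assumption (only the method name comes from the review discussion on this PR).

```python
class ModelProviderSketch:
    """Hypothetical stand-in for the server's model provider."""

    def __init__(self, loader):
        # loader: callable mapping a model name to its generation config dict.
        self._loader = loader
        self.model_name = None
        self.generation_config = {}

    def load(self, model_name):
        # Reload the config only when the requested model changes.
        if model_name != self.model_name:
            self.generation_config = self._loader(model_name)
            self.model_name = model_name

    def resolve_default(self, name, fallback):
        # Looked up at call time, so it reflects the currently loaded model.
        value = self.generation_config.get(name)
        return fallback if value is None else value


def handle_request(provider, body):
    """Resolve sampling parameters per request, after the model is loaded."""
    provider.load(body["model"])
    temperature = body.get("temperature")
    if temperature is None:
        temperature = provider.resolve_default("temperature", 0.0)
    return {"temperature": temperature}
```

If the defaults were instead captured once at server startup, a request for a second model would silently inherit the first model's sampling settings.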

Also adds --min-p and --top-k CLI args to chat.py (already present in generate.py) so generation_config values for those keys are not silently ignored.
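A sketch of what such flags could look like with argparse. The flag names come from the description above; the `default=None` sentinel, the help strings, and `build_parser` are assumptions for illustration and may not match chat.py.

```python
import argparse


def build_parser():
    """Hypothetical parser showing only the two sampling flags."""
    parser = argparse.ArgumentParser(description="Chat sampling options (sketch)")
    # default=None marks "not specified", so a generation_config.json value
    # can be applied without clobbering an explicit user choice.
    parser.add_argument("--min-p", type=float, default=None, help="Min-p sampling threshold")
    parser.add_argument("--top-k", type=int, default=None, help="Top-k sampling cutoff")
    return parser
```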

Thump604 · 2026-03-21T20:45:12Z

The three-tier priority (per-request > generation_config.json > hardcoded defaults) and the lazy resolve_default() on ModelProvider are well designed. One edge case: some HF configs set do_sample: true with temperature: 1.0, which effectively means "use default sampling." The current do_sample: false -> temp: 0.0 mapping is correct, but when do_sample: true is paired with temperature: 1.0, the code probably shouldn't inject the config's temperature, since 1.0 is just the HF default, not an intentional override.
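One way the suggested handling could look, as a sketch of the reviewer's proposal rather than the PR's current behavior. The function name is hypothetical, and returning None is an assumed convention meaning "fall through to the hardcoded default".

```python
def temperature_from_generation_config(config):
    """Return a temperature override, or None to fall through to the default."""
    do_sample = config.get("do_sample")
    if do_sample is False:
        # Greedy decoding was requested explicitly.
        return 0.0
    temperature = config.get("temperature")
    if temperature is None:
        return None
    if do_sample is True and temperature == 1.0:
        # 1.0 is the HF default, so treat it as "no intentional override".
        return None
    return temperature
```

The trade-off is that a model author who deliberately wants temperature 1.0 cannot be distinguished from one who left the HF default in place.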


eyupcanakman force-pushed the feat/generation-config-defaults-140 branch from a23f530 to f9bbfe3 · March 22, 2026 11:30

eyupcanakman wants to merge 1 commit into ml-explore:main from eyupcanakman:feat/generation-config-defaults-140
