NVIDIA / Model-Optimizer Public

Notifications You must be signed in to change notification settings
Fork 247
Star 1.9k

Code
Issues 64
Pull requests 68
Actions
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Security
Insights

Pull requests: NVIDIA/Model-Optimizer

Labels 27 Milestones 0

New pull request New

68 Open 451 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

Fixes for Megatron Expert Parallel, GroupedMLP and SequentialMLP

#831 opened Jan 30, 2026 by realAsma

Loading…

Latent MOE & Repeated MTP support for NemotronH; fix KV cache quant export

#830 opened Jan 29, 2026 by jenchen13

Loading…

Noeyy/add test cases for the newly added checkpoints on HF

#827 opened Jan 29, 2026 by noeyy-mino

Loading…

hardcode support for qwen3vl text only

#826 opened Jan 28, 2026 by h-guo18 • Draft

[Minor] fix: do not requantize the scales in FP8 scale sweep calibration

#825 opened Jan 28, 2026 by Fridah-nv

Loading…

Update on the QuantModule & DynamicModule to accept external forward

#824 opened Jan 28, 2026 by jingyu-ml

Loading…

[ONNX][Autocast] Minor bug fixes (AI-assisted)

#822 opened Jan 28, 2026 by galagam

Loading…

Support Kimi-K2.5 PTQ

#820 opened Jan 27, 2026 by Edwardf0t1 • Draft

Support MiniMax M2.1 (FP8 checkpoint)

#817 opened Jan 25, 2026 by cjluo-nv

Loading…

Fix mcore nvfp4 export for vllm

#816 opened Jan 24, 2026 by meenchen

Loading…

Drop '_pg_collection' in MBridge model config when checkpointing

#813 opened Jan 23, 2026 by AAnoosheh • Draft

Added column-major storage of weights and scales in INT4 quantization for model load time improvement in TRT-RTX

#811 opened Jan 23, 2026 by hthadicherla

Loading…

[2/4] Diffusion Quantized ckpt export

#810 opened Jan 23, 2026 by jingyu-ml

Loading…

2 of 4 tasks

add value info to the original tensor if it was directly to model output

#807 opened Jan 22, 2026 by YixuanSeanZhou • Draft

vllm fakequant reload with modelopt state for HF

#805 opened Jan 21, 2026 by kinjalpatel27

Loading…

Layerwise KD mode

#802 opened Jan 21, 2026 by AAnoosheh

Loading…

Add Megatron-Bridge pruning example scripts

#800 opened Jan 21, 2026 by kevalmorabia97

Loading…

1 of 2 tasks

[Do not merge] test int4 dequant kernel

#798 opened Jan 20, 2026 by cjluo-nv • Draft

GLM-4.7 MTP support

#792 opened Jan 16, 2026 by Edwardf0t1 • Draft

add local hessian calibration

#788 opened Jan 16, 2026 by Fridah-nv • Draft

Add Nemotron parse PTQ support

#786 opened Jan 15, 2026 by Edwardf0t1

Loading…

Finally FLUX NVFP4 quantization working

#782 opened Jan 14, 2026 by FurkanGozukara

Loading…

[For RL] Keep attrs after folding weight and fix empty extra state for Megatron

#779 opened Jan 14, 2026 by mxinO • Draft

Support multiple-batch input for autocast calibration.

#760 opened Jan 11, 2026 by byte-deve

Loading…

[draft] bug for MoE distributed parallelism

#752 opened Jan 8, 2026 by realAsma • Draft

Previous 1 2 3 Next

Previous Next

ProTip! Exclude everything labeled bug with -label:bug.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!