Pull requests: NVIDIA/Model-Optimizer
Fixes for Megatron Expert Parallel, GroupedMLP and SequentialMLP
#831
opened Jan 30, 2026 by
realAsma
Latent MOE & Repeated MTP support for NemotronH; fix KV cache quant export
#830
opened Jan 29, 2026 by
jenchen13
Noeyy/add test cases for the newly added checkpoints on HF
#827
opened Jan 29, 2026 by
noeyy-mino
[Minor] fix: do not requantize the scales in FP8 scale sweep calibration
#825
opened Jan 28, 2026 by
Fridah-nv
Update on the QuantModule & DynamicModule to accept external forward
#824
opened Jan 28, 2026 by
jingyu-ml
Added column-major storage of weights and scales in INT4 quantization for model load time improvement in TRT-RTX
#811
opened Jan 23, 2026 by
hthadicherla
add value info to the original tensor if it was directly to model output
#807
opened Jan 22, 2026 by
YixuanSeanZhou
Draft
Add Megatron-Bridge pruning example scripts
#800
opened Jan 21, 2026 by
kevalmorabia97
1 of 2 tasks
Support multiple-batch input for autocast calibration.
#760
opened Jan 11, 2026 by
byte-deve