Skip to content

Pull requests: NVIDIA/Model-Optimizer

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Fix Conv->Relu->Concat Q/DQ insertion gap
#1398 opened May 6, 2026 by willg-nv Contributor Loading…
[Quantization] Saturate NVFP4 export FP8 scale cast to avoid NaN cherry-pick-0.44.0 After code freeze, cherry-pick to release branch for next rc (bulk update). Only for bug fixes / doc
#1397 opened May 6, 2026 by cjluo-nv Collaborator Loading…
2 tasks done
[Quantization] Fused Triton kernel for NVFP4 FP8 scale sweep search
#1387 opened May 4, 2026 by cjluo-nv Collaborator Loading…
3 tasks done
fixes for fused moe (qwen3.6, GLM5.1 + MSE calibration
#1382 opened May 2, 2026 by Fridah-nv Contributor Loading…
AutoQuant for VLM
#1381 opened May 1, 2026 by meenchen Contributor Draft
feat(launcher): add DFlash support for DeepSeek-V4-Flash target model
#1379 opened Apr 30, 2026 by ChenhanYu Collaborator Loading…
Use trtexec_safe on safety platforms when using remoteAutoTuning
#1378 opened Apr 30, 2026 by dthienan-nv Contributor Loading…
k25 dflash hardcode support
#1367 opened Apr 29, 2026 by h-guo18 Contributor Draft
Experiment: MXFP4 -> NVFP4 conversion MSE study (scratch)
#1364 opened Apr 28, 2026 by cjluo-nv Collaborator Draft
3 tasks
Enable runtime optimization
#1358 opened Apr 28, 2026 by grzegorz-k-karch Contributor Draft
Add pre-built evaluation recipes for common benchmarks
#1357 opened Apr 27, 2026 by kaix-nv Contributor Loading…
[minor] fixes for layerwise calib + MSE
#1344 opened Apr 24, 2026 by Fridah-nv Contributor Loading…
DSV4 dequant on the fly
#1341 opened Apr 24, 2026 by mxinO Contributor Draft
Update
#1338 opened Apr 23, 2026 by jingyu-ml Contributor Draft
[Refactor] speculative decoding: use mto config subsystem
#1328 opened Apr 23, 2026 by h-guo18 Contributor Loading…
Quantize lm_head + embedding for Nemotron-H, add NVFP4 W4A16 recipe
#1327 opened Apr 22, 2026 by ajrasane Contributor Loading…
3 of 5 tasks
Update the DMD2 at the first stage
#1326 opened Apr 22, 2026 by jingyu-ml Contributor Draft
ProTip! Exclude everything labeled bug with -label:bug.