-
Notifications
You must be signed in to change notification settings - Fork 387
Pull requests: NVIDIA/Model-Optimizer
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Quantization] Saturate NVFP4 export FP8 scale cast to avoid NaN
cherry-pick-0.44.0
After code freeze, cherry-pick to release branch for next rc (bulk update). Only for bug fixes / doc
#1397
opened May 6, 2026 by
cjluo-nv
Collaborator
Loading…
2 tasks done
[Recipes][LLM PTQ] Add nvfp4 MSE+FP8-cast-KV recipes (experts_only / mlp_only) + --recipe in example scripts
#1391
opened May 4, 2026 by
cjluo-nv
Collaborator
Loading…
2 of 4 tasks
[Quantization] Fused Triton kernel for NVFP4 FP8 scale sweep search
#1387
opened May 4, 2026 by
cjluo-nv
Collaborator
Loading…
3 tasks done
Add unit test for checking any leak of temporary augmented onnx files, on exception during ONNX INT4 AWQ quantization
#1383
opened May 3, 2026 by
vishalpandya1990
Contributor
Loading…
fixes for fused moe (qwen3.6, GLM5.1 + MSE calibration
#1382
opened May 2, 2026 by
Fridah-nv
Contributor
Loading…
feat(launcher): add DFlash support for DeepSeek-V4-Flash target model
#1379
opened Apr 30, 2026 by
ChenhanYu
Collaborator
Loading…
Use trtexec_safe on safety platforms when using remoteAutoTuning
#1378
opened Apr 30, 2026 by
dthienan-nv
Contributor
Loading…
Add Nemotron-3-Nano-30B-A3B-BF16 e2e tutorial: Prune + Distill + Quantize + Nemo Evaluator + vLLM deployment
#1376
opened Apr 30, 2026 by
kevalmorabia97
Collaborator
•
Draft
Support Mixed precision & Static MSE PTQ in MCore export; Nemotron Super v3 NVFP4 recipe
#1363
opened Apr 28, 2026 by
jenchen13
Contributor
Loading…
[SKILL.md Chore] make .agents/ the cannonical agent-skills location
#1362
opened Apr 28, 2026 by
shljessie
Loading…
Add pre-built evaluation recipes for common benchmarks
#1357
opened Apr 27, 2026 by
kaix-nv
Contributor
Loading…
[OMNIML-4021]: align local JSONL loading with HF datasets path + keep original behaviour
#1345
opened Apr 24, 2026 by
shengliangxu
Collaborator
Loading…
3 tasks done
[OMNIML-3934] Guidelines and precommit hook for pydantic backward compatbility
#1333
opened Apr 23, 2026 by
jenchen13
Contributor
Loading…
[Refactor] speculative decoding: use mto config subsystem
#1328
opened Apr 23, 2026 by
h-guo18
Contributor
Loading…
Quantize lm_head + embedding for Nemotron-H, add NVFP4 W4A16 recipe
#1327
opened Apr 22, 2026 by
ajrasane
Contributor
Loading…
3 of 5 tasks
Fix NVFP4 quantization for Qwen3.x MoE models (4 silent-failure bugs)
#1323
opened Apr 22, 2026 by
erictinkeredapps
Loading…
Previous Next
ProTip!
Exclude everything labeled
bug with -label:bug.