Skip to content

[Quantization] Add ModelOpt NVFP4 W4A16 (4-bit weights, fp16/bf16 activations) support#41769

Merged
pavanimajety merged 19 commits into
vllm-project:mainfrom
juhi10071998:w4a16_modelopt_support
May 9, 2026
Merged

[Quantization] Add ModelOpt NVFP4 W4A16 (4-bit weights, fp16/bf16 activations) support#41769
pavanimajety merged 19 commits into
vllm-project:mainfrom
juhi10071998:w4a16_modelopt_support

Commits

Commits on May 6, 2026