Skip to content

[Quantization] Add ModelOpt NVFP4 W4A16 (4-bit weights, fp16/bf16 activations) support#41769

Merged
pavanimajety merged 19 commits into
vllm-project:mainfrom
juhi10071998:w4a16_modelopt_support
May 9, 2026
Merged

[Quantization] Add ModelOpt NVFP4 W4A16 (4-bit weights, fp16/bf16 activations) support#41769
pavanimajety merged 19 commits into
vllm-project:mainfrom
juhi10071998:w4a16_modelopt_support

Merge branch 'main' into w4a16_modelopt_support

aa530b2
Select commit
Loading
Failed to load commit list.
Meta CodeSync / Meta Internal-Only Changes Check succeeded May 9, 2026 in 0s

There is no internal Diff connected, this can be merged now