Skip to content

fix memory for online fp8 quantization with streaming weight load#31914

Merged
mgoin merged 1 commit intovllm-project:mainfrom
vkuzo:20260107_streaming_quant_memory_fix
Feb 2, 2026
Merged

fix memory for online fp8 quantization with streaming weight load#31914
mgoin merged 1 commit intovllm-project:mainfrom
vkuzo:20260107_streaming_quant_memory_fix

Commits

Commits on Jan 30, 2026