refactor fp8.py online quant weight loading to use layerwise reload utils #33814

Closed
vkuzo wants to merge 1 commit into vllm-project:main from vkuzo:20260204_fp8_online_use_layerwise

Commits on Mar 24, 2026