Skip to content

Do online fp8 quantization while loading weights instead of in process_weights_after_loading, reducing memory overhead#17945

Open
fxmarty-amd wants to merge 17 commits intosgl-project:mainfrom
fxmarty-amd:online-fp8-quantization-loader
Open

Do online fp8 quantization while loading weights instead of in process_weights_after_loading, reducing memory overhead#17945
fxmarty-amd wants to merge 17 commits intosgl-project:mainfrom
fxmarty-amd:online-fp8-quantization-loader

Commits

Commits on Jan 28, 2026

Commits on Jan 29, 2026

Commits on Jan 30, 2026

Commits on Feb 4, 2026