Do online `fp8` quantization while loading weights instead of in `process_weights_after_loading`, reducing memory overhead #17945

Open

fxmarty-amd wants to merge 17 commits into sgl-project:main from fxmarty-amd:online-fp8-quantization-loader

Commits on Jan 28, 2026

online quantization during weight loading
fxmarty-amd
committed

Commits on Jan 29, 2026

Commits on Jan 30, 2026

Commits on Feb 4, 2026

address constant comment
fxmarty-amd
committed