[Bugfix] Skip bias tensors in online FP8 quantization pipeline #39962

Closed
r266-tech wants to merge 2 commits into vllm-project:main from r266-tech:fix/fp8-online-quant-skip-bias-v2

Commits