Skip to content

Add llmcompressor fp8 kv-cache quant (per-tensor and per-attn_head)#30141

Merged
LucasWilkinson merged 25 commits intovllm-project:mainfrom
eldarkurtic:expand-static-scaled-fp8-quant
Jan 22, 2026
Merged

Add llmcompressor fp8 kv-cache quant (per-tensor and per-attn_head)#30141
LucasWilkinson merged 25 commits intovllm-project:mainfrom
eldarkurtic:expand-static-scaled-fp8-quant

Commits

Commits on Jan 22, 2026