
[Bugfix] Enable attn quantization of Llama-4 by correctly permuting scales for rope (int8, fp8) #34243

Merged
mgoin merged 3 commits into vllm-project:main from eldarkurtic:fix-attn-quant-llama4
Feb 11, 2026

Commits

Commits on Feb 10, 2026