Skip to content

[BugFix][AMD][Quantization] Fix torch.compile issue where wvSplitKQ not being called when it should when using quantized FP8 model#22281

Merged
mgoin merged 9 commits intovllm-project:mainfrom
rasmith:ransmith_fix_rocm_per_tensor_w8a8_scaled_mm
Aug 22, 2025
Merged

[BugFix][AMD][Quantization] Fix torch.compile issue where wvSplitKQ not being called when it should when using quantized FP8 model#22281
mgoin merged 9 commits intovllm-project:mainfrom
rasmith:ransmith_fix_rocm_per_tensor_w8a8_scaled_mm

Commits

Commits on Aug 4, 2025

Commits on Aug 15, 2025

Commits on Aug 19, 2025