
[Model][Hardware][AMD]: Part 1/2 -> Enable e2e QK Norm + RoPE + KV Cache runtime fusion for Qwen3-30B-A3B on ROCM_AITER_FA, and ROCM_AITER_UNIFIED_ATTN #42749

Open
jhu960213 wants to merge 30 commits into vllm-project:main from jhu960213:jhu96/optimize-qwen30b-part1

Commits

Commits on May 13, 2026

Commits on May 15, 2026

Commits on May 25, 2026

Commits on May 26, 2026

Commits on May 27, 2026

Commits on May 28, 2026

Commits on May 29, 2026

Commits on Jun 3, 2026

Commits on Jun 4, 2026