Skip to content

Add bf16 MLA decode kernel for gqa_ratio=64, qseqlen=1 (non-persistent)#2729

Merged
fangche123 merged 8 commits intomainfrom
chefang_mlaA16W16_nhead64_qseqlen1
Apr 16, 2026
Merged

Add bf16 MLA decode kernel for gqa_ratio=64, qseqlen=1 (non-persistent)#2729
fangche123 merged 8 commits intomainfrom
chefang_mlaA16W16_nhead64_qseqlen1

Commits

Commits on Apr 14, 2026

Commits on Apr 15, 2026

Commits on Apr 16, 2026