Add bf16 MLA decode kernel for gqa_ratio=64, qseqlen=1 (non-persistent)#2729
Merged
fangche123 merged 8 commits intomainfrom Apr 16, 2026
Merged
Add bf16 MLA decode kernel for gqa_ratio=64, qseqlen=1 (non-persistent)#2729fangche123 merged 8 commits intomainfrom
fangche123 merged 8 commits intomainfrom