Skip to content

Merge branch 'main' into chefang_mlaA16W16_nhead64_qseqlen1

5c48515
Select commit
Loading
Failed to load commit list.
Merged

Add bf16 MLA decode kernel for gqa_ratio=64, qseqlen=1 (non-persistent) #2729

Merge branch 'main' into chefang_mlaA16W16_nhead64_qseqlen1
5c48515
Select commit
Loading
Failed to load commit list.