Skip to content

Short-circuit to normal attention kernels when threshold is zero to

bb7ea74
Select commit
Loading
Failed to load commit list.
Sign in for the full log view
Merged

feat: Enable TRTLLM-Gen Skip-Softmax attention for MLA #2547

Short-circuit to normal attention kernels when threshold is zero to
bb7ea74
Select commit
Loading
Failed to load commit list.