Skip to content

[TRTLLM-10022][feat] Add hopper xqa decode support for skip softmax attention#10264

Merged
pengbowang-nv merged 11 commits into
NVIDIA:mainfrom
pengbowang-nv:dev-add-hopper-xqa-skip-softmax
Jan 12, 2026
Merged

[TRTLLM-10022][feat] Add hopper xqa decode support for skip softmax attention#10264
pengbowang-nv merged 11 commits into
NVIDIA:mainfrom
pengbowang-nv:dev-add-hopper-xqa-skip-softmax

disable skip for short seqs

4bb57fa
Select commit
Loading
Failed to load commit list.