[CUDA] Support head_sink in flash attention for GQA #25432

Merged
tianleiwu merged 4 commits into main from tlwu/gqa_head_sink_cuda
Jul 17, 2025

fix build

1cf1aa7
Azure Pipelines / Linux QNN CI Pipeline succeeded Jul 17, 2025 in 31m 17s

Build #20250717.11 succeeded