Skip to content

[CUDA] Support head_sink in flash attention for GQA#25432

Merged
tianleiwu merged 4 commits into
mainfrom
tlwu/gqa_head_sink_cuda
Jul 17, 2025
Merged

[CUDA] Support head_sink in flash attention for GQA#25432
tianleiwu merged 4 commits into
mainfrom
tlwu/gqa_head_sink_cuda

fix build

1cf1aa7
Select commit
Loading
Failed to load commit list.
Azure Pipelines / Windows x64 QNN CI Pipeline (BUILD_QNN_EP SHARED_LIB) succeeded Jul 17, 2025 in 24m 53s

BUILD_QNN_EP SHARED_LIB succeeded