[CUDA] Support head_sink in flash attention for GQA#25432
Merged
Azure Pipelines / Windows x64 QNN CI Pipeline (BUILD_QNN_EP SHARED_LIB)
succeeded
Jul 17, 2025 in 24m 53s
BUILD_QNN_EP SHARED_LIB succeeded
Loading