[CUDA] Update Flash Attention to support head_sink for smooth softmax in GQA #25358
Azure Pipelines / Linux Android Emulator QNN CI Pipeline
succeeded
Jul 10, 2025 in 12m 48s
Build #20250710.30 succeeded
Loading