Skip to content

[CUDA] Update Flash Attention to support head_sink for smooth softmax in GQA#25358

Closed
tianleiwu wants to merge 2 commits intomainfrom
tlwu/gqa_head_sink_cuda
Closed

[CUDA] Update Flash Attention to support head_sink for smooth softmax in GQA#25358
tianleiwu wants to merge 2 commits intomainfrom
tlwu/gqa_head_sink_cuda

Commits

Commits on Jul 10, 2025