[CUDA] Update Flash Attention to support head_sink for smooth softmax in GQA #25358
Closed
Commits
Commits on Jul 10, 2025
- committed
- committed