GQA unfused attention with FP32 QK accumulation (fixes #28195) #28198

Merged
tianleiwu merged 8 commits into main from tlwu/unfused_gqa
Apr 25, 2026

fix build

0fcde9a
GitHub Advanced Security / CodeQL completed Apr 25, 2026 in 3s

1 configuration not found

Warning: Code scanning cannot determine the alerts introduced by this pull request, because 1 configuration present on refs/heads/main was not found:

API upload

  • ❓  <default>
