Skip to content

GQA unfused attention with FP32 QK accumulation (fixes #28195)#28198

Merged
tianleiwu merged 8 commits intomainfrom
tlwu/unfused_gqa
Apr 25, 2026
Merged

GQA unfused attention with FP32 QK accumulation (fixes #28195)#28198
tianleiwu merged 8 commits intomainfrom
tlwu/unfused_gqa

Commits

Commits on Apr 23, 2026

Commits on Apr 24, 2026

Commits on Apr 25, 2026