[FA4][hd256] Fix layout of non-contiguous qkv in backward kernel#2545

Merged: Johnsonms merged 1 commit into Dao-AILab:main from wangsiyu:fused_qga_backward on May 7, 2026

@wangsiyu wangsiyu commented May 7, 2026

This PR fixes a bug in the backward kernel that occurs when q, k, and v are non-contiguous and head_dim=256. All unit tests pass.
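For readers unfamiliar with the term, here is a minimal sketch of how non-contiguous q, k, v can arise. It assumes a packed buffer of shape `(batch, seqlen, 3, nheads, headdim)` from which q, k, and v are taken as views; that layout, the sizes, and the helper names are illustrative assumptions, not taken from this PR or from the flash-attention API. The sketch uses plain Python stride arithmetic rather than real tensors.

```python
def contiguous_strides(shape):
    """Row-major (C-order) element strides for a dense tensor of this shape."""
    strides = []
    step = 1
    for dim in reversed(shape):
        strides.append(step)
        step *= dim
    return tuple(reversed(strides))


def select(shape, strides, axis, index):
    """Shape, strides, and element offset of the view that fixes `axis` at `index`."""
    offset = index * strides[axis]
    new_shape = shape[:axis] + shape[axis + 1:]
    new_strides = strides[:axis] + strides[axis + 1:]
    return new_shape, new_strides, offset


def is_contiguous(shape, strides):
    """True if the view's strides match a dense row-major layout."""
    return strides == contiguous_strides(shape)


# Hypothetical sizes; only headdim=256 comes from the PR title.
batch, seqlen, nheads, headdim = 2, 128, 4, 256

# Assumed packed layout: q, k, v interleaved along axis 2.
packed_shape = (batch, seqlen, 3, nheads, headdim)
packed_strides = contiguous_strides(packed_shape)

# Taking q, k, v as views of the packed buffer (no copy).
q_shape, q_strides, q_offset = select(packed_shape, packed_strides, axis=2, index=0)
k_shape, k_strides, k_offset = select(packed_shape, packed_strides, axis=2, index=1)
v_shape, v_strides, v_offset = select(packed_shape, packed_strides, axis=2, index=2)

print("q shape:", q_shape, "strides:", q_strides, "contiguous:", is_contiguous(q_shape, q_strides))
print("dense strides for the same shape:", contiguous_strides(q_shape))
```

In this sketch each view keeps the sequence stride of the packed buffer (3 × nheads × headdim elements) instead of the dense stride (nheads × headdim), so a kernel that assumes the dense layout would read the wrong elements. Whether this is the exact situation the PR addresses is not stated in the description.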

@Johnsonms Johnsonms merged commit 09aa322 into Dao-AILab:main May 7, 2026
reubenconducts pushed a commit to reubenconducts/flash-attention that referenced this pull request Jun 2, 2026
