Skip to content

Fix: disable 2-CTA backward mode when block_sparse_tensors is used#2433

Merged
drisspg merged 1 commit intoDao-AILab:mainfrom
jduprat:fix-block-sparse-2cta-backward
Apr 3, 2026
Merged

Fix: disable 2-CTA backward mode when block_sparse_tensors is used#2433
drisspg merged 1 commit intoDao-AILab:mainfrom
jduprat:fix-block-sparse-2cta-backward

Conversation

@jduprat
Copy link
Copy Markdown
Contributor

@jduprat jduprat commented Apr 3, 2026

The SM100 2-CTA backward kernel does not properly handle block_sparse_tensors. When block sparsity is combined with 2-CTA mode, the kernel hits an assertion:
'AssertionError: 2-CTA mode does not support block sparsity'

This fix adds block_sparse_tensors to the disable_2cta condition in the backward path, forcing the 1-CTA kernel when block sparsity is active. The 1-CTA backward kernel already supports block_sparse_tensors correctly.

Without this fix, any backward pass using block_sparse_tensors on SM100 (B200/GB200) with head_dim >= 128 will crash with the above assertion.

The SM100 2-CTA backward kernel does not properly handle block_sparse_tensors.
When block sparsity is combined with 2-CTA mode, the kernel hits an assertion:
  'AssertionError: 2-CTA mode does not support block sparsity'

This fix adds block_sparse_tensors to the disable_2cta condition in the backward
path, forcing the 1-CTA kernel when block sparsity is active. The 1-CTA backward
kernel already supports block_sparse_tensors correctly.

Without this fix, any backward pass using block_sparse_tensors on SM100 (B200/GB200)
with head_dim >= 128 will crash with the above assertion.
jduprat pushed a commit to jduprat/flash-attention that referenced this pull request Apr 3, 2026
* Initial FA-2 aiter Triton Windows build support

* minimize diff

* bump commit

* bump commit

* minimize diff

* bump commit

* bump aiter submodule

* bump aiter submodule to merged Dao-AILab#2433

* fix: guard distributed.py fallbacks with hasattr for Windows

---------

Co-authored-by: 0xDELUXA <djernovevo@gmail.com>
@drisspg drisspg merged commit 65bfd9a into Dao-AILab:main Apr 3, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants