Skip to content

Revert "[Attention][FA3] Update FA3 to include new swizzle optimization"#33841

Merged
vllm-bot merged 2 commits intomainfrom
revert-23465-lwilkinson/fa3-swizzle
Feb 5, 2026
Merged

Revert "[Attention][FA3] Update FA3 to include new swizzle optimization"#33841
vllm-bot merged 2 commits intomainfrom
revert-23465-lwilkinson/fa3-swizzle

Conversation

@ProExpertProg
Copy link
Collaborator

@ProExpertProg ProExpertProg commented Feb 4, 2026

Reverts #23465

As described in #33802, #23465 broke the Distributed Tests 2 GPUs (H100).

Note that since the tests have been slightly refactored so the failing test (test_async_tp.py::test_async_pass_correctness) is now in a new group after #33293. The full CI run for that PR is here:

This PR gets both the new tests green (old were already green above).

@ProExpertProg ProExpertProg added the ready ONLY add when PR is ready to merge/full CI is needed label Feb 4, 2026
Copy link
Contributor

@gemini-code-assist gemini-code-assist bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request reverts a previous change that introduced a swizzle optimization in FA3, which was identified as a potential cause for test failures. The revert includes changing the GIT_TAG for the flash-attention dependency back to a previous commit and restoring the scheduler_metadata size calculation in flash_attn.py and flashattn_mla.py. The changes are a clean and necessary revert to address the correctness issues. The code looks good.

@ProExpertProg ProExpertProg changed the title Revert "[Attention][FA3] Update FA3 to include new swizzle optimization" [DO NOT MERGE] Revert "[Attention][FA3] Update FA3 to include new swizzle optimization" Feb 5, 2026
@ProExpertProg
Copy link
Collaborator Author

Plausible fix in #33854

@ProExpertProg ProExpertProg changed the title [DO NOT MERGE] Revert "[Attention][FA3] Update FA3 to include new swizzle optimization" Revert "[Attention][FA3] Update FA3 to include new swizzle optimization" Feb 5, 2026
@ProExpertProg ProExpertProg enabled auto-merge (squash) February 5, 2026 02:41
@ProExpertProg
Copy link
Collaborator Author

#33854 sadly does not fix the test, moving forward with this revert

@vllm-bot vllm-bot merged commit e3bf79f into main Feb 5, 2026
110 checks passed
@vllm-bot vllm-bot deleted the revert-23465-lwilkinson/fa3-swizzle branch February 5, 2026 03:54
RegularJoe-CEO pushed a commit to RegularJoe-CEO/vllm that referenced this pull request Feb 5, 2026
@ProExpertProg ProExpertProg linked an issue Feb 5, 2026 that may be closed by this pull request
3 tasks
LucasWilkinson added a commit to neuralmagic/vllm that referenced this pull request Feb 7, 2026
LucasWilkinson added a commit to neuralmagic/vllm that referenced this pull request Feb 8, 2026
LucasWilkinson added a commit that referenced this pull request Feb 9, 2026
LucasWilkinson added a commit to neuralmagic/vllm that referenced this pull request Feb 10, 2026
LucasWilkinson added a commit that referenced this pull request Feb 10, 2026
LucasWilkinson added a commit that referenced this pull request Feb 10, 2026
…ion" (#33841)

This reverts commit e3bf79f.

Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
LucasWilkinson added a commit to neuralmagic/vllm that referenced this pull request Feb 11, 2026
…ion" (vllm-project#33841)

This reverts commit e3bf79f.

Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
ItzDEXX pushed a commit to ItzDEXX/vllm that referenced this pull request Feb 19, 2026
tunglinwood pushed a commit to tunglinwood/vllm that referenced this pull request Mar 4, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ci/build ready ONLY add when PR is ready to merge/full CI is needed v1

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[CI Failure]: Distributed 2xH100 tests

3 participants