[Attention][MLA] Re-enable FA4 as default MLA prefill backend by MatthewBonanni · Pull Request #38819 · vllm-project/vllm

MatthewBonanni · 2026-04-02T15:17:41Z

NaN issue resulting in correctness problems for MLA models #36763 has been resolved by updating FA4 (#38690) to capture the upstream fix Dao-AILab/flash-attention@0293155

This PR makes FA4 the default again due to its superior performance (see benchmarks in #34732)

…o TRT-LL…" This reverts commit 2c734ed.

gemini-code-assist

Code Review

This pull request updates the AttentionConfig in vllm/config/attention.py by changing the default value of use_trtllm_ragged_deepseek_prefill from True to False. I have no feedback to provide.

yewentao256

LGTM, thanks for the work!

…roject#38819)

…roject#38819) Signed-off-by: Jacob Lou <jacoblou0924@gmail.com>

…roject#38819) Signed-off-by: Song Kai <songkai05@baidu.com>

…roject#38819) Signed-off-by: rishitdholakia13 <rishit+github@cohere.com>

…roject#38819) Signed-off-by: Rishi Puri <riship@nvidia.com>

…roject#38819)

Revert "[Bugfix][MLA] Change default SM100 MLA prefill backend back t…

460683e

…o TRT-LL…" This reverts commit 2c734ed.

MatthewBonanni requested review from ProExpertProg, WoosukKwon, hmellor, houseroad, mgoin, robertgshaw2-redhat, tlrmchlsmth, yewentao256 and youkaichao as code owners April 2, 2026 15:17

MatthewBonanni added the ready ONLY add when PR is ready to merge/full CI is needed label Apr 2, 2026

MatthewBonanni changed the title ~~Re-enable FA4 as default MLA prefill backend~~ [Do Not Merge] Re-enable FA4 as default MLA prefill backend Apr 2, 2026

gemini-code-assist bot reviewed Apr 2, 2026

View reviewed changes

MatthewBonanni changed the title ~~[Do Not Merge] Re-enable FA4 as default MLA prefill backend~~ [Attention][MLA] Re-enable FA4 as default MLA prefill backend Apr 2, 2026

Merge branch 'main' into revert-38562-fi_mla_prefill_default

4f20ad4

yewentao256 approved these changes Apr 6, 2026

View reviewed changes

LucasWilkinson merged commit 9c81f35 into main Apr 6, 2026
57 checks passed

LucasWilkinson deleted the revert-38562-fi_mla_prefill_default branch April 6, 2026 21:51

HenryTangDev pushed a commit to HenryTangMain/vllm that referenced this pull request Apr 6, 2026

[Attention][MLA] Re-enable FA4 as default MLA prefill backend (vllm-p…

cfad88b

…roject#38819)

ShawnWeiChew pushed a commit to ShawnWeiChew/vllm that referenced this pull request Apr 7, 2026

[Attention][MLA] Re-enable FA4 as default MLA prefill backend (vllm-p…

4c51cf5

…roject#38819)

askliar pushed a commit to netanel-haber/vllm that referenced this pull request Apr 7, 2026

[Attention][MLA] Re-enable FA4 as default MLA prefill backend (vllm-p…

ee2b55b

…roject#38819)

askliar pushed a commit to netanel-haber/vllm that referenced this pull request Apr 7, 2026

[Attention][MLA] Re-enable FA4 as default MLA prefill backend (vllm-p…

c550c4f

…roject#38819)

askliar pushed a commit to netanel-haber/vllm that referenced this pull request Apr 7, 2026

[Attention][MLA] Re-enable FA4 as default MLA prefill backend (vllm-p…

88f1493

…roject#38819)

jacob-lou pushed a commit to jacob-lou/vllm that referenced this pull request Apr 7, 2026

[Attention][MLA] Re-enable FA4 as default MLA prefill backend (vllm-p…

911faeb

…roject#38819) Signed-off-by: Jacob Lou <jacoblou0924@gmail.com>

USTCKAY pushed a commit to USTCKAY/vllm that referenced this pull request Apr 7, 2026

[Attention][MLA] Re-enable FA4 as default MLA prefill backend (vllm-p…

f79fefe

…roject#38819) Signed-off-by: Song Kai <songkai05@baidu.com>

mgoin added the nvidia label Apr 7, 2026

rishitdholakia13 pushed a commit to rishitdholakia13/vllm that referenced this pull request Apr 7, 2026

[Attention][MLA] Re-enable FA4 as default MLA prefill backend (vllm-p…

636f82b

…roject#38819) Signed-off-by: rishitdholakia13 <rishit+github@cohere.com>

puririshi98 pushed a commit to puririshi98/vllm that referenced this pull request Apr 7, 2026

[Attention][MLA] Re-enable FA4 as default MLA prefill backend (vllm-p…

097a2cf

…roject#38819) Signed-off-by: Rishi Puri <riship@nvidia.com>

big-yellow-duck pushed a commit to EmbeddedLLM/vllm that referenced this pull request Apr 8, 2026

[Attention][MLA] Re-enable FA4 as default MLA prefill backend (vllm-p…

eab9c83

…roject#38819)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Attention][MLA] Re-enable FA4 as default MLA prefill backend#38819

[Attention][MLA] Re-enable FA4 as default MLA prefill backend#38819
LucasWilkinson merged 2 commits intomainfrom
revert-38562-fi_mla_prefill_default

MatthewBonanni commented Apr 2, 2026 •

edited

Loading

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

yewentao256 left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

Conversation

MatthewBonanni commented Apr 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

yewentao256 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

MatthewBonanni commented Apr 2, 2026 •

edited

Loading