
[Bugfix][MLA] Change default SM100 MLA prefill backend back to TRT-LLM#38562

Merged
vllm-bot merged 1 commit into vllm-project:main from MatthewBonanni:fi_mla_prefill_default on Mar 30, 2026

Conversation

Collaborator

@MatthewBonanni MatthewBonanni commented Mar 30, 2026

FIX: #36763

Purpose

On SM100, FA4 MLA prefill appears to cause unusable output on Kimi-K2.5. This PR changes the default MLA prefill backend back to TRT-LLM while we resolve the issues with FA4.

Test Plan

Test Result


Essential Elements of an Effective PR Description Checklist
  • The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
  • The test plan, such as providing test command.
  • The test results, such as pasting the results comparison before and after, or e2e results
  • (Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
  • (Optional) Release notes update. If your change is user facing, please update the release notes draft in the Google Doc.

Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>

@claude claude bot left a comment


Claude Code Review

This pull request is from a fork — automated review is disabled. A repository maintainer can comment @claude review to run a one-time review.

@mergify mergify bot added the bug Something isn't working label Mar 30, 2026
Contributor

@gemini-code-assist gemini-code-assist bot left a comment


Code Review

This pull request updates the default value of use_trtllm_ragged_deepseek_prefill to True in the attention configuration. The reviewer suggests renaming this flag to use_trtllm_mla_prefill to better reflect its general purpose for MLA prefill backends and improve maintainability, as the current name is overly specific to DeepSeek.

@MatthewBonanni changed the title from "[Bugfix][MLA] Change default MLA prefill backend back to TRT-LLM" to "[Bugfix][MLA] Change default SM100 MLA prefill backend back to TRT-LLM" on Mar 30, 2026
Collaborator

@LucasWilkinson LucasWilkinson left a comment


Thank you for the quick fix!

@LucasWilkinson LucasWilkinson added the ready ONLY add when PR is ready to merge/full CI is needed label Mar 30, 2026
@LucasWilkinson LucasWilkinson added this to the v0.18.0 cherry picks milestone Mar 30, 2026
"""Whether to use cudnn prefill."""

use_trtllm_ragged_deepseek_prefill: bool = False
use_trtllm_ragged_deepseek_prefill: bool = True
Member


Where do we control FA4 MLA prefill? I don't see a similar entry for it

Collaborator Author


It falls through to FA4 when TRT-LLM isn't enabled. It's a messy interface; #32623 will clean this up.
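The fallthrough behavior described above can be sketched as follows. This is a hypothetical illustration, not vLLM's actual interface: the class name, function name, and backend strings are illustrative, with only the `use_trtllm_ragged_deepseek_prefill` flag taken from the diff in this PR.

```python
from dataclasses import dataclass


@dataclass
class AttentionConfig:
    # This PR flips the default to True so TRT-LLM is preferred on SM100.
    use_trtllm_ragged_deepseek_prefill: bool = True


def select_mla_prefill_backend(config: AttentionConfig) -> str:
    """Pick the MLA prefill backend.

    There is no dedicated flag for FA4: if the TRT-LLM ragged prefill
    flag is unset, selection falls through to FA4.
    """
    if config.use_trtllm_ragged_deepseek_prefill:
        return "TRTLLM_RAGGED"
    return "FA4"
```

With the new default, `select_mla_prefill_backend(AttentionConfig())` picks TRT-LLM; users must explicitly disable the flag to fall through to FA4.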

Collaborator Author


Relevant code block is here:

@vllm-bot vllm-bot merged commit 2c734ed into vllm-project:main Mar 30, 2026
41 of 55 checks passed
@MatthewBonanni MatthewBonanni deleted the fi_mla_prefill_default branch March 30, 2026 18:05
khluu pushed a commit that referenced this pull request Mar 30, 2026

Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
(cherry picked from commit 2c734ed)

Commits referencing this pull request were also pushed to the following forks:
  • benenzhu/vllm, Mar 31, 2026 (Signed-off-by: zhutaoyu <zhutaoyu97@gmail.com>)
  • vllm-agent/vllm, Mar 31, 2026
  • neweyes/vllm, Mar 31, 2026 (Signed-off-by: neweyes <328719365@qq.com>)
  • EricccYang/vllm, Apr 1, 2026 (Signed-off-by: EricccYang <yangyang4991@gmail.com>)
  • Bharatgen-Tech/vllm (pushed by bhargav-patel-29), Apr 1, 2026 (Signed-off-by: bhargav-patel-29 <bhargav.patel@tihiitb.org>)
  • yzong-rh/vllm, Apr 3, 2026
  • liuchenbing2026/vllm, Apr 4, 2026
  • rishitdholakia13/vllm, Apr 7, 2026 (Signed-off-by: rishitdholakia13 <rishit+github@cohere.com>)
  • puririshi98/vllm, Apr 7, 2026 (Signed-off-by: Rishi Puri <riship@nvidia.com>)
  • EmbeddedLLM/vllm (pushed by big-yellow-duck), Apr 8, 2026
  • blackfuel-ai/vllm (pushed by mtparet), Apr 9, 2026

Labels

bug Something isn't working ready ONLY add when PR is ready to merge/full CI is needed

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[Bug]: Kimi-K2.5 outputs only '!!!!!!!!!!' in reasoning field, content is always null

5 participants