[CI][Bugfix] Fix `test_run_eagle_dp` by MatthewBonanni · Pull Request #38584 · vllm-project/vllm

MatthewBonanni · 2026-03-30T20:51:11Z

FIX: #38234
FIX: #31913
Revert: #31915

Purpose

Fixes flaky test by disabling AOT scheduling when VLLM_BATCH_INVARIANT is enabled.

Test Plan

Distributed DP Tests (2 GPUs)
pytest tests/v1/distributed/test_eagle_dp.py::test_run_eagle_dp[FLASH_ATTN]

Test Result

Should pass in CI

Essential Elements of an Effective PR Description Checklist

The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
The test plan, such as providing test command.
The test results, such as pasting the results comparison before and after, or e2e results
(Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
(Optional) Release notes update. If your change is user facing, please update the release notes draft in the Google Doc.

Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>

claude

Claude Code Review

This pull request is from a fork — automated review is disabled. A repository maintainer can comment @claude review to run a one-time review.

gemini-code-assist

Code Review

This pull request modifies the Flash Attention backend to disable the Ahead-of-Time (AOT) schedule when batch invariance is enabled via the VLLM_BATCH_INVARIANT environment variable. This change is necessary because the AOT schedule varies with the maximum sequence lengths of the query and key, which is incompatible with batch-invariant execution. I have no feedback to provide as no review comments were submitted.

Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>

markmc · 2026-03-31T09:25:15Z

From @NickLucche on Slack:

I am not really convinced this is a batch invariance issue, given I can occasionally repro with the flag set and with a single request.

NickLucche · 2026-03-31T09:26:22Z

vllm/v1/attention/backends/flash_attn.py

+        # Disable AOT schedule for spec-decode proposer (not worth the overhead)
+        # and for batch invariance (schedule varies with max_seqlen_q/k).
+        aot_schedule = (
+            self.aot_schedule and not fast_build and not envs.VLLM_BATCH_INVARIANT


nit: this is a lambda

MatthewBonanni · 2026-03-31T14:11:44Z

Note: wasn't intending for this to be merged yet, wanted to run CI a few times on this PR to verify the fix. Unfortunately it looks like the test is still flaky. #38566 disables the test temporarily

Signed-off-by: Matthew Bonanni <mbonanni@redhat.com> Signed-off-by: EricccYang <yangyang4991@gmail.com>

Signed-off-by: Matthew Bonanni <mbonanni@redhat.com> Signed-off-by: bhargav-patel-29 <bhargav.patel@tihiitb.org>

Fix

c8bab2d

Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>

MatthewBonanni requested a review from LucasWilkinson as a code owner March 30, 2026 20:51

claude bot reviewed Mar 30, 2026

View reviewed changes

mergify bot added v1 bug Something isn't working labels Mar 30, 2026

gemini-code-assist bot reviewed Mar 30, 2026

View reviewed changes

Restore original num_expected_tokens

25d6f64

Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>

MatthewBonanni added the ready ONLY add when PR is ready to merge/full CI is needed label Mar 30, 2026

njhill approved these changes Mar 31, 2026

View reviewed changes

NickLucche approved these changes Mar 31, 2026

View reviewed changes

markmc mentioned this pull request Mar 31, 2026

[Bugfix][CI] Skip flaky test_eagle test #38566

Merged

NickLucche merged commit 7d65463 into vllm-project:main Mar 31, 2026
62 checks passed

markmc mentioned this pull request Mar 31, 2026

[Bug]: test_eagle_dp test is flaky #31913

Closed

1 task

MatthewBonanni deleted the fix_dp_batch_invariant branch March 31, 2026 14:11

MatthewBonanni restored the fix_dp_batch_invariant branch March 31, 2026 15:03

yewentao256 changed the title ~~[WIP][CI][Bugfix] Fix test_run_eagle_dp~~ [CI][Bugfix] Fix test_run_eagle_dp Mar 31, 2026

EricccYang pushed a commit to EricccYang/vllm that referenced this pull request Apr 1, 2026

[WIP][CI][Bugfix] Fix test_run_eagle_dp (vllm-project#38584)

69e9be9

Signed-off-by: Matthew Bonanni <mbonanni@redhat.com> Signed-off-by: EricccYang <yangyang4991@gmail.com>

bhargav-patel-29 pushed a commit to Bharatgen-Tech/vllm that referenced this pull request Apr 1, 2026

[WIP][CI][Bugfix] Fix test_run_eagle_dp (vllm-project#38584)

ad7ccac

Signed-off-by: Matthew Bonanni <mbonanni@redhat.com> Signed-off-by: bhargav-patel-29 <bhargav.patel@tihiitb.org>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[CI][Bugfix] Fix `test_run_eagle_dp`#38584

[CI][Bugfix] Fix `test_run_eagle_dp`#38584
NickLucche merged 2 commits intovllm-project:mainfrom
MatthewBonanni:fix_dp_batch_invariant

MatthewBonanni commented Mar 30, 2026 •

edited

Loading

Uh oh!

claude bot left a comment

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

markmc commented Mar 31, 2026

Uh oh!

NickLucche Mar 31, 2026

Uh oh!

Uh oh!

MatthewBonanni commented Mar 31, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

Conversation

MatthewBonanni commented Mar 30, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

Test Plan

Test Result

Uh oh!

claude bot left a comment

Choose a reason for hiding this comment

Claude Code Review

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

markmc commented Mar 31, 2026

Uh oh!

NickLucche Mar 31, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

MatthewBonanni commented Mar 31, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

MatthewBonanni commented Mar 30, 2026 •

edited

Loading