Skip to content

[Bugfix][CI] Skip flaky test_eagle test#38566

Merged
MatthewBonanni merged 2 commits intovllm-project:mainfrom
NickLucche:fix-spec
Mar 31, 2026
Merged

[Bugfix][CI] Skip flaky test_eagle test#38566
MatthewBonanni merged 2 commits intovllm-project:mainfrom
NickLucche:fix-spec

Conversation

@NickLucche
Copy link
Copy Markdown
Collaborator

@NickLucche NickLucche commented Mar 30, 2026

Tentative fix for #31913, do not merge until reviewed and approved locally.

EDIT: I think disabling async with DP>1 is too harsh, still I would appreciate any comment with more context to clarify this is indeed not needed.

Signed-off-by: NickLucche <nlucches@redhat.com>
@NickLucche NickLucche requested a review from njhill as a code owner March 30, 2026 16:51
Copy link
Copy Markdown

@claude claude bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Claude Code Review

This pull request is from a fork — automated review is disabled. A repository maintainer can comment @claude review to run a one-time review.

@NickLucche NickLucche changed the title [Bugfix] Fix flaky test_eagle test [Bugfix][CI] Fix flaky test_eagle test Mar 30, 2026
@NickLucche NickLucche added the ready ONLY add when PR is ready to merge/full CI is needed label Mar 30, 2026
Copy link
Copy Markdown
Contributor

@gemini-code-assist gemini-code-assist bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request modifies the GPUModelRunner to disable asynchronous speculative decoding when data parallelism is enabled by ensuring data_parallel_size is equal to 1. The review feedback suggests adding a comment or a TODO to document this restriction and track future work to support this feature.

self.use_async_scheduling and self.num_spec_tokens > 0
self.use_async_scheduling
and self.num_spec_tokens > 0
and self.vllm_config.parallel_config.data_parallel_size == 1
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

high

This condition is critical for correctness when using data parallelism. To prevent future regressions and improve code clarity, please add a comment explaining why asynchronous speculative decoding is disabled when data_parallel_size > 1. A TODO comment would also be appropriate to track future work to support this feature.

            # TODO: Support async speculative decoding with data parallelism.
            and self.vllm_config.parallel_config.data_parallel_size == 1

@mergify mergify bot added v1 bug Something isn't working labels Mar 30, 2026
@MatthewBonanni
Copy link
Copy Markdown
Collaborator

Like you mention, I don't think disabling async spec decode for DP > 1 is the right move, especially because that test is known flaky prior to #32951, and this is primarily an issue of batch invariance

@NickLucche
Copy link
Copy Markdown
Collaborator Author

NickLucche commented Mar 31, 2026

Running with batch invariance doesn't really help here. Also I can occasionally repro with a single request too.

Signed-off-by: NickLucche <nlucches@redhat.com>
@NickLucche NickLucche changed the title [Bugfix][CI] Fix flaky test_eagle test [Bugfix][CI] Skip flaky test_eagle test Mar 31, 2026
@NickLucche
Copy link
Copy Markdown
Collaborator Author

@markmc
Copy link
Copy Markdown
Member

markmc commented Mar 31, 2026

xref #38584

@NickLucche NickLucche closed this Mar 31, 2026
@NickLucche NickLucche reopened this Mar 31, 2026
Copy link
Copy Markdown
Collaborator

@MatthewBonanni MatthewBonanni left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM for now

@MatthewBonanni MatthewBonanni merged commit 7430389 into vllm-project:main Mar 31, 2026
18 checks passed
EricccYang pushed a commit to EricccYang/vllm that referenced this pull request Apr 1, 2026
Signed-off-by: NickLucche <nlucches@redhat.com>
Signed-off-by: EricccYang <yangyang4991@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

bug Something isn't working ready ONLY add when PR is ready to merge/full CI is needed v1

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants