[ROCm][CI] Fix spec decode profile assertion and logprob test determinism#35043
vllm-bot merged 4 commits into vllm-project:main
Conversation
…ched_tokens Signed-off-by: Andreas Karatzas <akaratza@amd.com>
…rob test Signed-off-by: Andreas Karatzas <akaratza@amd.com>
Hi @AndreasKaratzas, the pre-commit checks have failed. Please run:

```shell
uv pip install pre-commit
pre-commit install
pre-commit run --all-files
```

Then commit the changes and push to your branch.
Code Review
The pull request addresses two issues related to speculative decoding on ROCm: an assertion failure in gpu_model_runner.py and non-deterministic logprob comparisons in test_logprobs.py. The changes correctly update the assertion to use self.max_num_tokens and introduce ROCM_DETERMINISM_KWARGS to ensure deterministic execution for logprob tests on ROCm. The changes are well-explained and directly address the identified problems.
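To illustrate why the old assertion was too strict, here is a minimal, self-contained sketch; the class, names, and numbers are hypothetical stand-ins for the real logic in `gpu_model_runner.py`, where `max_num_tokens` is derived elsewhere:

```python
# Hypothetical minimal model of the fixed assertion. With speculative
# decoding, the profile run also covers verification tokens, so the
# effective token bound can exceed the scheduler's batch budget.
class DummyRunner:
    def __init__(self, max_num_batched_tokens: int, num_spec_tokens: int) -> None:
        self.max_num_batched_tokens = max_num_batched_tokens
        # Illustrative only: verification tokens add to the profile size.
        self.max_num_tokens = max_num_batched_tokens + num_spec_tokens

    def dummy_run(self, num_tokens: int) -> bool:
        # Fixed check: bound by max_num_tokens, not max_num_batched_tokens.
        return num_tokens <= self.max_num_tokens


runner = DummyRunner(max_num_batched_tokens=2048, num_spec_tokens=8)
print(runner.dummy_run(2056))  # True under the fix; the old bound would reject it
```

Under the old assertion, `2056 <= 2048` fails even though the profile run legitimately sizes itself to include verification tokens.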
```python
ROCM_DETERMINISM_KWARGS: dict = (
    dict(
        max_num_seqs=1,
    )
    if current_platform.is_rocm()
    else {}
)
```
The ROCM_DETERMINISM_KWARGS dictionary currently only sets max_num_seqs=1. The PR description mentions enforce_eager and async_scheduling=False as part of the determinism kwargs. These should also be included in the dictionary to fully align with the described fix and ensure consistent execution paths on ROCm.
Suggested change:

```python
ROCM_DETERMINISM_KWARGS: dict = (
    dict(
        enforce_eager=True,
        async_scheduling=False,
        max_num_seqs=1,
    )
    if current_platform.is_rocm()
    else {}
)
```
I've updated the description already, apparently those args were unnecessary.
This PR depends on: That's why pre-commit is failing. EDIT: This is no longer true. I have reverted the change.
I'm going to revert the change.
Signed-off-by: Andreas Karatzas <akaratza@amd.com>
…nism (vllm-project#35043) Signed-off-by: Andreas Karatzas <akaratza@amd.com>
…nism (vllm-project#35043) Signed-off-by: Andreas Karatzas <akaratza@amd.com> Signed-off-by: Andrii Skliar <askliar@nvidia.com>
Fixes two issues blocking spec decode logprob tests on ROCm:
Profile run assertion failure (`gpu_model_runner.py`): The `_dummy_run` method asserted `num_tokens <= self.scheduler_config.max_num_batched_tokens`, but with speculative decoding `max_num_tokens` (which accounts for verification tokens) can exceed `max_num_batched_tokens`. Updated the assertion to use `self.max_num_tokens`, consistent with the rest of the runner.

Non-deterministic logprob comparison (`test_logprobs.py`): The ref LLM and the spec-decode LLM used different batch sizes, which on ROCm triggers non-associative floating-point reduction differences in attention/GEMM kernels. These numerical divergences were misattributed to spec decode incorrectness. Added `ROCM_DETERMINISM_KWARGS` (`max_num_seqs=1`), applied to both LLM instances on ROCm only, pinning identical execution paths. No behavioral change on other platforms.

Test Plan