[Bugfix] Fix Basic Models Test #34818

Merged
vllm-bot merged 26 commits into vllm-project:main from MatthewBonanni:fix_basic_extra_init
Feb 19, 2026

Conversation

@MatthewBonanni
Collaborator

@MatthewBonanni MatthewBonanni commented Feb 18, 2026

Purpose

Fixes #34806, #34810, #34814, and #34819

Test Plan

Test Result


Essential Elements of an Effective PR Description Checklist
  • The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
  • The test plan, such as providing test command.
  • The test results, such as pasting the results comparison before and after, or e2e results
  • (Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
  • (Optional) Release notes update. If your change is user facing, please update the release notes draft in the Google Doc.

Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
@mergify mergify bot added speculative-decoding bug Something isn't working labels Feb 18, 2026
Contributor

@gemini-code-assist gemini-code-assist bot left a comment


Code Review

This pull request introduces two small but useful changes. The first is an update to the model registry in tests/models/registry.py to provide a more specific reason for skipping tests related to H2OVLChatModel, which will improve test stability. The second change, in vllm/model_executor/models/minicpm_eagle.py, explicitly disallows the use of inputs_embeds for the EagleMiniCPMForCausalLM model by raising a NotImplementedError. This is a good defensive measure to prevent incorrect usage of the model and provides a clear error message. Both changes are correct and improve the robustness of the codebase. I approve these changes.
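The `inputs_embeds` guard described in the review might look roughly like the following. This is an illustrative sketch only, not the actual code from `vllm/model_executor/models/minicpm_eagle.py`; the class stub and `forward` signature are assumptions for demonstration.

```python
class EagleMiniCPMForCausalLM:
    """Illustrative stub (not the real vLLM class) showing the defensive
    guard described in the review: the Eagle draft model does not accept
    precomputed input embeddings, so passing them should fail loudly
    instead of silently producing incorrect behavior."""

    def forward(self, input_ids, inputs_embeds=None):
        if inputs_embeds is not None:
            raise NotImplementedError(
                "EagleMiniCPMForCausalLM does not support inputs_embeds; "
                "pass input_ids instead.")
        # A real model would run embedding + transformer layers here;
        # the stub just echoes its input.
        return input_ids
```

Raising `NotImplementedError` at the entry point turns a subtle correctness bug into an immediate, actionable error message.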

@mgoin mgoin added ready ONLY add when PR is ready to merge/full CI is needed ci-failure Issue about an unexpected test failure in CI labels Feb 18, 2026
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
@DarkLight1337
Member

Hmm this simply skips the tests, which we didn't have to do before #33600

@MatthewBonanni MatthewBonanni changed the title from "[Bugfix] Fix Basic Models Test (Extra Initialization)" to "[Bugfix] Fix Basic Models Test" Feb 18, 2026
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
@MatthewBonanni
Collaborator Author

MatthewBonanni commented Feb 18, 2026

@DarkLight1337 good point - I changed this in 6ac74b8

@DarkLight1337 DarkLight1337 requested a review from mgoin February 18, 2026 17:29
@DarkLight1337
Member

DarkLight1337 commented Feb 18, 2026

Extra Initialization tests still fail

@mgoin
Member

mgoin commented Feb 18, 2026

The extra init tests seem to be failing because of OOM, so there's no clear path to debugging them

@DarkLight1337
Member

DarkLight1337 commented Feb 18, 2026

Might it be because #33600 caused the overrides/patches specific to the model initialization tests to not function correctly?

@MatthewBonanni
Collaborator Author

@DarkLight1337 yes, I believe it's because _update_block_size_for_backend causes CUDA initialization

Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
@MatthewBonanni
Collaborator Author

MatthewBonanni commented Feb 18, 2026

As a workaround, I've decided to only run _update_block_size_for_backend for MLA models. This was the behavior pre-#33600: all standard attention backends support the default block size of 16, so skipping the call avoids importing attention backends that initialize CUDA.

After this change, if the user sets a bad block size for a standard attention model, attention backend selection won't factor it in. That was also the pre-#33600 behavior, though, so it should be alright.

I'll work on a fix that lets us get rid of the use_mla check in a follow up

EDIT: This workaround ended up not being any simpler than the full fix
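The shape of the workaround discussed above can be sketched as follows. This is a simplified illustration of the idea, not vLLM's actual implementation; the function name `resolve_block_size`, the placeholder backend value, and the signature are all assumptions.

```python
DEFAULT_BLOCK_SIZE = 16  # supported by all standard attention backends

def resolve_block_size(use_mla, requested=None):
    """Illustrative sketch of the MLA-only gate described in the thread
    (not vLLM's real code). Only MLA models consult the attention backend
    for a block size, because importing those backends can initialize CUDA
    as a side effect -- which breaks CPU-only initialization tests.
    Standard-attention models keep the requested value or the default,
    unvalidated, matching the pre-#33600 behavior."""
    if not use_mla:
        # Skip the backend import entirely; no CUDA init at config time.
        return requested if requested is not None else DEFAULT_BLOCK_SIZE

    # MLA path: a real implementation would lazily import the backend
    # here and ask it for its required block size.
    backend_block_size = 64  # placeholder, not a real backend query
    return backend_block_size
```

The key design point is that the expensive (and side-effecting) backend lookup only happens on the code path that actually needs it.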

Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
Member

@mgoin mgoin left a comment


The compromise is good with me to fix CI, thanks Matt

@github-project-automation github-project-automation bot moved this to Ready in NVIDIA Feb 18, 2026
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
@vllm-bot vllm-bot merged commit 662205d into vllm-project:main Feb 19, 2026
64 of 67 checks passed
@github-project-automation github-project-automation bot moved this from Ready to Done in NVIDIA Feb 19, 2026
mgoin added a commit to neuralmagic/vllm that referenced this pull request Feb 20, 2026
Signed-off-by: mgoin <mgoin64@gmail.com>
LucasWilkinson added a commit that referenced this pull request Feb 20, 2026
jmamou pushed a commit to jmamou/vllm that referenced this pull request Feb 23, 2026
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
Co-authored-by: Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
MatthewBonanni added a commit to MatthewBonanni/vllm that referenced this pull request Feb 23, 2026
commit 4f9b8be
Author: mgoin <mgoin64@gmail.com>
Date:   Fri Feb 20 17:14:56 2026 +0000

    Cleanup

    Signed-off-by: mgoin <mgoin64@gmail.com>

commit feed637
Author: mgoin <mgoin64@gmail.com>
Date:   Fri Feb 20 17:08:37 2026 +0000

    Fix block_size mismatch for MLA models after vllm-project#34818

    Signed-off-by: mgoin <mgoin64@gmail.com>

Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
llsj14 pushed a commit to llsj14/vllm that referenced this pull request Mar 1, 2026
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
Co-authored-by: Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
tunglinwood pushed a commit to tunglinwood/vllm that referenced this pull request Mar 4, 2026
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
Co-authored-by: Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
sharvil10 pushed a commit to sharvil10/vllm that referenced this pull request Mar 4, 2026
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
Co-authored-by: Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
askliar pushed a commit to askliar/vllm that referenced this pull request Mar 9, 2026
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
Co-authored-by: Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
Signed-off-by: Andrii Skliar <askliar@nvidia.com>
Copilot AI pushed a commit to machov/vllm that referenced this pull request Mar 10, 2026
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
Co-authored-by: Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>

Labels

  • bug — Something isn't working
  • ci-failure — Issue about an unexpected test failure in CI
  • multi-modality — Related to multi-modality (#4194)
  • nvidia
  • ready — ONLY add when PR is ready to merge/full CI is needed
  • speculative-decoding
  • v1

Projects

Status: Done

Development

Successfully merging this pull request may close these issues.

[CI Failure]: models/test_initialization.py::test_can_initialize_large_subset[EagleMiniCPMForCausalLM]

7 participants