[Model] Introduce verify_and_update_model_config for VerifyAndUpdateConfig. by noooop · Pull Request #31131 · vllm-project/vllm

noooop · 2025-12-22T09:15:19Z

Purpose

VerifyAndUpdateConfig.verify_and_update_config runs too late to influence _set_default_chunked_prefill_and_prefix_caching_args.
As a result, the logs do not match the actual behavior, and a non-obvious fallback is needed to get the correct results (which sounds like a bug).

e.g.
nvidia/llama-nemotron-embed-1b-v2
nvidia/llama-nemotron-rerank-1b-v2
google/embeddinggemma-300m

Let's introduce verify_and_update_model_config for VerifyAndUpdateConfig to make the logic here clearer.
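
A minimal sketch of the ordering problem described above, not vLLM's actual code. The classes `ModelConfig`, `EngineDefaults`, and `EmbeddingModelHook`, the `is_causal` flag, and the rule that derives defaults from it are all hypothetical and exist only to show why a hook that runs after the defaults are computed cannot affect them.

```python
from dataclasses import dataclass


@dataclass
class ModelConfig:
    """Hypothetical stand-in for a model config (not vLLM's real class)."""
    architecture: str
    is_causal: bool = True


@dataclass
class EngineDefaults:
    """Hypothetical container for the derived default arguments."""
    enable_chunked_prefill: bool
    enable_prefix_caching: bool


class EmbeddingModelHook:
    """Hypothetical per-model hook that corrects the model config."""

    @staticmethod
    def verify_and_update_model_config(model_config: ModelConfig) -> None:
        # Assumption for illustration: this model is not causal.
        model_config.is_causal = False


# Hypothetical registry mapping an architecture name to its hook.
HOOKS = {"ExampleEmbeddingModel": EmbeddingModelHook}


def set_default_args(model_config: ModelConfig) -> EngineDefaults:
    # Defaults are derived from the config as it looks at call time.
    supported = model_config.is_causal
    return EngineDefaults(supported, supported)


def build_late(architecture: str) -> tuple[ModelConfig, EngineDefaults]:
    """Old ordering: defaults first, model-specific hook afterwards."""
    cfg = ModelConfig(architecture)
    defaults = set_default_args(cfg)  # computed from the uncorrected config
    hook = HOOKS.get(architecture)
    if hook is not None:
        hook.verify_and_update_model_config(cfg)
    return cfg, defaults


def build_early(architecture: str) -> tuple[ModelConfig, EngineDefaults]:
    """New ordering: model-specific hook first, defaults afterwards."""
    cfg = ModelConfig(architecture)
    hook = HOOKS.get(architecture)
    if hook is not None:
        hook.verify_and_update_model_config(cfg)
    defaults = set_default_args(cfg)  # computed from the corrected config
    return cfg, defaults
```

With the late ordering, the defaults disagree with the final config; with the early ordering, they agree.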

Test Plan

tests/models/language/pooling_mteb_test/
tests/entrypoints/pooling/

Test Result

pass

Essential Elements of an Effective PR Description Checklist

The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
The test plan, such as providing test command.
The test results, such as pasting the results comparison before and after, or e2e results
(Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
(Optional) Release notes update. If your change is user facing, please update the release notes draft in the Google Doc.

Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io>

gemini-code-assist

Code Review

This pull request refactors the model configuration update mechanism to allow for earlier updates. It introduces a new method verify_and_update_model_config on ModelConfig and migrates several model-specific configuration classes to use it. The changes also include simplifying ModelInfo classes in tests by removing default_pooling_type.

While the refactoring is a good idea, it is incomplete. Several configuration classes in vllm/model_executor/models/config.py have not been migrated to the new mechanism. This will lead to lost functionality for the corresponding models if the old configuration update path is removed. I've left a critical comment detailing the issue and suggesting potential solutions.
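
One way to picture the concern above is a base class where both hooks default to no-ops, so a class that has not been migrated keeps working as long as the old path is still called. This is a sketch under assumptions, not the reviewer's suggestion or vLLM's implementation: the dict-based configs, the `MigratedConfig` and `LegacyConfig` classes, and the `apply_hooks` helper are hypothetical, and the hook signatures are simplified.

```python
class VerifyAndUpdateConfig:
    """Hypothetical base class: both hooks are no-ops by default."""

    @staticmethod
    def verify_and_update_model_config(model_config: dict) -> None:
        return None

    @staticmethod
    def verify_and_update_config(vllm_config: dict) -> None:
        return None


class MigratedConfig(VerifyAndUpdateConfig):
    """Hypothetical class already moved to the early hook."""

    @staticmethod
    def verify_and_update_model_config(model_config: dict) -> None:
        model_config["early"] = True


class LegacyConfig(VerifyAndUpdateConfig):
    """Hypothetical class that still only implements the late hook."""

    @staticmethod
    def verify_and_update_config(vllm_config: dict) -> None:
        vllm_config["late"] = True


def apply_hooks(hook_cls: type[VerifyAndUpdateConfig]) -> dict:
    # Early hook runs on the model config before the full config exists.
    model_config: dict = {}
    hook_cls.verify_and_update_model_config(model_config)
    # Late hook runs on the assembled config; dropping this call would
    # silently disable every class that has not been migrated.
    vllm_config = {"model_config": model_config}
    hook_cls.verify_and_update_config(vllm_config)
    return vllm_config
```

Keeping both calls in `apply_hooks` is what lets migrated and unmigrated classes coexist in this sketch.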

vllm/model_executor/models/config.py

Signed-off-by: wang.yuqi <noooop@126.com>

vllm/config/model.py

vllm/model_executor/models/llama.py

chatgpt-codex-connector · 2025-12-24T06:23:55Z

Codex usage limits have been reached for code reviews. Please check with the admins of this repo to increase the limits by adding credits.

tests/models/language/pooling_mteb_test/mteb_embed_utils.py

…onfig. (vllm-project#31131) Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io> Signed-off-by: wang.yuqi <noooop@126.com>

…onfig. (vllm-project#31131) Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io> Signed-off-by: wang.yuqi <noooop@126.com> Signed-off-by: dsuhinin <suhinin.dmitriy@gmail.com>

init

985bbf7

Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io>

mergify bot added the qwen Related to Qwen models label Dec 22, 2025

gemini-code-assist bot reviewed Dec 22, 2025

vllm/model_executor/models/config.py (resolved)

noooop added 4 commits December 23, 2025 09:52

split mteb_utils

afcd3dd

Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io>

fix

0d2c4a5

Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io>

fix

778b642

Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io>

fix

0eae285

Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io>

noooop changed the title from "[Model] Add verify_and_update_model_config for VerifyAndUpdateConfig." to "[Model] Introduce verify_and_update_model_config for VerifyAndUpdateConfig." Dec 23, 2025

noooop and others added 3 commits December 24, 2025 10:29

conflicts

f76a995

Signed-off-by: wang.yuqi <noooop@126.com>

Merge branch 'main' into verify_and_update_model_config

d06709e

fix

f769a4e

Signed-off-by: wang.yuqi <noooop@126.com>

mergify bot added the llama Related to Llama models label Dec 24, 2025

noooop and others added 3 commits December 24, 2025 12:06

update

830397e

Signed-off-by: wang.yuqi <noooop@126.com>

fix

517dc01

Signed-off-by: wang.yuqi <noooop@126.com>

Merge branch 'main' into verify_and_update_model_config

8483bb8

noooop commented Dec 24, 2025

vllm/config/model.py (resolved)

noooop commented Dec 24, 2025

vllm/model_executor/models/llama.py (resolved)

noooop marked this pull request as ready for review December 24, 2025 06:23

noooop requested review from DarkLight1337, ProExpertProg, WoosukKwon, hmellor, houseroad, mgoin, robertgshaw2-redhat, tlrmchlsmth, yewentao256, youkaichao and ywang96 as code owners December 24, 2025 06:23

DarkLight1337 reviewed Dec 24, 2025

tests/models/language/pooling_mteb_test/mteb_embed_utils.py (outdated, resolved)

noooop and others added 3 commits December 24, 2025 15:25

Merge branch 'main' into verify_and_update_model_config

46c90ad

- reorganization

1da3b33

Signed-off-by: wang.yuqi <noooop@126.com>

Merge branch 'main' into verify_and_update_model_config

89bcc6c

DarkLight1337 approved these changes Dec 24, 2025

DarkLight1337 added the ready ONLY add when PR is ready to merge/full CI is needed label Dec 24, 2025

noooop enabled auto-merge (squash) December 24, 2025 08:43

noooop merged commit bd89ce1 into vllm-project:main Dec 24, 2025
59 of 60 checks passed

noooop deleted the verify_and_update_model_config branch December 24, 2025 09:58

noooop mentioned this pull request Jan 4, 2026

[CI Failure] Fix NomicBert max_model_len validation #31662

Merged

5 tasks

ItzDEXX pushed a commit to ItzDEXX/vllm that referenced this pull request Feb 19, 2026

[Model] Introduce verify_and_update_model_config for VerifyAndUpdateC…

27ebe52

…onfig. (vllm-project#31131) Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io> Signed-off-by: wang.yuqi <noooop@126.com>

noooop merged 14 commits into vllm-project:main from noooop:verify_and_update_model_config

2 participants
