[Bugfix] Accept canonicalized `modelopt_*` quant_method in `_extract_modelopt_quant_algo` by vadiklyutiy · Pull Request #42181 · vllm-project/vllm

vadiklyutiy · 2026-05-09T20:24:32Z

ModelArchConfigConvertorBase._normalize_quantization_config rewrites quant_method to the family-specific name (e.g. "modelopt_fp4" for the legacy hf_quant_config.json shape with quant_algo: "NVFP4"). _extract_modelopt_quant_algo then strict-equals against "modelopt" and returns None, so the override loop in ModelConfig._verify_quantization finds no match and validation raises:

Quantization method specified in the model config (modelopt_fp4) does not match the quantization method specified in the quantization argument (modelopt).

This was hitting nvidia/Qwen3.5-397B-A17B-NVFP4 with --quantization modelopt intermittently — one of several spawn-context API-server processes resolved the legacy hf_quant_config.json instead of the modern config.json and the strict check tipped it over.

Replace the equality with startswith("modelopt"). Covers all four registered variants (modelopt, modelopt_fp4, modelopt_mxfp8, modelopt_mixed) and matches what humming.py (config["quant_method"] in [..., "modelopt"]) and utils/torch_utils.py:315 (quant_method.startswith("modelopt")) already accept.

Signed-off-by: Vadim Gimpelson <vadim.gimpelson@gmail.com>

claude

Claude Code Review

This pull request is from a fork — automated review is disabled. A repository maintainer can comment @claude review to run a one-time review.

gemini-code-assist

Code Review

The pull request updates the _extract_modelopt_quant_algo function in vllm/model_executor/layers/quantization/modelopt.py to check if the quant_method starts with "modelopt" instead of requiring an exact match. This change allows for more flexible quantization method naming. I have no feedback to provide as no review comments were present.

yewentao256

LGTM, thanks for the work!

…modelopt_quant_algo` (vllm-project#42181) Signed-off-by: Vadim Gimpelson <vadim.gimpelson@gmail.com>

…modelopt_quant_algo` (vllm-project#42181) Signed-off-by: Vadim Gimpelson <vadim.gimpelson@gmail.com> Signed-off-by: Matt Van Horn <455140+mvanhorn@users.noreply.github.com>

…modelopt_quant_algo` (vllm-project#42181) Signed-off-by: Vadim Gimpelson <vadim.gimpelson@gmail.com>

fix

264864f

Signed-off-by: Vadim Gimpelson <vadim.gimpelson@gmail.com>

vadiklyutiy requested review from mgoin, pavanimajety, robertgshaw2-redhat, tlrmchlsmth and yewentao256 as code owners May 9, 2026 20:24

claude Bot reviewed May 9, 2026

View reviewed changes

mergify Bot added the bug Something isn't working label May 9, 2026

gemini-code-assist Bot reviewed May 9, 2026

View reviewed changes

yewentao256 approved these changes May 9, 2026

View reviewed changes

yewentao256 added the ready ONLY add when PR is ready to merge/full CI is needed label May 9, 2026

yewentao256 approved these changes May 11, 2026

View reviewed changes

yewentao256 merged commit a2e776d into vllm-project:main May 11, 2026
76 checks passed

weifang231 pushed a commit to weifang231/eb-vllm that referenced this pull request May 13, 2026

[Bugfix] Accept canonicalized modelopt_* quant_method in `_extract_…

da4cee3

…modelopt_quant_algo` (vllm-project#42181) Signed-off-by: Vadim Gimpelson <vadim.gimpelson@gmail.com>

mfylcek pushed a commit to mfylcek/vllm that referenced this pull request May 19, 2026

[Bugfix] Accept canonicalized modelopt_* quant_method in `_extract_…

8022b7e

…modelopt_quant_algo` (vllm-project#42181) Signed-off-by: Vadim Gimpelson <vadim.gimpelson@gmail.com>

jhu960213 pushed a commit to jhu960213/vllm that referenced this pull request May 20, 2026

[Bugfix] Accept canonicalized modelopt_* quant_method in `_extract_…

425f245

…modelopt_quant_algo` (vllm-project#42181) Signed-off-by: Vadim Gimpelson <vadim.gimpelson@gmail.com>

h1t35h pushed a commit to h1t35h/vllm that referenced this pull request May 21, 2026

[Bugfix] Accept canonicalized modelopt_* quant_method in `_extract_…

a48ebc7

…modelopt_quant_algo` (vllm-project#42181) Signed-off-by: Vadim Gimpelson <vadim.gimpelson@gmail.com>

knight0528 pushed a commit to knight0528/vllm that referenced this pull request Jun 8, 2026

[Bugfix] Accept canonicalized modelopt_* quant_method in `_extract_…

0f6b326

…modelopt_quant_algo` (vllm-project#42181) Signed-off-by: Vadim Gimpelson <vadim.gimpelson@gmail.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Bugfix] Accept canonicalized `modelopt_*` quant_method in `_extract_modelopt_quant_algo`#42181

[Bugfix] Accept canonicalized `modelopt_*` quant_method in `_extract_modelopt_quant_algo`#42181
yewentao256 merged 1 commit into
vllm-project:mainfrom
vadiklyutiy:modelopt-prefix-fix

vadiklyutiy commented May 9, 2026

Uh oh!

claude Bot left a comment

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

yewentao256 left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

vadiklyutiy commented May 9, 2026

Uh oh!

claude Bot left a comment

Choose a reason for hiding this comment

Claude Code Review

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

yewentao256 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants