[Fix] Add default rope theta for qwen1 model #30369
iwzbi wants to merge 1 commit into vllm-project:main
Conversation
Code Review
This pull request aims to fix a KeyError for Qwen1 models that are missing the rope_theta parameter in their configuration. The fix involves adding a default value for rope_theta. While the change correctly addresses the issue, the implementation can be improved for efficiency and code quality by moving the function call to a more appropriate location where it is executed only once during model initialization, instead of for every layer.
```diff
@@ -149,6 +150,7 @@ def __init__(
         prefix: str = "",
     ):
         super().__init__()
+        set_default_rope_theta(config, default_theta=10000)
```
Calling set_default_rope_theta in QWenBlock.__init__ is inefficient as it's executed for every model layer. This function should be called only once per model initialization.
Please move this call to the beginning of QWenModel.__init__ (e.g., after line 201) to ensure it runs only once. This will improve model loading efficiency and align with best practices seen in other models in this repository.
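The suggested placement can be sketched as follows. This is a hypothetical, trimmed-down sketch: the `Config` stand-in, the layer count, and the body of `set_default_rope_theta` are assumptions for illustration, not the actual vLLM source; only the helper name and `default_theta=10000` come from the diff above.

```python
# Hypothetical sketch of the reviewer's suggestion: call the helper once
# in QWenModel.__init__ rather than once per QWenBlock.

class Config:
    """Stand-in for a HF model config that may lack rope_theta."""
    def __init__(self, **kwargs):
        self.__dict__.update(kwargs)

def set_default_rope_theta(config, default_theta=10000):
    # Only set the default when the attribute is missing, so configs
    # that already define rope_theta are left untouched.
    if getattr(config, "rope_theta", None) is None:
        config.rope_theta = default_theta

class QWenBlock:
    def __init__(self, config):
        # Every block can now rely on rope_theta being present.
        self.rope_theta = config.rope_theta

class QWenModel:
    def __init__(self, config, num_layers=2):
        # Runs once per model, not once per layer.
        set_default_rope_theta(config, default_theta=10000)
        self.blocks = [QWenBlock(config) for _ in range(num_layers)]

model = QWenModel(Config(hidden_size=2048))
print(model.blocks[0].rope_theta)  # 10000
```

Doing the normalisation once at model level keeps the per-layer constructors free of config mutation, which is also easier to reason about when layers are built in parallel or lazily.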
cc @hmellor
Could you please share the checkpoint you are using so I can try to reproduce the error? The latest vllm/vllm/transformers_utils/config.py (lines 309 to 318 in 2dcbac9) …, which means that …
Please try this model: https://huggingface.co/Qwen/Qwen-1_8B-Chat
I've confirmed that this issue is not present when using vLLM with Transformers 4.57.3. It only appears when using the Transformers main branch, which vLLM does not currently support. The reason for the difference is that in Transformers v4 we hand-roll our own standardisation, but for v5 we use the new built-in standardisation (vllm/vllm/transformers_utils/config.py, lines 309 to 328 in aa3c250). The v5 standardisation doesn't check the non-standard names (…). Since these non-standard names only appear in custom models, it doesn't make sense to check them in Transformers itself. I'll make a PR which ensures that these old custom models are forward compatible.
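The fallback-to-legacy-names behaviour described above can be sketched like this. The key names `rotary_emb_base` and `rotary_base` are hypothetical examples of non-standard spellings (the actual names checked by vLLM are not reproduced here), and the function name is invented for illustration; the real logic lives in vllm/transformers_utils/config.py.

```python
# Illustrative sketch only: resolve rope theta from a config dict,
# preferring the standard key, then legacy spellings (hypothetical
# names), then an architecture default.

LEGACY_ROPE_THETA_KEYS = ("rotary_emb_base", "rotary_base")  # assumed names

def resolve_rope_theta(config_dict, default_theta=10000.0):
    if "rope_theta" in config_dict:
        return config_dict["rope_theta"]
    for key in LEGACY_ROPE_THETA_KEYS:
        if key in config_dict:
            # An older custom model spelled the parameter differently;
            # map it onto the standard name.
            return config_dict[key]
    # Neither spelling present: fall back to the default.
    return default_theta

print(resolve_rope_theta({"rotary_emb_base": 1000000.0}))  # 1000000.0
print(resolve_rope_theta({}))  # 10000.0
```

A standardisation that only looks for the standard key (as described for v5) would skip the loop above, which is exactly why configs using only a legacy spelling end up with no `rope_theta` at all.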
Need to set a default rope theta for the Qwen1 model.
error:
Qwen1 config:

Purpose
Test Plan
Test Result
Essential Elements of an Effective PR Description Checklist
supported_models.md and examples for a new model.