[Bugfix] fix default is_neox_style to be True for deepseekv3.2 #34353

Merged
jeejeelee merged 1 commit into vllm-project:main from xyDong0223:ds-fix on Feb 11, 2026

@xyDong0223 xyDong0223 (Contributor) commented Feb 11, 2026

Purpose

Fix the default style of the indexer RoPE: is_neox_style should be True when indexer_rope_interleave is not present in the config.
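
For context, the flag chooses how rotary embeddings pair up the dimensions of a vector: the NeoX style rotates element i together with element i + d/2, while the interleaved (GPT-J) style rotates adjacent elements 2i and 2i + 1. The following is a minimal pure-Python sketch of the two layouts, not vLLM's implementation; the function names rope_neox and rope_interleaved are hypothetical and exist only for illustration.

```python
import math


def rope_neox(x, angles):
    """Rotate pairs (x[i], x[i + half]), i.e. the NeoX layout."""
    half = len(x) // 2
    out = list(x)
    for i, theta in enumerate(angles):
        a, b = x[i], x[i + half]
        out[i] = a * math.cos(theta) - b * math.sin(theta)
        out[i + half] = a * math.sin(theta) + b * math.cos(theta)
    return out


def rope_interleaved(x, angles):
    """Rotate pairs (x[2i], x[2i + 1]), i.e. the interleaved layout."""
    out = list(x)
    for i, theta in enumerate(angles):
        a, b = x[2 * i], x[2 * i + 1]
        out[2 * i] = a * math.cos(theta) - b * math.sin(theta)
        out[2 * i + 1] = a * math.sin(theta) + b * math.cos(theta)
    return out


# Same input and same angles, but a different pairing of dimensions.
x = [1.0, 0.0, 0.0, 0.0]
angles = [math.pi / 2, 0.0]
neox_out = [round(v, 6) for v in rope_neox(x, angles)]
interleaved_out = [round(v, 6) for v in rope_interleaved(x, angles)]
```

Because the same input lands in different positions under the two layouts, a wrong default silently changes the values fed into attention even though nothing fails at load time.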

Test Plan

Test Result


Essential Elements of an Effective PR Description Checklist
  • The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
  • The test plan, such as providing test command.
  • The test results, such as pasting the results comparison before and after, or e2e results
  • (Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
  • (Optional) Release notes update. If your change is user facing, please update the release notes draft in the Google Doc.

Signed-off-by: dongxinyu03 <dongxinyu03@baidu.com>
@mergify mergify bot added the deepseek (Related to DeepSeek models) and bug (Something isn't working) labels Feb 11, 2026

@gemini-code-assist gemini-code-assist bot (Contributor) left a comment

Code Review

This pull request addresses a bug in the DeepseekV2MLAAttention layer where the default value for is_neox_style for the indexer's rotary embedding was incorrect. The change modifies the default value for the indexer_rope_interleave attribute lookup from True to False. This ensures that if indexer_rope_interleave is not present in the model configuration, is_neox_style correctly defaults to True, which is the expected behavior for DeepSeek models. The fix is straightforward and correct.
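
The effect of the fallback value described in this review can be sketched as follows. This is an illustration under assumptions rather than the actual vLLM code: the helper resolve_is_neox_style and its default_interleave parameter are hypothetical, and the only names taken from the review are indexer_rope_interleave and is_neox_style.

```python
from types import SimpleNamespace


def resolve_is_neox_style(config, default_interleave=False):
    """Hypothetical helper: derive is_neox_style from an optional config field.

    is_neox_style is the negation of indexer_rope_interleave, so the
    fallback used when the field is absent decides the default style.
    """
    interleave = getattr(config, "indexer_rope_interleave", default_interleave)
    return not interleave


# A config that does not define indexer_rope_interleave at all.
config_without_field = SimpleNamespace()
# A config that explicitly requests the interleaved layout.
config_interleaved = SimpleNamespace(indexer_rope_interleave=True)

# Fallback of False (after the fix): a missing field means NeoX style.
after_fix = resolve_is_neox_style(config_without_field)
# Fallback of True (before the fix): a missing field disabled NeoX style.
before_fix = resolve_is_neox_style(config_without_field, default_interleave=True)
# An explicit value in the config is respected regardless of the fallback.
explicit = resolve_is_neox_style(config_interleaved)
```

Only configs that omit the field are affected by the change; a config that sets indexer_rope_interleave explicitly resolves the same way before and after.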

@jeejeelee jeejeelee added the ready (ONLY add when PR is ready to merge/full CI is needed) label Feb 11, 2026
@jeejeelee jeejeelee enabled auto-merge (squash) February 11, 2026 16:29
@mgoin mgoin changed the title from "[Bugfix] fix default is_neox_style is True for deepseek" to "[Bugfix] fix default is_neox_style to be True for deepseekv3.2" Feb 11, 2026

@mgoin mgoin (Member) left a comment

It seems this was an unintentional behavior change in #34124

LGTM!

@mgoin mgoin added this to the v0.16.0 cherry picks milestone Feb 11, 2026
@jeejeelee jeejeelee merged commit be7f3d5 into vllm-project:main Feb 11, 2026
58 of 60 checks passed
@xyDong0223 xyDong0223 deleted the ds-fix branch February 12, 2026 03:23
khluu pushed a commit that referenced this pull request Feb 12, 2026
Signed-off-by: dongxinyu03 <dongxinyu03@baidu.com>
(cherry picked from commit be7f3d5)
eldarkurtic pushed a commit to eldarkurtic/vllm that referenced this pull request Feb 19, 2026
…#34353)

Signed-off-by: dongxinyu03 <dongxinyu03@baidu.com>
Signed-off-by: Eldar Kurtic <research@neuralmagic.com>
llsj14 pushed a commit to llsj14/vllm that referenced this pull request Mar 1, 2026
tunglinwood pushed a commit to tunglinwood/vllm that referenced this pull request Mar 4, 2026
Labels

  • bug: Something isn't working
  • deepseek: Related to DeepSeek models
  • ready: ONLY add when PR is ready to merge/full CI is needed

3 participants