Revert "[Perf] DSV3.2 Indexer Fused Weights Projection" (#38684) #38806

Draft
vllm-agent wants to merge 1 commit into vllm-project:main from vllm-agent:auto-revert/pr-38684

@vllm-agent

Revert of #38684

This reverts commit 5f96f9a (PR #38684).

Reason

Nightly CI build #59406 detected 1 new failure linked to this PR:

  • LM Eval Large Models (H200): KeyError: 'model.layers.0.self_attn.indexer.wk_weights_proj.weight_scale_inv' in deepseek_v2.py:1516 during model loading of DeepSeek-V3.2

The fused weights projection changes introduced new weight parameter names that do not exist in the model checkpoint, causing both TP and DP test variants to fail with RuntimeError: Server exited unexpectedly.
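The failure mode described above can be illustrated with a minimal sketch of a name-remapping weight loader. This is not the code in deepseek_v2.py: the mapping table, the `remap_name` and `load_weights` helpers, and the checkpoint key `wk.weight_scale_inv` are all assumptions made for illustration. It only shows how a loader that rewrites per-layer checkpoint names into a fused name can raise a KeyError when the rewritten name has no registered counterpart.

```python
# Hypothetical mapping: (fused parameter name, original weight name, shard id).
STACKED_PARAMS_MAPPING = [
    ("wk_weights_proj", "wk", 0),
    ("wk_weights_proj", "weights_proj", 1),
]

PREFIX = "model.layers.0.self_attn.indexer"


def remap_name(ckpt_name):
    """Rewrite a checkpoint name to its fused name; shard id is None if unchanged."""
    parts = ckpt_name.split(".")
    for fused, original, shard_id in STACKED_PARAMS_MAPPING:
        if original in parts:
            parts[parts.index(original)] = fused
            return ".".join(parts), shard_id
    return ckpt_name, None


def load_weights(params, checkpoint):
    """Copy checkpoint tensors into registered parameter slots."""
    loaded = set()
    for ckpt_name, tensor in checkpoint.items():
        target, shard_id = remap_name(ckpt_name)
        slot = params[target]  # raises KeyError if the fused name is not registered
        slot[shard_id] = tensor
        loaded.add(target)
    return loaded


def demo_params():
    # Assumption: only the fused weight is registered, with no scale parameter.
    return {f"{PREFIX}.wk_weights_proj.weight": {}}


def demo_checkpoint(with_scale):
    ckpt = {
        f"{PREFIX}.wk.weight": [1.0],
        f"{PREFIX}.weights_proj.weight": [2.0],
    }
    if with_scale:
        # Assumption: the checkpoint carries a per-layer scale tensor under this name.
        ckpt[f"{PREFIX}.wk.weight_scale_inv"] = [0.5]
    return ckpt
```

In this sketch, calling `load_weights(demo_params(), demo_checkpoint(with_scale=True))` fails on the remapped scale name, while the same call with `with_scale=False` completes. Whether the real failure follows this path is not confirmed by the CI report.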


Auto-generated by CI failure analyzer

@mergify mergify Bot added the deepseek Related to DeepSeek models label Apr 2, 2026
Contributor

@gemini-code-assist gemini-code-assist Bot left a comment

Code Review

This pull request refactors the DeepSeek model implementation by decoupling the fused 'wk' and 'weights_proj' linear layers into separate components in 'deepseek_v2.py' and updating the corresponding weight loading mappings in 'deepseek_mtp.py' and 'deepseek_v2.py'. I have no feedback to provide as there are no review comments.
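For readers unfamiliar with the term, fusing two bias-free linear layers that share an input generally means stacking their weight rows, running one matrix product, and splitting the result. The sketch below shows that equivalence in plain Python. The shapes and values are made up, and `linear` and `fused_linear` are hypothetical helpers, not vLLM APIs; the actual layer classes used in deepseek_v2.py are not shown in this conversation.

```python
def linear(weight, x):
    """Compute y = W x for a weight given as a list of rows (no bias)."""
    return [sum(w * v for w, v in zip(row, x)) for row in weight]


def fused_linear(weights, x):
    """Stack several weights row-wise, apply one product, split the output."""
    fused = [row for w in weights for row in w]
    y = linear(fused, x)
    outputs, start = [], 0
    for w in weights:
        outputs.append(y[start:start + len(w)])
        start += len(w)
    return outputs


# Toy weights standing in for the two projections (values are illustrative).
wk = [[1.0, 0.0, 2.0], [0.0, 1.0, 1.0]]
weights_proj = [[0.5, 0.5, 0.5]]
x = [1.0, 2.0, 3.0]

separate = [linear(wk, x), linear(weights_proj, x)]
fused = fused_linear([wk, weights_proj], x)
```

Here `separate` and `fused` hold the same values, which is why a revert that splits the layers again changes parameter names and loading logic but not the computed result.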

@mergify
Contributor

mergify Bot commented Apr 4, 2026

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @vllm-agent.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

@mergify mergify Bot added the needs-rebase label Apr 4, 2026
Labels

deepseek (Related to DeepSeek models), needs-rebase