Skip to content

[Model][1/N] Delete deepseek v2/v3 modeling codes.#3189

Merged
wangxiyuan merged 6 commits intovllm-project:mainfrom
whx-sjtu:rm_ds_model
Oct 20, 2025
Merged

[Model][1/N] Delete deepseek v2/v3 modeling codes.#3189
wangxiyuan merged 6 commits intovllm-project:mainfrom
whx-sjtu:rm_ds_model

Conversation

@whx-sjtu
Copy link
Copy Markdown
Collaborator

@whx-sjtu whx-sjtu commented Sep 25, 2025

This PR deletes model codes of deepseek_v2 and deepseek_v3 to reuse the model file from vLLM.

vLLM Ascend now uses custom ops register way instead of model file hard-coding.

@github-actions
Copy link
Copy Markdown
Contributor

👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:‌‌

  • A PR should do only one thing, smaller PRs enable faster reviews.
  • Every PR should include unit tests and end-to-end tests ‌to ensure it works and is not broken by other future PRs.
  • Write the commit message by fulfilling the PR description to help reviewer and future developers understand.

If CI fails, you can run linting and testing checks locally according Contributing and Testing.

Copy link
Copy Markdown
Contributor

@gemini-code-assist gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request deletes the file vllm_ascend/models/deepseek_v2.py, which contains the non-torchair implementations for DeepSeek V2 and V3 models. While the goal seems to be cleanup, this change is incomplete and introduces breaking changes. Specifically, vllm_ascend/models/__init__.py still attempts to register models from the deleted file, which will cause an ImportError. Additionally, vllm_ascend/models/deepseek_mtp.py relies on a monkey patch from the deleted file, and its behavior will change, potentially leading to correctness or performance issues. The test file tests/ut/models/test_deepseek_v2.py also has a direct dependency on the deleted file and will fail. This pull request should be updated to resolve these dangling dependencies.

Comment thread vllm_ascend/models/deepseek_v2.py
@whx-sjtu whx-sjtu force-pushed the rm_ds_model branch 3 times, most recently from 1730fb5 to e25f3ed Compare September 28, 2025 07:42
@whx-sjtu whx-sjtu force-pushed the rm_ds_model branch 4 times, most recently from 1d78a51 to 0b1e706 Compare September 28, 2025 13:20
@whx-sjtu whx-sjtu added ready read for review ready-for-test start test by label for PR labels Sep 28, 2025
@whx-sjtu whx-sjtu force-pushed the rm_ds_model branch 2 times, most recently from 71be184 to 380d3ef Compare September 29, 2025 13:18
@github-actions
Copy link
Copy Markdown
Contributor

This pull request has conflicts, please resolve those before we can evaluate the pull request.

@github-actions github-actions Bot removed the ready read for review label Sep 30, 2025
@LCAIZJ
Copy link
Copy Markdown
Collaborator

LCAIZJ commented Oct 9, 2025

Why is it possible to delete these two model files now?

@github-actions
Copy link
Copy Markdown
Contributor

This pull request has conflicts, please resolve those before we can evaluate the pull request.

@whx-sjtu whx-sjtu removed the ready-for-test start test by label for PR label Oct 15, 2025
@whx-sjtu whx-sjtu changed the title [Model] Delete deepseek modeling codes. [Model][WIP] Delete deepseek modeling codes. Oct 16, 2025
@whx-sjtu
Copy link
Copy Markdown
Collaborator Author

Why is it possible to delete these two model files now?

What do you think is the reason why it cannot be deleted? If you refer to the CUDA hard-coding issue of DeepSeek 3.2, I will circumvent it through a patch later. @LCAIZJ

@github-actions
Copy link
Copy Markdown
Contributor

This pull request has conflicts, please resolve those before we can evaluate the pull request.

Signed-off-by: whx-sjtu <2952154980@qq.com>
Signed-off-by: whx-sjtu <2952154980@qq.com>
Signed-off-by: whx-sjtu <2952154980@qq.com>
Signed-off-by: whx-sjtu <2952154980@qq.com>
Signed-off-by: whx-sjtu <2952154980@qq.com>
Signed-off-by: whx-sjtu <2952154980@qq.com>
@whx-sjtu whx-sjtu added ready read for review ready-for-test start test by label for PR labels Oct 20, 2025
@whx-sjtu whx-sjtu changed the title [Model][WIP] Delete deepseek modeling codes. [Model] Delete deepseek v2/v3 modeling codes. Oct 20, 2025
@whx-sjtu whx-sjtu changed the title [Model] Delete deepseek v2/v3 modeling codes. [Model][1/N] Delete deepseek v2/v3 modeling codes. Oct 20, 2025
@wangxiyuan
Copy link
Copy Markdown
Collaborator

Nice work

@wangxiyuan wangxiyuan merged commit f8b52fe into vllm-project:main Oct 20, 2025
39 of 42 checks passed
ZYang6263 pushed a commit to rjg-lyh/vllm-ascend that referenced this pull request Oct 23, 2025
This PR deletes model codes of deepseek_v2 and deepseek_v3 to reuse the
model file from vLLM.

vLLM Ascend now uses custom ops register way instead of model file
hard-coding.

- vLLM version: v0.11.0rc3
- vLLM main: https://github.com/vllm-project/vllm/commit/v0.11.0

---------

Signed-off-by: whx-sjtu <2952154980@qq.com>
wangxiyuan pushed a commit that referenced this pull request Oct 30, 2025
This is the follow-up PR to PR #3189, which continues to refactor sfa
into mla and finally remove deepseek_v3_2.py. This is the last PR of
deepseek modeling refactoring. After this, all deepseek-related model
codes are removed from vllm_ascend.

FurtherMore, after this PR deepseek v3.2 can run chunk-prefill with
correct accuracy.

- vLLM version: v0.11.0rc3
- vLLM main:
vllm-project/vllm@83f478b

---------

Signed-off-by: whx-sjtu <2952154980@qq.com>
luolun pushed a commit to luolun/vllm-ascend that referenced this pull request Nov 19, 2025
This PR deletes model codes of deepseek_v2 and deepseek_v3 to reuse the
model file from vLLM.

vLLM Ascend now uses custom ops register way instead of model file
hard-coding.

- vLLM version: v0.11.0rc3
- vLLM main: https://github.com/vllm-project/vllm/commit/v0.11.0

---------

Signed-off-by: whx-sjtu <2952154980@qq.com>
Signed-off-by: luolun <luolun1995@cmbchina.com>
luolun pushed a commit to luolun/vllm-ascend that referenced this pull request Nov 19, 2025
…project#3769)

This is the follow-up PR to PR vllm-project#3189, which continues to refactor sfa
into mla and finally remove deepseek_v3_2.py. This is the last PR of
deepseek modeling refactoring. After this, all deepseek-related model
codes are removed from vllm_ascend.

FurtherMore, after this PR deepseek v3.2 can run chunk-prefill with
correct accuracy.

- vLLM version: v0.11.0rc3
- vLLM main:
vllm-project/vllm@83f478b

---------

Signed-off-by: whx-sjtu <2952154980@qq.com>
Signed-off-by: luolun <luolun1995@cmbchina.com>
hwhaokun pushed a commit to hwhaokun/vllm-ascend that referenced this pull request Nov 19, 2025
This PR deletes model codes of deepseek_v2 and deepseek_v3 to reuse the
model file from vLLM.

vLLM Ascend now uses custom ops register way instead of model file
hard-coding.

- vLLM version: v0.11.0rc3
- vLLM main: https://github.com/vllm-project/vllm/commit/v0.11.0

---------

Signed-off-by: whx-sjtu <2952154980@qq.com>
Signed-off-by: hwhaokun <haokun0405@163.com>
hwhaokun pushed a commit to hwhaokun/vllm-ascend that referenced this pull request Nov 19, 2025
…project#3769)

This is the follow-up PR to PR vllm-project#3189, which continues to refactor sfa
into mla and finally remove deepseek_v3_2.py. This is the last PR of
deepseek modeling refactoring. After this, all deepseek-related model
codes are removed from vllm_ascend.

FurtherMore, after this PR deepseek v3.2 can run chunk-prefill with
correct accuracy.

- vLLM version: v0.11.0rc3
- vLLM main:
vllm-project/vllm@83f478b

---------

Signed-off-by: whx-sjtu <2952154980@qq.com>
Signed-off-by: hwhaokun <haokun0405@163.com>
NSDie pushed a commit to NSDie/vllm-ascend that referenced this pull request Nov 24, 2025
This PR deletes model codes of deepseek_v2 and deepseek_v3 to reuse the
model file from vLLM.

vLLM Ascend now uses custom ops register way instead of model file
hard-coding.

- vLLM version: v0.11.0rc3
- vLLM main: https://github.com/vllm-project/vllm/commit/v0.11.0

---------

Signed-off-by: whx-sjtu <2952154980@qq.com>
Signed-off-by: nsdie <yeyifan@huawei.com>
NSDie pushed a commit to NSDie/vllm-ascend that referenced this pull request Nov 24, 2025
…project#3769)

This is the follow-up PR to PR vllm-project#3189, which continues to refactor sfa
into mla and finally remove deepseek_v3_2.py. This is the last PR of
deepseek modeling refactoring. After this, all deepseek-related model
codes are removed from vllm_ascend.

FurtherMore, after this PR deepseek v3.2 can run chunk-prefill with
correct accuracy.

- vLLM version: v0.11.0rc3
- vLLM main:
vllm-project/vllm@83f478b

---------

Signed-off-by: whx-sjtu <2952154980@qq.com>
Signed-off-by: nsdie <yeyifan@huawei.com>
Clorist33 pushed a commit to Clorist33/vllm-ascend that referenced this pull request Dec 9, 2025
This PR deletes model codes of deepseek_v2 and deepseek_v3 to reuse the
model file from vLLM.

vLLM Ascend now uses custom ops register way instead of model file
hard-coding.

- vLLM version: v0.11.0rc3
- vLLM main: https://github.com/vllm-project/vllm/commit/v0.11.0

---------

Signed-off-by: whx-sjtu <2952154980@qq.com>
Clorist33 pushed a commit to Clorist33/vllm-ascend that referenced this pull request Dec 10, 2025
…project#3769)

This is the follow-up PR to PR vllm-project#3189, which continues to refactor sfa
into mla and finally remove deepseek_v3_2.py. This is the last PR of
deepseek modeling refactoring. After this, all deepseek-related model
codes are removed from vllm_ascend.

FurtherMore, after this PR deepseek v3.2 can run chunk-prefill with
correct accuracy.

- vLLM version: v0.11.0rc3
- vLLM main:
vllm-project/vllm@83f478b

---------

Signed-off-by: whx-sjtu <2952154980@qq.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants