[Model][1/N] Delete deepseek v2/v3 modeling codes. #3189
wangxiyuan merged 6 commits into vllm-project:main
Conversation
👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:
If CI fails, you can run linting and testing checks locally according to the Contributing and Testing guides.
Code Review
This pull request deletes the file vllm_ascend/models/deepseek_v2.py, which contains the non-torchair implementations for DeepSeek V2 and V3 models. While the goal seems to be cleanup, this change is incomplete and introduces breaking changes. Specifically, vllm_ascend/models/__init__.py still attempts to register models from the deleted file, which will cause an ImportError. Additionally, vllm_ascend/models/deepseek_mtp.py relies on a monkey patch from the deleted file, and its behavior will change, potentially leading to correctness or performance issues. The test file tests/ut/models/test_deepseek_v2.py also has a direct dependency on the deleted file and will fail. This pull request should be updated to resolve these dangling dependencies.
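The failure mode the review describes can be illustrated with a minimal sketch (the registry helper and names below are hypothetical stand-ins, not the actual vllm_ascend code): a lazy model registry maps an architecture name to a dotted "module:class" path, so a stale entry only blows up when the deleted module is first imported.

```python
import importlib

# Hypothetical lazy registry: architecture name -> "module:class" path,
# mirroring the registration style used in vllm_ascend/models/__init__.py.
_REGISTRY: dict[str, str] = {
    "DeepseekV2ForCausalLM":
        "vllm_ascend.models.deepseek_v2:CustomDeepseekV2ForCausalLM",
}

def resolve(arch: str):
    """Import the module backing a registered architecture on first use."""
    module_path, _, class_name = _REGISTRY[arch].partition(":")
    # Raises ModuleNotFoundError if the backing file has been deleted.
    module = importlib.import_module(module_path)
    return getattr(module, class_name)

try:
    resolve("DeepseekV2ForCausalLM")
except ModuleNotFoundError as exc:
    # With deepseek_v2.py deleted, the dangling registration surfaces here.
    print(f"dangling registration, missing module: {exc.name}")
```

This is why the review asks for the registration in `__init__.py` to be removed in the same PR as the file deletion: the registry itself gives no error until resolution time.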
This pull request has conflicts, please resolve those before we can evaluate the pull request.
Why is it possible to delete these two model files now?
What makes you think it cannot be deleted? If you are referring to the CUDA hard-coding issue in DeepSeek 3.2, I will work around it with a patch later. @LCAIZJ
Signed-off-by: whx-sjtu <2952154980@qq.com>
Nice work
This PR deletes the model code for deepseek_v2 and deepseek_v3 so that the model files from vLLM can be reused. vLLM Ascend now uses a custom-ops registration approach instead of hard-coded model files. - vLLM version: v0.11.0rc3 - vLLM main: https://github.com/vllm-project/vllm/commit/v0.11.0 --------- Signed-off-by: whx-sjtu <2952154980@qq.com>
This is the follow-up to PR #3189; it continues refactoring sfa into mla and finally removes deepseek_v3_2.py. This is the last PR of the deepseek modeling refactoring: after it, all deepseek-related model code is removed from vllm_ascend. Furthermore, after this PR DeepSeek V3.2 can run chunked prefill with correct accuracy. - vLLM version: v0.11.0rc3 - vLLM main: vllm-project/vllm@83f478b --------- Signed-off-by: whx-sjtu <2952154980@qq.com>
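The custom-ops registration approach can be sketched generically (the registry, decorator, and op names below are illustrative assumptions, not the actual vLLM API): instead of forking an entire model file to swap one layer, the platform plugin registers a replacement implementation under the op's name, and the unmodified upstream model resolves the op by name at construction time.

```python
from typing import Callable, Dict

# Hypothetical op registry: op name -> implementation.
_CUSTOM_OPS: Dict[str, Callable] = {}

def register_op(name: str):
    """Decorator a platform plugin uses to override a named op."""
    def wrap(impl: Callable) -> Callable:
        _CUSTOM_OPS[name] = impl
        return impl
    return wrap

def get_op(name: str, default: Callable) -> Callable:
    """Model code looks ops up by name, falling back to the stock version."""
    return _CUSTOM_OPS.get(name, default)

# Stock implementation shipped with the (hypothetical) upstream model file.
def stock_rms_norm(x: str) -> str:
    return f"stock({x})"

# A platform plugin (e.g. an Ascend backend) swaps in its own kernel
# without duplicating the model definition.
@register_op("rms_norm")
def ascend_rms_norm(x: str) -> str:
    return f"ascend({x})"

# The unmodified model picks up the override transparently.
rms_norm = get_op("rms_norm", stock_rms_norm)
print(rms_norm("hidden"))
```

With this pattern, deleting the copied model files is safe because the Ascend-specific behavior lives entirely in registered ops, not in a parallel model definition.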