Revert "[Feat] Shared expert dp for deepseek and deepseek_mtp (#3495)"#3586
Revert "[Feat] Shared expert dp for deepseek and deepseek_mtp (#3495)"#3586wangxiyuan merged 1 commit intovllm-project:mainfrom
Conversation
|
👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:
If CI fails, you can run linting and testing checks locally according Contributing and Testing. |
463a42a to
d27f82a
Compare
There was a problem hiding this comment.
Code Review
This pull request is intended to revert the 'Shared expert dp for deepseek and deepseek_mtp' feature. While most of the changes correctly revert the feature, I've found that some files are incorrectly adding parts of the feature back in. Specifically, vllm_ascend/platform.py and vllm_ascend/spec_decode/mtp_proposer.py contain additions related to enable_shared_expert_dp which contradict the goal of this revert. These changes need to be corrected to complete the revert properly. Please see my detailed comments.
|
This pull request has conflicts, please resolve those before we can evaluate the pull request. |
…roject#3495)" This reverts commit bf87606. Signed-off-by: linfeng-yuan <1102311262@qq.com>
d27f82a to
45a67fe
Compare
…roject#3495)" (vllm-project#3586) This reverts commit vllm-project@bf87606. No. E2E vllm serving with `enable_shared_expert_dp: true` in eager mode as before. - vLLM version: v0.11.0rc3 - vLLM main: https://github.com/vllm-project/vllm/commit/v0.11.0 Signed-off-by: linfeng-yuan <1102311262@qq.com>
…roject#3495)" (vllm-project#3586) ### What this PR does / why we need it? This reverts commit vllm-project@bf87606. ### Does this PR introduce _any_ user-facing change? No. ### How was this patch tested? E2E vllm serving with `enable_shared_expert_dp: true` in eager mode as before. - vLLM version: v0.11.0rc3 - vLLM main: https://github.com/vllm-project/vllm/commit/v0.11.0 Signed-off-by: linfeng-yuan <1102311262@qq.com> Signed-off-by: luolun <luolun1995@cmbchina.com>
…roject#3495)" (vllm-project#3586) ### What this PR does / why we need it? This reverts commit vllm-project@bf87606. ### Does this PR introduce _any_ user-facing change? No. ### How was this patch tested? E2E vllm serving with `enable_shared_expert_dp: true` in eager mode as before. - vLLM version: v0.11.0rc3 - vLLM main: https://github.com/vllm-project/vllm/commit/v0.11.0 Signed-off-by: linfeng-yuan <1102311262@qq.com> Signed-off-by: hwhaokun <haokun0405@163.com>
…roject#3495)" (vllm-project#3586) ### What this PR does / why we need it? This reverts commit vllm-project@bf87606. ### Does this PR introduce _any_ user-facing change? No. ### How was this patch tested? E2E vllm serving with `enable_shared_expert_dp: true` in eager mode as before. - vLLM version: v0.11.0rc3 - vLLM main: https://github.com/vllm-project/vllm/commit/v0.11.0 Signed-off-by: linfeng-yuan <1102311262@qq.com> Signed-off-by: nsdie <yeyifan@huawei.com>
…roject#3495)" (vllm-project#3586) ### What this PR does / why we need it? This reverts commit vllm-project@bf87606. ### Does this PR introduce _any_ user-facing change? No. ### How was this patch tested? E2E vllm serving with `enable_shared_expert_dp: true` in eager mode as before. - vLLM version: v0.11.0rc3 - vLLM main: https://github.com/vllm-project/vllm/commit/v0.11.0 Signed-off-by: linfeng-yuan <1102311262@qq.com>
What this PR does / why we need it?
This reverts commit bf87606.
Does this PR introduce any user-facing change?
No.
How was this patch tested?
E2E vllm serving with
enable_shared_expert_dp: truein eager mode as before.