Skip to content

[Quantization][MoE] remove unused ep logic from moe marlin#31571

Merged
vllm-bot merged 2 commits intovllm-project:mainfrom
jinzhen-lin:remove-moe-marlin-unused-ep-logic
Jan 6, 2026
Merged

[Quantization][MoE] remove unused ep logic from moe marlin#31571
vllm-bot merged 2 commits intovllm-project:mainfrom
jinzhen-lin:remove-moe-marlin-unused-ep-logic

Conversation

@jinzhen-lin
Copy link
Copy Markdown
Contributor

@jinzhen-lin jinzhen-lin commented Dec 31, 2025

After #29642 , the ep support for moe marlin kernel is no longer required. Therefore, we can remove this dead code.

Signed-off-by: Jinzhen Lin <jinzhen.ljz@antgroup.com>
Copy link
Copy Markdown
Contributor

@gemini-code-assist gemini-code-assist bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request removes unused is_ep (expert parallelism) logic from the MoE Marlin kernel. The changes are consistent across Python, C++, and CUDA files, simplifying the codebase by removing dead code. This refactoring is a good improvement for maintainability, as it makes the kernel logic more straightforward. The assumption is that expert parallelism details are now handled before the kernel is invoked, which is a sound design choice. The changes appear correct and well-executed.

@mgoin mgoin added the ready ONLY add when PR is ready to merge/full CI is needed label Jan 5, 2026
@mgoin mgoin enabled auto-merge (squash) January 5, 2026 21:06
@vllm-bot vllm-bot merged commit 2f4bdee into vllm-project:main Jan 6, 2026
89 of 91 checks passed
yugong333 pushed a commit to yugong333/vllm that referenced this pull request Jan 9, 2026
…ect#31571)

Signed-off-by: Jinzhen Lin <jinzhen.ljz@antgroup.com>
Co-authored-by: Michael Goin <mgoin64@gmail.com>
akh64bit pushed a commit to akh64bit/vllm that referenced this pull request Jan 16, 2026
…ect#31571)

Signed-off-by: Jinzhen Lin <jinzhen.ljz@antgroup.com>
Co-authored-by: Michael Goin <mgoin64@gmail.com>
dsuhinin pushed a commit to dsuhinin/vllm that referenced this pull request Jan 21, 2026
…ect#31571)

Signed-off-by: Jinzhen Lin <jinzhen.ljz@antgroup.com>
Co-authored-by: Michael Goin <mgoin64@gmail.com>
Signed-off-by: dsuhinin <suhinin.dmitriy@gmail.com>
ItzDEXX pushed a commit to ItzDEXX/vllm that referenced this pull request Feb 19, 2026
…ect#31571)

Signed-off-by: Jinzhen Lin <jinzhen.ljz@antgroup.com>
Co-authored-by: Michael Goin <mgoin64@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ready ONLY add when PR is ready to merge/full CI is needed

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants