[bugfix] remove the EP buffer allocation introduced by fused-op dispatch_ffn_c… by kiscad · Pull Request #5284 · vllm-project/vllm-ascend

kiscad · 2025-12-23T07:24:39Z

What this PR does / why we need it?

This PR removes the Expert Parallel (EP) HCCL buffer allocation that was previously introduced by the fused-op dispatch_ffn_combine (add dispatch_gmm_combine kernel #3532 ), since the fused-op has switch to MC2 HCCL buffer ([bugfix] Use FUSED_MC2 MoE comm path for the op dispatch_ffn_combine #5156 ).

Does this PR introduce any user-facing change?

How was this patch tested?

vLLM version: release/v0.13.0
vLLM main: vllm-project/vllm@ad32e3e

gemini-code-assist

Code Review

This pull request removes the calculate_ep_buffer_size function and its usage for configuring the buffer size for the 'ep' (expert parallel) process group. This change appears to be a cleanup of obsolete code. Based on the pull request title, this specific buffer allocation was likely introduced for the dispatch_ffn_combine fused operator. The codebase indicates that this operator now uses the 'mc2' communication group, which has a different buffer configuration mechanism, rendering the 'ep' buffer calculation unnecessary. By removing this, the 'ep' group will fall back to using the default buffer size, which is appropriate for its remaining uses. The change is sound and improves code maintainability by removing dead code.

github-actions · 2025-12-23T07:38:09Z

👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:‌‌

A PR should do only one thing, smaller PRs enable faster reviews.
Every PR should include unit tests and end-to-end tests ‌to ensure it works and is not broken by other future PRs.
Write the commit message by fulfilling the PR description to help reviewer and future developers understand.

If CI fails, you can run linting and testing checks locally according Contributing and Testing.

…ombine Signed-off-by: Chen Chen <0109chenchen@gmail.com>

…tch_ffn_c… (vllm-project#5284) ### What this PR does / why we need it? - This PR removes the Expert Parallel (EP) HCCL buffer allocation that was previously introduced by the fused-op `dispatch_ffn_combine` (vllm-project#3532 ), since the fused-op has switch to MC2 HCCL buffer (vllm-project#5156 ). ### Does this PR introduce _any_ user-facing change? ### How was this patch tested? - vLLM version: release/v0.13.0 - vLLM main: vllm-project/vllm@ad32e3e Signed-off-by: Chen Chen <0109chenchen@gmail.com> Signed-off-by: zrj026 <zhangrunjiang026@gmail.com>

gemini-code-assist bot reviewed Dec 23, 2025

View reviewed changes

github-actions bot added the module:core label Dec 23, 2025

weijinqian0 approved these changes Dec 23, 2025

View reviewed changes

zzzzwwjj approved these changes Dec 23, 2025

View reviewed changes

kiscad changed the title ~~remove the EP buffer allocation introduced by fused-op dispatch_ffn_c…~~ [bugfix] remove the EP buffer allocation introduced by fused-op dispatch_ffn_c… Dec 23, 2025

weijinqian0 added ready read for review ready-for-test start test by label for PR labels Dec 23, 2025

remove the EP buffer allocation introduced by fused-op dispatch_ffn_c…

a9e09a1

…ombine Signed-off-by: Chen Chen <0109chenchen@gmail.com>

zzzzwwjj force-pushed the fix-utils branch from 5ccefb3 to a9e09a1 Compare December 24, 2025 03:25

zzzzwwjj merged commit 9227e6a into vllm-project:main Dec 24, 2025
10 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[bugfix] remove the EP buffer allocation introduced by fused-op dispatch_ffn_c…#5284

[bugfix] remove the EP buffer allocation introduced by fused-op dispatch_ffn_c…#5284
zzzzwwjj merged 1 commit intovllm-project:mainfrom
kiscad:fix-utils

kiscad commented Dec 23, 2025 •

edited by github-actions bot

Loading

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

github-actions bot commented Dec 23, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

kiscad commented Dec 23, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What this PR does / why we need it?

Does this PR introduce any user-facing change?

How was this patch tested?

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

github-actions bot commented Dec 23, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

kiscad commented Dec 23, 2025 •

edited by github-actions bot

Loading