[0.13.0][cherry-pick][bugfix](CP,MLA) fix wrong slot_mapping of decode for mixed p/d batch by pisceskkk · Pull Request #6346 · vllm-project/vllm-ascend

pisceskkk · 2026-01-28T07:23:58Z

What this PR does / why we need it?

PR #5672 removed the -1 padding for duplicate tokens from the decode slot_mapping when adapting PCP for MLAPO, and switched to a simpler slicing approach. However, in the single-ops path and in mixed prefill/decode (PD) batches, the decode slot_mapping still contained the -1 padding while using the same slicing method, which produced an incorrect slot_mapping. This PR fixes that; the logic will be consolidated further in subsequent refactoring PRs.
ref: #6344
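
A toy illustration of the failure mode described above. This is not the vllm-ascend code: the layout (one real slot plus -1 padding per decode token, with decode entries ahead of prefill entries), the helper names, and the slot values are all assumptions made for this sketch.

```python
PAD = -1  # padding value for duplicate decode tokens, per the description above


def slice_decode_slots(slot_mapping, num_decode_tokens):
    # Plain slicing: only correct if the padding was already removed.
    return slot_mapping[:num_decode_tokens]


def strip_then_slice(slot_mapping, num_decode_entries, num_decode_tokens):
    # Drop the padding from the (hypothetical) decode region first, then slice.
    decode_region = slot_mapping[:num_decode_entries]
    valid = [slot for slot in decode_region if slot != PAD]
    return valid[:num_decode_tokens]


# Hypothetical mixed batch: 2 decode tokens, each padded to 2 entries,
# followed by 3 prefill slots.
mixed = [10, PAD, PAD, 21, 30, 31, 32]

sliced = slice_decode_slots(mixed, 2)   # still contains a PAD entry
fixed = strip_then_slice(mixed, 4, 2)   # only real decode slots remain
print(sliced, fixed)
```

In this sketch, slicing the padded mapping returns a -1 in place of the second decode token's slot, while stripping the padding first returns both real slots.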

Signed-off-by: QiuChunshuo <qiuchunshuo@huawei.com>

gemini-code-assist

Code Review

This pull request addresses a bug in how slot_mapping is handled for decode tokens in mixed prefill/decode batches under context parallelism. The change correctly removes a condition that limited a necessary slot_mapping adjustment to decode-only batches. By applying this logic to mixed batches as well, the slot_mapping for decode tokens is now correctly computed. The fix is well-targeted and appears correct. I have no further comments.
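
The review does not show the removed condition, so the following is only a guess at its shape: a minimal sketch in which a hypothetical `decode_only_gate` flag stands in for the old check, and `num_prefills == 0` is an assumed way of detecting a decode-only batch. None of these names come from the PR.

```python
PAD = -1  # padding value for duplicate decode tokens


def decode_slot_mapping(slot_mapping, num_decode_entries, num_prefills,
                        decode_only_gate):
    # Take the (hypothetical) decode region at the front of the mapping.
    decode_region = slot_mapping[:num_decode_entries]
    # Old behaviour (assumed): adjust only when the batch has no prefills.
    # New behaviour (assumed): adjust whenever decode entries are present.
    apply_adjustment = (num_prefills == 0) if decode_only_gate else True
    if apply_adjustment:
        return [slot for slot in decode_region if slot != PAD]
    return decode_region


# Hypothetical mixed batch: 4 decode entries (2 real, 2 padded) + 3 prefill slots.
mixed = [10, PAD, PAD, 21, 30, 31, 32]

before = decode_slot_mapping(mixed, 4, num_prefills=1, decode_only_gate=True)
after = decode_slot_mapping(mixed, 4, num_prefills=1, decode_only_gate=False)
print(before, after)
```

With the gate in place, the mixed batch keeps its padding; without it, mixed and decode-only batches are handled the same way.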

[bugfix](CP,MLA) fix wrong slot_mapping of decode for mixed p/d batch

8023ad0

Signed-off-by: QiuChunshuo <qiuchunshuo@huawei.com>

gemini-code-assist bot reviewed Jan 28, 2026

weiguihua2 added the ready and ready-for-test labels Jan 29, 2026

wangxiyuan merged commit 6ba7a5a into vllm-project:releases/v0.13.0 Jan 29, 2026
19 checks passed

pisceskkk deleted the pcp/mla/bugfix-013 branch February 3, 2026 02:47
