[bugfix](CP) Fix and unify the PD request discrimination logic. by pisceskkk · Pull Request #5939 · vllm-project/vllm-ascend

pisceskkk · 2026-01-15T13:28:56Z

What this PR does / why we need it?

Since the PR (vllm-project/vllm#32118) has modified the criteria for judging Prefill and Decode requests in vLLM, PCPManager needs to synchronize with this standard. As PCPManager involves multiple calculations of PD request counts, this PR attempts to consolidate the related logic and update the PD request count once per batch.

How was this patch tested?

pytest tests/e2e/multicard/4-cards/long_sequence/test_mtp.py

vLLM version: v0.13.0
vLLM main: vllm-project/vllm@11b6af5

github-actions · 2026-01-15T13:29:14Z

👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:‌‌

A PR should do only one thing, smaller PRs enable faster reviews.
Every PR should include unit tests and end-to-end tests ‌to ensure it works and is not broken by other future PRs.
Write the commit message by fulfilling the PR description to help reviewer and future developers understand.

If CI fails, you can run linting and testing checks locally according Contributing and Testing.

wangxiyuan · 2026-01-16T09:06:49Z

vllm_ascend/attention/utils.py

        if query_lens_pcp_full is None else query_lens_pcp_full
-    is_prefill = query_lens > decode_threshold
+    num_computed_tokens = common_attn_metadata.num_computed_tokens_cpu
+    is_prefill = (query_lens > decode_threshold) | (num_computed_tokens == 0)


@zzzzwwjj @whx-sjtu

github-actions · 2026-01-19T01:02:21Z

This pull request has conflicts, please resolve those before we can evaluate the pull request.

github-actions · 2026-01-23T01:46:28Z

This pull request has conflicts, please resolve those before we can evaluate the pull request.

Signed-off-by: QiuChunshuo <qiuchunshuo@huawei.com>

…-project#5939) ### What this PR does / why we need it? Since the PR (vllm-project/vllm#32118) has modified the criteria for judging Prefill and Decode requests in vLLM, PCPManager needs to synchronize with this standard. As PCPManager involves multiple calculations of PD request counts, this PR attempts to consolidate the related logic and update the PD request count once per batch. ### How was this patch tested? ```bash pytest tests/e2e/multicard/4-cards/long_sequence/test_mtp.py ``` - vLLM version: v0.13.0 - vLLM main: vllm-project/vllm@11b6af5 Signed-off-by: QiuChunshuo <qiuchunshuo@huawei.com> Signed-off-by: momochenchuw <chenchuw@huawei.com>

…-project#5939) ### What this PR does / why we need it? Since the PR (vllm-project/vllm#32118) has modified the criteria for judging Prefill and Decode requests in vLLM, PCPManager needs to synchronize with this standard. As PCPManager involves multiple calculations of PD request counts, this PR attempts to consolidate the related logic and update the PD request count once per batch. ### How was this patch tested? ```bash pytest tests/e2e/multicard/4-cards/long_sequence/test_mtp.py ``` - vLLM version: v0.13.0 - vLLM main: vllm-project/vllm@11b6af5 Signed-off-by: QiuChunshuo <qiuchunshuo@huawei.com> Signed-off-by: zrj026 <zhangrunjiang026@gmail.com>

…-project#5939) ### What this PR does / why we need it? Since the PR (vllm-project/vllm#32118) has modified the criteria for judging Prefill and Decode requests in vLLM, PCPManager needs to synchronize with this standard. As PCPManager involves multiple calculations of PD request counts, this PR attempts to consolidate the related logic and update the PD request count once per batch. ### How was this patch tested? ```bash pytest tests/e2e/multicard/4-cards/long_sequence/test_mtp.py ``` - vLLM version: v0.13.0 - vLLM main: vllm-project/vllm@11b6af5 Signed-off-by: QiuChunshuo <qiuchunshuo@huawei.com>

…-project#5939) ### What this PR does / why we need it? Since the PR (vllm-project/vllm#32118) has modified the criteria for judging Prefill and Decode requests in vLLM, PCPManager needs to synchronize with this standard. As PCPManager involves multiple calculations of PD request counts, this PR attempts to consolidate the related logic and update the PD request count once per batch. ### How was this patch tested? ```bash pytest tests/e2e/multicard/4-cards/long_sequence/test_mtp.py ``` - vLLM version: v0.13.0 - vLLM main: vllm-project/vllm@11b6af5 Signed-off-by: QiuChunshuo <qiuchunshuo@huawei.com> Signed-off-by: zrj026 <zhangrunjiang026@gmail.com>

…-project#5939) ### What this PR does / why we need it? Since the PR (vllm-project/vllm#32118) has modified the criteria for judging Prefill and Decode requests in vLLM, PCPManager needs to synchronize with this standard. As PCPManager involves multiple calculations of PD request counts, this PR attempts to consolidate the related logic and update the PD request count once per batch. ### How was this patch tested? ```bash pytest tests/e2e/multicard/4-cards/long_sequence/test_mtp.py ``` - vLLM version: v0.13.0 - vLLM main: vllm-project/vllm@11b6af5 Signed-off-by: QiuChunshuo <qiuchunshuo@huawei.com>

pisceskkk requested review from MengqingCao, wangxiyuan and weijinqian0 as code owners January 15, 2026 13:28

github-actions bot added the module:tests label Jan 15, 2026

pisceskkk force-pushed the fix_main2main branch 2 times, most recently from fadf7ef to 253ce21 Compare January 16, 2026 07:27

wangxiyuan approved these changes Jan 16, 2026

View reviewed changes

pisceskkk force-pushed the fix_main2main branch from 253ce21 to cc430a4 Compare January 16, 2026 09:08

weiguihua2 added ready read for review ready-for-test start test by label for PR labels Jan 16, 2026

github-actions bot added the merge-conflicts label Jan 19, 2026

pisceskkk force-pushed the fix_main2main branch from cc430a4 to e794cd8 Compare January 21, 2026 01:42

github-actions bot removed the merge-conflicts label Jan 21, 2026

pisceskkk force-pushed the fix_main2main branch 7 times, most recently from 90a854e to 5cc8fe9 Compare January 23, 2026 01:02

github-actions bot added the merge-conflicts label Jan 23, 2026

pisceskkk closed this Jan 23, 2026

pisceskkk force-pushed the fix_main2main branch from 5cc8fe9 to 418a43e Compare January 23, 2026 03:37

pisceskkk reopened this Jan 23, 2026

github-actions bot removed the merge-conflicts label Jan 23, 2026

pisceskkk force-pushed the fix_main2main branch from 0c76066 to 8e7a058 Compare January 28, 2026 06:10

pisceskkk requested a review from whx-sjtu as a code owner January 28, 2026 06:10

pisceskkk force-pushed the fix_main2main branch 8 times, most recently from 2c7dbb3 to 85491d8 Compare January 30, 2026 07:37

[bugfix](cp,mtp) Fix and unify the PD request discrimination logic.

85491d8

Signed-off-by: QiuChunshuo <qiuchunshuo@huawei.com>

wangxiyuan merged commit 638cae8 into vllm-project:main Jan 31, 2026
26 checks passed

wangxiyuan mentioned this pull request Feb 24, 2026

[Misc]: test #6787

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[bugfix](CP) Fix and unify the PD request discrimination logic.#5939

[bugfix](CP) Fix and unify the PD request discrimination logic.#5939
wangxiyuan merged 1 commit intovllm-project:mainfrom
pisceskkk:fix_main2main

pisceskkk commented Jan 15, 2026 •

edited by github-actions bot

Loading

Uh oh!

github-actions bot commented Jan 15, 2026

Uh oh!

wangxiyuan Jan 16, 2026 •

edited

Loading

Uh oh!

github-actions bot commented Jan 19, 2026

Uh oh!

github-actions bot commented Jan 23, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

pisceskkk commented Jan 15, 2026 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What this PR does / why we need it?

How was this patch tested?

Uh oh!

github-actions bot commented Jan 15, 2026

Uh oh!

wangxiyuan Jan 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented Jan 19, 2026

Uh oh!

github-actions bot commented Jan 23, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

pisceskkk commented Jan 15, 2026 •

edited by github-actions bot

Loading

wangxiyuan Jan 16, 2026 •

edited

Loading