[CI/UT] Add test for chunk prefill and prefix cache on v1/AscendScheduler by MengqingCao · Pull Request #1505 · vllm-project/vllm-ascend

MengqingCao · 2025-06-28T10:21:48Z

What this PR does / why we need it?

Add tests for chunked prefill and prefix cache on v1/AscendScheduler.

Covered scenarios:

- `Qwen/Qwen3-0.6B-Base` and `deepseek-ai/DeepSeek-V2-Lite-Chat` --- multicard CI time increased by 19 min
  - `V1 + default scheduler` vs `V1 + default scheduler + enable prefix cache`
  - `V1 + Ascend scheduler` vs `V1 + Ascend scheduler + enable prefix cache` vs `V1 + Ascend scheduler + enable prefix cache + enable chunked prefill`
- `Qwen/Qwen3-0.6B-Base` --- singlecard CI time increased by 8 min
  - `V1 + Ascend scheduler` vs `V1 + Ascend scheduler + enable chunked prefill`
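
The comparison pattern behind these scenarios can be sketched as follows. This is a minimal illustration, not the test code added by this PR: `scheduler_kwargs`, `compare_group`, and `fake_generate` are hypothetical helpers, and the keyword names (`enable_prefix_caching`, `enable_chunked_prefill`, `additional_config`, `ascend_scheduler_config`) are assumed to match the vLLM and vllm-ascend options of the same names. A real test would replace `fake_generate` with a greedy-decoding run on the model under test.

```python
from typing import Callable, Dict, List, Sequence

Config = Dict[str, object]


def scheduler_kwargs(ascend: bool, prefix_cache: bool = False,
                     chunked_prefill: bool = False) -> Config:
    """Build engine keyword arguments for one scenario (key names are assumed)."""
    if not ascend:
        # Default V1 scheduler: only prefix caching is toggled in the scenarios,
        # so chunked_prefill is intentionally ignored here.
        return {"enable_prefix_caching": prefix_cache}
    ascend_cfg: Config = {"enabled": True}
    if prefix_cache:
        ascend_cfg["enable_prefix_caching"] = True
    if chunked_prefill:
        ascend_cfg["enable_chunked_prefill"] = True
    return {"additional_config": {"ascend_scheduler_config": ascend_cfg}}


# Each group lists the baseline first, then the variants compared against it.
MULTICARD_GROUPS: List[List[Config]] = [
    [scheduler_kwargs(False), scheduler_kwargs(False, prefix_cache=True)],
    [
        scheduler_kwargs(True),
        scheduler_kwargs(True, prefix_cache=True),
        scheduler_kwargs(True, prefix_cache=True, chunked_prefill=True),
    ],
]
SINGLECARD_GROUPS: List[List[Config]] = [
    [scheduler_kwargs(True), scheduler_kwargs(True, chunked_prefill=True)],
]


def compare_group(generate: Callable[[Config, Sequence[str]], List[str]],
                  group: List[Config], prompts: Sequence[str]) -> int:
    """Assert every variant reproduces the baseline output; return checks run."""
    baseline = generate(group[0], prompts)
    checked = 0
    for cfg in group[1:]:
        outputs = generate(cfg, prompts)
        assert outputs == baseline, f"outputs diverged for config {cfg}"
        checked += 1
    return checked


def fake_generate(cfg: Config, prompts: Sequence[str]) -> List[str]:
    """Stand-in for a real model run; deliberately ignores cfg."""
    return [p.upper() for p in prompts]


total_checks = sum(
    compare_group(fake_generate, group, ["hello", "world"])
    for group in MULTICARD_GROUPS + SINGLECARD_GROUPS
)
```

Listing the baseline first in each group keeps the assertion direction explicit: enabling prefix cache or chunked prefill should change scheduling, not the generated text.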

This PR should be rebased after #1498 and #1446 are merged.

Does this PR introduce any user-facing change?

N/A

How was this patch tested?

CI passed with the newly added tests.

codecov · 2025-06-28T10:36:18Z

Codecov Report

❌ Patch coverage is 46.15385% with 7 lines in your changes missing coverage. Please review.
✅ Project coverage is 34.17%. Comparing base (c30ddb8) to head (0c10291).
⚠️ Report is 609 commits behind head on main.

Files with missing lines	Patch %	Lines
tests/conftest.py	46.15%	7 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #1505      +/-   ##
==========================================
+ Coverage   27.39%   34.17%   +6.77%     
==========================================
  Files          56       63       +7     
  Lines        6191     7328    +1137     
==========================================
+ Hits         1696     2504     +808     
- Misses       4495     4824     +329

Flag	Coverage Δ
unittests	`34.17% <46.15%> (+6.77%)`	⬆️
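
The percentages in this report follow directly from the hit and line counts. The short check below recomputes them from the figures in the diff. The patch total of 13 changed lines is an inference from "46.15385% with 7 lines missing", not a number stated in the report, and the observation about truncation is only what the arithmetic suggests.

```python
import math


def coverage_pct(hits: int, lines: int) -> float:
    """Coverage as a percentage of tracked lines."""
    return 100.0 * hits / lines


# Figures copied from the coverage diff: (hits, misses, lines).
base_hits, base_misses, base_lines = 1696, 4495, 6191
head_hits, head_misses, head_lines = 2504, 4824, 7328

base_cov = coverage_pct(base_hits, base_lines)   # about 27.39
head_cov = coverage_pct(head_hits, head_lines)   # about 34.17
delta = head_cov - base_cov                      # about 6.776

# The reported +6.77% matches the delta truncated (not rounded) to two decimals.
delta_truncated = math.floor(delta * 100) / 100

# Patch coverage: 7 missing lines at 46.15385% implies 13 changed lines (inferred).
patch_missing = 7
patch_lines = round(patch_missing / (1 - 0.4615385))
patch_cov = coverage_pct(patch_lines - patch_missing, patch_lines)

print(f"base {base_cov:.2f}%  head {head_cov:.2f}%  delta +{delta_truncated:.2f}%")
print(f"patch {patch_cov:.5f}% over {patch_lines} lines")
```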


github-actions · 2025-06-30T08:36:50Z

This pull request has conflicts, please resolve those before we can evaluate the pull request.

Yikun

LGTM if CI passed

github-actions · 2025-07-01T22:07:32Z

This pull request has conflicts, please resolve those before we can evaluate the pull request.


…uler (vllm-project#1505)

### What this PR does / why we need it?

Add test for chunked prefill and prefix cache on v1/AscendScheduler

Covered scenarios:

- `Qwen/Qwen3-0.6B-Base` and `deepseek-ai/DeepSeek-V2-Lite-Chat` --- multicard CI time increased by 19 min
  - `V1 + default scheduler` vs `V1 + default scheduler + enable prefix cache`
  - `V1 + Ascend scheduler` vs `V1 + Ascend scheduler + enable prefix cache` vs `V1 + Ascend scheduler + enable prefix cache + enable chunked prefill`
- `Qwen/Qwen3-0.6B-Base` --- singlecard CI time increased by 8 min
  - `V1 + Ascend scheduler` vs `V1 + Ascend scheduler + enable chunked prefill`

should rebase after vllm-project#1498 and vllm-project#1446

### Does this PR introduce _any_ user-facing change?

N/A

### How was this patch tested?

CI passed with new added test.

Signed-off-by: MengqingCao <cmq0113@163.com>
Signed-off-by: ZhengWG <zwg0606@gmail.com>


github-actions Bot added the module:tests label Jun 28, 2025

This was referenced Jun 28, 2025

[BugFix] Address PrefillCacheHit state to fix prefix cache accuracy bug #1498

Merged

[Core] Fix block table shape to make Prefix cache work with Ascend scheduler #1446

Merged

Yikun added the ready read for review label Jun 29, 2025

MengqingCao mentioned this pull request Jun 30, 2025

[RFC]: E2E CI test for key features #413

Closed

83 tasks

github-actions Bot added merge-conflicts and removed ready read for review labels Jun 30, 2025

MengqingCao force-pushed the e2e branch from 76f7ec2 to 9c5c327 Compare June 30, 2025 09:03

github-actions Bot removed the merge-conflicts label Jun 30, 2025

MengqingCao force-pushed the e2e branch from 9c5c327 to 7e15c1a Compare June 30, 2025 12:03

Yikun approved these changes Jun 30, 2025


Yikun added the ready read for review label Jun 30, 2025

MengqingCao force-pushed the e2e branch from 9638902 to 7e15c1a Compare July 1, 2025 06:03

github-actions Bot added merge-conflicts and removed ready read for review labels Jul 1, 2025

[CI/UT] Add test for chunk prefill and prefix cache on v1/AscendSched…

0c10291

…uler Signed-off-by: MengqingCao <cmq0113@163.com>

MengqingCao force-pushed the e2e branch from 7e15c1a to 0c10291 Compare July 2, 2025 04:58

github-actions Bot removed the merge-conflicts label Jul 2, 2025

Yikun merged commit 59237ea into vllm-project:main Jul 2, 2025
13 checks passed

MengqingCao deleted the e2e branch July 8, 2025 02:04

Yikun merged 1 commit into vllm-project:main from MengqingCao:e2e
