add qwq testcase by ck-hw-1018 · Pull Request #3757 · vllm-project/vllm-ascend

ck-hw-1018 · 2025-10-25T07:29:15Z

What this PR does / why we need it?

This PR adds a qwq case for nightly test for qwen-qwq on A3 ,we need test them daily

Does this PR introduce any user-facing change?

no

How was this patch tested?

by running the test

vLLM version: v0.11.0rc3
vLLM main: vllm-project/vllm@c9461e0

Signed-off-by: ckhw <cuikai1@huawei.com>

github-actions · 2025-10-25T07:29:28Z

👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:‌‌

A PR should do only one thing, smaller PRs enable faster reviews.
Every PR should include unit tests and end-to-end tests ‌to ensure it works and is not broken by other future PRs.
Write the commit message by fulfilling the PR description to help reviewer and future developers understand.

If CI fails, you can run linting and testing checks locally according Contributing and Testing.

gemini-code-assist

Code Review

This PR adds a new end-to-end test for the Qwen/QwQ-32B model. The overall structure is good, but I've found an area for improvement in how the server arguments are constructed. The current implementation is brittle and hard to maintain. I've provided a suggestion to make it more robust and readable.

gemini-code-assist · 2025-10-25T07:31:11Z

tests/e2e/nightly/models/test_qwq_32b.py

+    server_args = [
+        "--tensor-parallel-size",
+        str(tp_size), "--port",
+        str(port), "--max-model-len", "36864", "--max-num-batched-tokens",
+        "36864", "--block-size", "128", "--trust-remote-code",
+        "--gpu-memory-utilization", "0.9", "--compilation_config",
+        '{"cudagraph_mode":"FULL_DECODE_ONLY", "cudagraph_capture_sizes": [1, 8, 24, 48, 60]}',
+        "--reasoning-parser", "deepseek_r1", "--distributed_executor_backend",
+        "mp"
+    ]
+    if mode == "single":
+        server_args.remove("--compilation_config")
+        server_args.remove(
+            '{"cudagraph_mode":"FULL_DECODE_ONLY", "cudagraph_capture_sizes": [1, 8, 24, 48, 60]}'
+        )
+        server_args.append("--additional-config")
+        server_args.append('{"ascend_scheduler_config":{"enabled":true}}')
+        server_args.append("--enforce-eager")


The current method of constructing server_args by defining a default list and then modifying it with list.remove() is brittle and can lead to runtime errors. If the initial list is changed, the remove() calls might fail with a ValueError. Additionally, the long JSON string for compilation_config is duplicated, making the code harder to maintain.

It's better to build the argument list conditionally from common and mode-specific parts. This approach is more robust, readable, and avoids duplicating configuration strings.

server_args = [ "--tensor-parallel-size", str(tp_size), "--port", str(port), "--max-model-len", "36864", "--max-num-batched-tokens", "36864", "--block-size", "128", "--trust-remote-code", "--gpu-memory-utilization", "0.9", ] if mode == "single": server_args.extend([ "--additional-config", '{"ascend_scheduler_config":{"enabled":true}}', "--enforce-eager", ]) else: # aclgraph server_args.extend([ "--compilation_config", '{"cudagraph_mode":"FULL_DECODE_ONLY", "cudagraph_capture_sizes": [1, 8, 24, 48, 60]}', ]) server_args.extend([ "--reasoning-parser", "deepseek_r1", "--distributed_executor_backend", "mp" ])

Signed-off-by: ckhw <cuikai1@huawei.com>

ck-hw-1018 · 2025-10-25T08:32:14Z

https://github.com/vllm-project/vllm-ascend/actions/runs/18800163569/job/53646637879?pr=3757

Signed-off-by: ckhw <cuikai1@huawei.com>

jiangyunfan1 · 2025-10-25T09:10:40Z

LGTM

### What this PR does / why we need it? This PR adds a qwq case for nightly test for qwen-qwq on A3 ,we need test them daily ### Does this PR introduce _any_ user-facing change? no ### How was this patch tested? by running the test - vLLM version: v0.11.0rc3 - vLLM main: vllm-project/vllm@c9461e0 --------- Signed-off-by: ckhw <cuikai1@huawei.com> Signed-off-by: luolun <luolun1995@cmbchina.com>

### What this PR does / why we need it? This PR adds a qwq case for nightly test for qwen-qwq on A3 ,we need test them daily ### Does this PR introduce _any_ user-facing change? no ### How was this patch tested? by running the test - vLLM version: v0.11.0rc3 - vLLM main: vllm-project/vllm@c9461e0 --------- Signed-off-by: ckhw <cuikai1@huawei.com> Signed-off-by: hwhaokun <haokun0405@163.com>

### What this PR does / why we need it? This PR adds a qwq case for nightly test for qwen-qwq on A3 ,we need test them daily ### Does this PR introduce _any_ user-facing change? no ### How was this patch tested? by running the test - vLLM version: v0.11.0rc3 - vLLM main: vllm-project/vllm@c9461e0 --------- Signed-off-by: ckhw <cuikai1@huawei.com> Signed-off-by: nsdie <yeyifan@huawei.com>

### What this PR does / why we need it? This PR adds a qwq case for nightly test for qwen-qwq on A3 ,we need test them daily ### Does this PR introduce _any_ user-facing change? no ### How was this patch tested? by running the test - vLLM version: v0.11.0rc3 - vLLM main: vllm-project/vllm@c9461e0 --------- Signed-off-by: ckhw <cuikai1@huawei.com>

ck-hw-1018 added 2 commits October 25, 2025 15:15

add qwen qwq testcase

51206f4

Signed-off-by: ckhw <cuikai1@huawei.com>

add qwen qwq testcase

ce52e70

Signed-off-by: ckhw <cuikai1@huawei.com>

github-actions bot added the module:tests label Oct 25, 2025

gemini-code-assist bot reviewed Oct 25, 2025

View reviewed changes

ck-hw-1018 added 2 commits October 25, 2025 15:37

add qwen qwq testcase

de716e5

Signed-off-by: ckhw <cuikai1@huawei.com>

add qwen qwq testcase

b564dd3

Signed-off-by: ckhw <cuikai1@huawei.com>

ck-hw-1018 added 3 commits October 25, 2025 16:39

add qwen qwq testcase

dc69155

Signed-off-by: ckhw <cuikai1@huawei.com>

add qwq testcase

4ad7a0f

Signed-off-by: ckhw <cuikai1@huawei.com>

add qwen qwq testcase

abf627c

Signed-off-by: ckhw <cuikai1@huawei.com>

wangxiyuan approved these changes Oct 25, 2025

View reviewed changes

wangxiyuan merged commit 7572939 into vllm-project:main Oct 25, 2025
6 checks passed

MrZ20 mentioned this pull request Mar 2, 2026

[Nightly][Refactor]Migrate nightly single-node model tests from .py to .yaml #6503

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add qwq testcase#3757

add qwq testcase#3757
wangxiyuan merged 7 commits intovllm-project:mainfrom
ck-hw-1018:main

ck-hw-1018 commented Oct 25, 2025 •

edited by github-actions bot

Loading

Uh oh!

github-actions bot commented Oct 25, 2025

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

gemini-code-assist bot Oct 25, 2025

Uh oh!

ck-hw-1018 commented Oct 25, 2025

Uh oh!

jiangyunfan1 commented Oct 25, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

ck-hw-1018 commented Oct 25, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What this PR does / why we need it?

Does this PR introduce any user-facing change?

How was this patch tested?

Uh oh!

github-actions bot commented Oct 25, 2025

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist bot Oct 25, 2025

Choose a reason for hiding this comment

Uh oh!

ck-hw-1018 commented Oct 25, 2025

Uh oh!

jiangyunfan1 commented Oct 25, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

ck-hw-1018 commented Oct 25, 2025 •

edited by github-actions bot

Loading