[CI] Add multi-nodes longseq configs of DeepSeek-R1-W8A8 & Qwen3-235B-W8A8 by dsxsteven · Pull Request #5381 · vllm-project/vllm-ascend

dsxsteven · 2025-12-26T03:20:10Z

What this PR does / why we need it?

add DeepSeek-R1-W8A8 and Qwen3-235B-W8A8 configs in multi-nodes and longseq (PCP&DCP) scenario

Does this PR introduce any user-facing change?

NO

How was this patch tested?

vLLM version: release/v0.13.0
vLLM main: vllm-project/vllm@bc0a5a0

Signed-off-by: daishixun <dsxsteven@sina.com>

github-actions · 2025-12-26T03:20:19Z

👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:‌‌

A PR should do only one thing, smaller PRs enable faster reviews.
Every PR should include unit tests and end-to-end tests ‌to ensure it works and is not broken by other future PRs.
Write the commit message by fulfilling the PR description to help reviewer and future developers understand.

If CI fails, you can run linting and testing checks locally according Contributing and Testing.

gemini-code-assist

Code Review

This pull request introduces new multi-node long-sequence test configurations for DeepSeek-R1-W8A8 and Qwen3-235B-W8A8. The changes are straightforward additions of YAML configuration files. My review has identified two potential issues: one is a likely incorrect engine_id in the DeepSeek configuration, and the other is an incomplete benchmarks section in the Qwen3 configuration. Both issues could lead to incorrect or ineffective testing and should be addressed.

gemini-code-assist · 2025-12-26T03:21:26Z

+        '{"kv_connector": "MooncakeConnectorV1",
+        "kv_role": "kv_consumer",
+        "kv_port": "30200",
+        "engine_id": "2",


The engine_id for the consumer node is set to "2", which appears to be incorrect for a 2-node setup. In a producer-consumer configuration, engine IDs are typically sequential, starting from "0". The accompanying Qwen3-235B-W8A8-longseq.yaml config uses "0" and "1", which is the expected pattern. Please correct this to "1" to ensure proper node communication.

"engine_id": "1",

gemini-code-assist · 2025-12-26T03:21:26Z

+                  }
+            }
+        }'
+benchmarks:


The benchmarks section is empty. This will result in the performance and accuracy tests for this model being skipped, making the test configuration ineffective. Please provide the necessary benchmark configurations for perf and acc, similar to the DeepSeek-R1-W8A8-longseq.yaml file.

@dsxsteven is this skip expected?

weiguihua2 · 2025-12-26T03:33:19Z

+          --max-num-seqs 4
+          --max-model-len 32768
+          --max-num-batched-tokens 16384
+          --trust-remote-code


Subsequent cases need to be supplemented for TP asymmetry.

#Todo after #5224 merge

Signed-off-by: daishixun <dsxsteven@sina.com>

dsxsteven · 2025-12-29T02:48:43Z

Local successful test results

Signed-off-by: daishixun <dsxsteven@sina.com>

Angazenn · 2025-12-30T08:31:17Z

+        --trust-remote-code
+        --no-enable-prefix-caching
+        --gpu-memory-utilization 0.9
+        --compilation_config '{"cudagraph_capture_sizes":[1,2,4,8,16,32], "cudagraph_mode": "FULL_DECODE_ONLY"}'


why do we need to specify "cudagraph_capture_sizes":[1,2,4,8,16,32] here?

Signed-off-by: daishixun <dsxsteven@sina.com>

…ven/vllm-ascend_dsx into 12_26_add_longseq_nightly

Signed-off-by: daishixun <dsxsteven@sina.com>

…-W8A8 (vllm-project#5381) ### What this PR does / why we need it? add DeepSeek-R1-W8A8 and Qwen3-235B-W8A8 configs in multi-nodes and longseq (PCP&DCP) scenario - vLLM version: release/v0.13.0 - vLLM main: vllm-project/vllm@bc0a5a0 --------- Signed-off-by: daishixun <dsxsteven@sina.com>

…-W8A8 (vllm-project#5381) ### What this PR does / why we need it? add DeepSeek-R1-W8A8 and Qwen3-235B-W8A8 configs in multi-nodes and longseq (PCP&DCP) scenario - vLLM version: release/v0.13.0 - vLLM main: vllm-project/vllm@bc0a5a0 --------- Signed-off-by: daishixun <dsxsteven@sina.com> Signed-off-by: wjunLu <wjunlu217@gmail.com>

…-W8A8 (vllm-project#5381) ### What this PR does / why we need it? add DeepSeek-R1-W8A8 and Qwen3-235B-W8A8 configs in multi-nodes and longseq (PCP&DCP) scenario - vLLM version: release/v0.13.0 - vLLM main: vllm-project/vllm@bc0a5a0 --------- Signed-off-by: daishixun <dsxsteven@sina.com>

…-W8A8 (vllm-project#5381) ### What this PR does / why we need it? add DeepSeek-R1-W8A8 and Qwen3-235B-W8A8 configs in multi-nodes and longseq (PCP&DCP) scenario - vLLM version: release/v0.13.0 - vLLM main: vllm-project/vllm@bc0a5a0 --------- Signed-off-by: daishixun <dsxsteven@sina.com> Signed-off-by: zrj026 <zhangrunjiang026@gmail.com>

…-W8A8 (vllm-project#5381) ### What this PR does / why we need it? add DeepSeek-R1-W8A8 and Qwen3-235B-W8A8 configs in multi-nodes and longseq (PCP&DCP) scenario - vLLM version: release/v0.13.0 - vLLM main: vllm-project/vllm@bc0a5a0 --------- Signed-off-by: daishixun <dsxsteven@sina.com>

…-W8A8 (vllm-project#5381) ### What this PR does / why we need it? add DeepSeek-R1-W8A8 and Qwen3-235B-W8A8 configs in multi-nodes and longseq (PCP&DCP) scenario - vLLM version: release/v0.13.0 - vLLM main: vllm-project/vllm@bc0a5a0 --------- Signed-off-by: daishixun <dsxsteven@sina.com> Signed-off-by: zrj026 <zhangrunjiang026@gmail.com>

add multi-nodes longseq configs of ds and qwen

61ddfea

Signed-off-by: daishixun <dsxsteven@sina.com>

github-actions bot added ci/build module:tests labels Dec 26, 2025

gemini-code-assist bot reviewed Dec 26, 2025

View reviewed changes

weiguihua2 reviewed Dec 26, 2025

View reviewed changes

Comment thread tests/e2e/nightly/multi_node/config/models/Qwen3-235B-W8A8-longseq.yaml

weiguihua2 reviewed Dec 26, 2025

View reviewed changes

Comment thread tests/e2e/nightly/multi_node/config/models/Qwen3-235B-W8A8-longseq.yaml

weiguihua2 reviewed Dec 26, 2025

View reviewed changes

add mtp and full graph in the nitghtly test

f14e3ea

Signed-off-by: daishixun <dsxsteven@sina.com>

gemini-code-assist bot mentioned this pull request Dec 26, 2025

[CI] Add nightly ci test for deepseek v3.1 #5386

Merged

fix typo

992f42c

Signed-off-by: daishixun <dsxsteven@sina.com>

dsxsteven added 2 commits December 29, 2025 11:59

fix vllm serve command

7c81306

Signed-off-by: daishixun <dsxsteven@sina.com>

remove perf benchmark

fa5f99b

Signed-off-by: daishixun <dsxsteven@sina.com>

weiguihua2 added ready read for review ready-for-test start test by label for PR and removed ready read for review ready-for-test start test by label for PR labels Dec 30, 2025

Merge branch 'main' into 12_26_add_longseq_nightly

245ea5e

Angazenn reviewed Dec 30, 2025

View reviewed changes

dsxsteven added 4 commits December 30, 2025 16:51

remove cuda_capture_sizes

7471c13

Signed-off-by: daishixun <dsxsteven@sina.com>

Merge branch '12_26_add_longseq_nightly' of https://github.com/dsxste…

c2ff57a

…ven/vllm-ascend_dsx into 12_26_add_longseq_nightly

fix ds&mtp cuda capture sizes

f6e0a47

Signed-off-by: daishixun <dsxsteven@sina.com>

fix typo

01a765e

Signed-off-by: daishixun <dsxsteven@sina.com>

MengqingCao approved these changes Jan 4, 2026

View reviewed changes

MengqingCao merged commit 3c7e6c6 into vllm-project:main Jan 4, 2026
16 checks passed

dsxsteven deleted the 12_26_add_longseq_nightly branch January 6, 2026 08:02

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[CI] Add multi-nodes longseq configs of DeepSeek-R1-W8A8 & Qwen3-235B-W8A8#5381

[CI] Add multi-nodes longseq configs of DeepSeek-R1-W8A8 & Qwen3-235B-W8A8#5381
MengqingCao merged 10 commits intovllm-project:mainfrom
dsxsteven:12_26_add_longseq_nightly

dsxsteven commented Dec 26, 2025 •

edited by github-actions bot

Loading

Uh oh!

github-actions bot commented Dec 26, 2025

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

gemini-code-assist bot Dec 26, 2025

Uh oh!

gemini-code-assist bot Dec 26, 2025

Uh oh!

MengqingCao Jan 4, 2026

Uh oh!

dsxsteven Jan 4, 2026

Uh oh!

Uh oh!

Uh oh!

weiguihua2 Dec 26, 2025

Uh oh!

dsxsteven Dec 26, 2025

Uh oh!

dsxsteven commented Dec 29, 2025

Uh oh!

Angazenn Dec 30, 2025

Uh oh!

dsxsteven Dec 30, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

dsxsteven commented Dec 26, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What this PR does / why we need it?

Does this PR introduce any user-facing change?

How was this patch tested?

Uh oh!

github-actions bot commented Dec 26, 2025

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist bot Dec 26, 2025

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Dec 26, 2025

Choose a reason for hiding this comment

Uh oh!

MengqingCao Jan 4, 2026

Choose a reason for hiding this comment

Uh oh!

dsxsteven Jan 4, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

weiguihua2 Dec 26, 2025

Choose a reason for hiding this comment

Uh oh!

dsxsteven Dec 26, 2025

Choose a reason for hiding this comment

Uh oh!

dsxsteven commented Dec 29, 2025

Uh oh!

Angazenn Dec 30, 2025

Choose a reason for hiding this comment

Uh oh!

dsxsteven Dec 30, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

dsxsteven commented Dec 26, 2025 •

edited by github-actions bot

Loading