
[Nightly][Refactor] Migrate nightly single-node model tests from `.py` to `.yaml` #6503

Merged
wangxiyuan merged 5 commits into vllm-project:main from MrZ20:single_nightly on Mar 3, 2026
Conversation

@MrZ20
Contributor

@MrZ20 MrZ20 commented Feb 3, 2026

What this PR does / why we need it?

This PR refactors the nightly single-node model test by migrating test configurations from Python scripts to a more maintainable YAML-based format.

| Original PR | Python (`.py`) | YAML (`.yaml`) |
| :--- | :--- | :--- |
| #3568 | `test_deepseek_r1_0528_w8a8_eplb.py` | `DeepSeek-R1-0528-W8A8.yaml` |
| #3631 | `test_deepseek_r1_0528_w8a8.py` | `DeepSeek-R1-0528-W8A8.yaml` |
| #5874 | `test_deepseek_r1_w8a8_hbm.py` | `DeepSeek-R1-W8A8-HBM.yaml` |
| #3908 | `test_deepseek_v3_2_w8a8.py` | `DeepSeek-V3.2-W8A8.yaml` |
| #5682 | `test_kimi_k2_thinking.py` | `Kimi-K2-Thinking.yaml` |
| #4111 | `test_mtpx_deepseek_r1_0528_w8a8.py` | `MTPX-DeepSeek-R1-0528-W8A8.yaml` |
| #3733 | `test_prefix_cache_deepseek_r1_0528_w8a8.py` | `Prefix-Cache-DeepSeek-R1-0528-W8A8.yaml` |
| #6543 | `test_qwen3_235b_w8a8.py` | `Qwen3-235B-A22B-W8A8.yaml` |
| #6543 | `test_qwen3_235b_a22b_w8a8_eplb.py` | `Qwen3-235B-A22B-W8A8.yaml` |
| #3973 | `test_qwen3_30b_w8a8.py` | `Qwen3-30B-A3B-W8A8.yaml` |
| #3541 | `test_qwen3_32b_int8.py` | `Qwen3-32B-Int8.yaml` |
| #3757 | `test_qwq_32b.py` | `QwQ-32B.yaml` |
| #5616 | `test_qwen3_next_w8a8.py` | `Qwen3-Next-80B-A3B-Instruct-W8A8.yaml` |
| #3541 | `test_qwen2_5_vl_7b.py` | `Qwen2.5-VL-7B-Instruct.yaml` |
| #5301 | `test_qwen2_5_vl_7b_epd.py` | `Qwen2.5-VL-7B-Instruct-EPD.yaml` |
| #3707 | `test_qwen2_5_vl_32b.py` | `Qwen2.5-VL-32B-Instruct.yaml` |
| #3676 | `test_qwen3_32b_int8_a3_feature_stack3.py` | `Qwen3-32B-Int8-A3-Feature-Stack3.yaml` |
| #3709 | `test_prefix_cache_qwen3_32b_int8.py` | `Prefix-Cache-Qwen3-32B-Int8.yaml` |
| #5395 | `test_qwen3_next.py` | `Qwen3-Next-80B-A3B-Instruct-A2.yaml` |
| #3474 | `test_qwen3_32b.py` | `Qwen3-32B.yaml` |
| #3541 | `test_qwen3_32b_int8.py` | `Qwen3-32B-Int8-A2.yaml` |

Does this PR introduce any user-facing change?

How was this patch tested?

@github-actions
Contributor

github-actions bot commented Feb 3, 2026

👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:

  • A PR should do only one thing; smaller PRs enable faster reviews.
  • Every PR should include unit tests and end-to-end tests to ensure it works and is not broken by future PRs.
  • Write the commit message and fill out the PR description to help reviewers and future developers understand the change.

If CI fails, you can run the linting and testing checks locally according to Contributing and Testing.

@gemini-code-assist
Contributor

Summary of Changes

Hello @MrZ20, I'm Gemini Code Assist! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request refactors the existing nightly single-node model tests by transitioning their setup from Python code to a declarative YAML format. This change aims to streamline the definition and management of test parameters, making it simpler to configure and execute various model tests. The new structure enhances maintainability and allows for easier extension of test scenarios without modifying core test logic.

Highlights

  • YAML Configuration for Nightly Tests: Migrated nightly single-node model test configurations from Python scripts to a more maintainable YAML-based format, improving readability and ease of management.
  • New Configuration Loader: Introduced SingleNodeConfigLoader in single_node_config.py to parse and validate the new YAML test configurations, abstracting the configuration loading logic.
  • Updated Test Execution: The test_single_node.py script now dynamically loads test parameters and environment variables from the YAML configuration, enabling flexible test execution for different models and benchmarks.
  • DeepSeek Model Test Added: Added a new YAML configuration file for the DeepSeek-R1-0528-W8A8 model, including specific server commands, environment variables, and benchmark settings for accuracy testing with Expert Parallel Load Balancing (EPLB).
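The highlights above describe a declarative per-model config file. A minimal sketch of what such a YAML file might contain — every field name here is an illustrative assumption, not the actual vllm-ascend schema:

```yaml
# Hypothetical single-node nightly test config (field names are assumptions,
# loosely modeled on the DeepSeek-R1-0528-W8A8 config described above)
model: DeepSeek-R1-0528-W8A8
env:
  MODEL_PATH: /weights/deepseek-r1-0528-w8a8
server_cmd: >
  vllm serve $MODEL_PATH --tensor-parallel-size 16
benchmarks:
  - type: accuracy
    dataset: gsm8k
```

A config like this lets a new model be added to the nightly suite by dropping in one file, with no change to the shared test logic.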


Changelog
  • tests/e2e/nightly/single_node/config/DeepSeek-R1-0528-W8A8-EPLB.yaml
    • Added a new YAML configuration file for the DeepSeek-R1-0528-W8A8 model, defining its server command, common environment variables, and accuracy benchmark settings, including Expert Parallel Load Balancing (EPLB) configurations.
  • tests/e2e/nightly/single_node/scripts/__init__.py
    • Added an empty __init__.py file to designate the scripts directory as a Python package.
  • tests/e2e/nightly/single_node/scripts/single_node_config.py
    • Added a new Python module containing SingleNodeConfig and SingleNodeConfigLoader classes.
    • SingleNodeConfig encapsulates model, environment variables, server commands, and benchmark commands.
    • SingleNodeConfigLoader provides methods to load, parse, and validate test configurations from YAML files, including environment variable expansion within commands.
  • tests/e2e/nightly/single_node/scripts/test_single_node.py
    • Modified the test_single_node function to utilize SingleNodeConfigLoader.from_yaml() to load test configurations.
    • The test now dynamically uses the model, server command, server port, and environment variables defined in the loaded YAML configuration.
    • Integrated aisbench test cases (accuracy and performance) based on the configurations provided in the YAML file.
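The changelog describes SingleNodeConfigLoader as loading, validating, and env-expanding YAML configs. A rough sketch of that pattern follows — this is not the actual vllm-ascend code; the class and field names are assumptions, and the loader takes an already-parsed dict (e.g. the output of `yaml.safe_load`) to stay self-contained:

```python
import os
from dataclasses import dataclass, field


@dataclass
class SingleNodeConfig:
    """Hypothetical container mirroring the fields described in the changelog."""
    model: str
    server_cmd: str
    env: dict = field(default_factory=dict)
    benchmarks: list = field(default_factory=list)


def load_config(raw: dict) -> SingleNodeConfig:
    """Validate required keys and expand $VARS in the server command."""
    for key in ("model", "server_cmd"):
        if key not in raw:
            raise ValueError(f"missing required key: {key}")
    env = raw.get("env", {})
    # Export the config's env vars so expandvars can resolve them below.
    os.environ.update({k: str(v) for k, v in env.items()})
    return SingleNodeConfig(
        model=raw["model"],
        server_cmd=os.path.expandvars(raw["server_cmd"]),
        env=env,
        benchmarks=raw.get("benchmarks", []),
    )
```

A test harness built on this could then parametrize one generic `test_single_node` over every file in the config directory, which is the maintainability win the PR is after.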
Ignored Files
  • Ignored by pattern: .github/workflows/** (2)
    • .github/workflows/_e2e_nightly_single_node_yaml.yaml
    • .github/workflows/schedule_nightly_test_a3.yaml

Contributor

@gemini-code-assist gemini-code-assist bot left a comment


Code Review

The pull request successfully migrates the nightly single-node model test configuration to a YAML-based format, improving maintainability. However, there are a couple of critical issues related to the default configuration loading and a high-severity inconsistency in configuration validation that need to be addressed to ensure the new YAML configuration is correctly utilized and validated.

Comment thread tests/e2e/nightly/single_node/scripts/single_node_config.py Outdated
Comment thread tests/e2e/nightly/single_node/scripts/test_single_node.py Outdated
Comment thread tests/e2e/nightly/single_node/scripts/single_node_config.py Outdated
@github-actions
Contributor

github-actions bot commented Feb 6, 2026

This pull request has conflicts, please resolve those before we can evaluate the pull request.

@MrZ20 MrZ20 force-pushed the single_nightly branch 6 times, most recently from eeb4598 to f8fd663 Compare March 2, 2026 07:46
@MrZ20 MrZ20 marked this pull request as ready for review March 2, 2026 08:12
@MrZ20 MrZ20 requested review from Yikun and wangxiyuan as code owners March 2, 2026 08:12
Member

@Yikun Yikun left a comment


Looks a great improvement

```yaml
- name: Checkout PR 6503
  working-directory: /vllm-workspace/vllm-ascend
  run: |
    echo "Fetching PR 6503..."
```
Member


Why is this needed? Any plan to remove it?

Contributor Author


Currently under testing; it will be removed before merging.

@github-actions
Contributor

github-actions bot commented Mar 3, 2026

This pull request has conflicts, please resolve those before we can evaluate the pull request.

1 similar comment

MrZ20 added 3 commits March 3, 2026 10:33

Commit messages, in order (each Signed-off-by: MrZ20 <2609716663@qq.com>):

  • v1
  • add test
  • fix
  • add port diy
  • add fun diy
  • add pd func
  • refactor
  • start nightly test
  • start nightly test 2
  • fix (×3)
  • test
  • fix (×2)

MrZ20 added 2 commits March 3, 2026 10:38
@wangxiyuan wangxiyuan merged commit 859f2c2 into vllm-project:main Mar 3, 2026
25 checks passed
@MrZ20 MrZ20 deleted the single_nightly branch March 4, 2026 06:05
845473182 pushed a commit to 845473182/vllm-ascend that referenced this pull request Mar 5, 2026
…to qwen3next_graph

* 'main' of https://github.com/vllm-project/vllm-ascend: (40 commits)
  [Feature] Add docs of batch invariance and make some extra operators patch (vllm-project#6910)
  [bugfix]Qwen2.5VL accurate question (vllm-project#6975)
  [CI] Add DeepSeek-V3.2 large EP nightly ci (vllm-project#6378)
  [Ops][BugFix] Fix RoPE shape mismatch for mtp models with flashcomm v1 enabled (vllm-project#6939)
  [bugfix]fix file not found error in nightly of single-node (vllm-project#6976)
  [Bugfix] Fix the acceptance rates dorp issue when applying eagle3 to QuaRot model (vllm-project#6914)
  [CI] Enable auto upgrade e2e estimated time for auto-partition suites (vllm-project#6840)
  [Doc][Misc] Fix msprobe_guide.md documentation issues (vllm-project#6965)
  [Nightly][Refactor]Migrate nightly single-node model tests from `.py` to `.yaml` (vllm-project#6503)
  [BugFix] Improve GDN layer detection for multimodal models (vllm-project#6941)
  [feat]ds3.2 pcp support mtp and chunkprefill (vllm-project#6917)
  [CPU binding] Implement global CPU slicing and improve IRQ binding for Ascend NPUs (vllm-project#6945)
  [Triton] Centralize Ascend extension op dispatch in triton_utils (vllm-project#6937)
  [csrc][bugfix] Add compile-time Ascend950/910_95 compatibility for custom ops between CANN8.5 and 9.0 (vllm-project#6936)
  [300I][Bugfix] fix unquant model weight nd2nz error (vllm-project#6851)
  [doc] fix supported_models (vllm-project#6930)
  [CI] nightly test timeout (vllm-project#6912)
  [CI] Upgrade CANN to 8.5.1 (vllm-project#6897)
  [Model]Add Qwen3-Omni quantization Ascend NPU adaptation and optimization (vllm-project#6828)
  [P/D][v0.16.0]Adapt to RecomputeScheduler in vLLM 0.16.0 (vllm-project#6898)
  ...
LCAIZJ pushed a commit to LCAIZJ/vllm-ascend that referenced this pull request Mar 7, 2026

… to `.yaml` (vllm-project#6503)

- vLLM version: v0.15.0
- vLLM main: https://github.com/vllm-project/vllm/commit/v0.15.0

Signed-off-by: MrZ20 <2609716663@qq.com>