[TEST] Add initial aisbench support and Qwen3 32B acc/perf test by jiangyunfan1 · Pull Request #3474 · vllm-project/vllm-ascend

jiangyunfan1 · 2025-10-15T07:14:36Z

What this PR does / why we need it?

This PR adds the first aisbench case for nightly test, it lays a foundation for following performance and accuracy tests in nightly test.

Does this PR introduce any user-facing change?

No

How was this patch tested?

By running the test

vLLM version: v0.11.0rc3
vLLM main: https://github.com/vllm-project/vllm/commit/v0.11.0

gemini-code-assist

Code Review

This pull request adds new nightly tests using aisbench and a corresponding helper script. The overall approach is sound, but the new tools/aisbench.py script contains several critical and high-severity issues. These include potential UnboundLocalError exceptions due to incorrect variable scoping, a resource leak from an unmanaged subprocess, a risk of an infinite loop when monitoring the subprocess, and fragile logic that relies on hardcoded values or specific string formats. Addressing these issues is crucial for making the test runner robust and reliable.

gemini-code-assist · 2025-10-15T07:17:33Z

tools/aisbench.py

+        if self.task_type == "accuracy":
+            aisbench_cmd = [
+                'ais_bench', '--models', f'{self.request_conf}_custom',
+                '--datasets', f'{dataset_conf}', '--debug'
+            ]
+        if self.task_type == "performance":
+            aisbench_cmd = [
+                'ais_bench', '--models', f'{self.request_conf}_custom',
+                '--datasets', f'{dataset_conf}_custom', '--debug', '--mode', 
+                'perf'
+            ]
+            if self.num_prompts:
+                aisbench_cmd.extend(['--num-prompts', str(self.num_prompts)])


The aisbench_cmd variable is only defined within the if self.task_type == "accuracy": and if self.task_type == "performance": blocks. If self.task_type has any other value, aisbench_cmd will be unbound when used on line 56, causing an UnboundLocalError. It's better to use an if/elif/else structure to handle all cases, raising an error for unsupported task types.

if self.task_type == "accuracy": aisbench_cmd = [ 'ais_bench', '--models', f'{self.request_conf}_custom', '--datasets', f'{dataset_conf}', '--debug' ] elif self.task_type == "performance": aisbench_cmd = [ 'ais_bench', '--models', f'{self.request_conf}_custom', '--datasets', f'{dataset_conf}_custom', '--debug', '--mode', 'perf' ] if self.num_prompts: aisbench_cmd.extend(['--num-prompts', str(self.num_prompts)]) else: raise ValueError(f"Unsupported task_type: {self.task_type}")

tools/aisbench.py

gemini-code-assist · 2025-10-15T07:17:34Z

tools/aisbench.py

+        while True:
+            line = self.proc.stdout.readline().strip()
+            print(line)
+            if result_msg in line:
+                self.result_line = line
+                return
+            if "ERROR" in line:
+                raise RuntimeError(
+                    "Some errors happen to Aisbench task.") from None


The while True loop reading from self.proc.stdout can hang indefinitely if the subprocess closes its standard output without printing either the expected result_msg or an "ERROR" line. readline() will block, waiting for data that will never arrive. The loop should also check if the process has terminated. When an error does occur, including stderr in the exception message would be very helpful for debugging.

while True: line = self.proc.stdout.readline().strip() if not line and self.proc.poll() is not None: # Process ended without finding the result message stderr = self.proc.stderr.read() raise RuntimeError(f"Aisbench task finished unexpectedly. Stderr: {stderr}") print(line) if result_msg in line: self.result_line = line return if "ERROR" in line: stderr = self.proc.stderr.read() raise RuntimeError(f"Some errors happen to Aisbench task. Stderr: {stderr}")

gemini-code-assist · 2025-10-15T07:17:34Z

tools/aisbench.py

+        result_csv_file = os.path.join(result_dir, "gsm8kdataset.csv")
+        result_json_file = os.path.join(result_dir, "gsm8kdataset.json")


The result filenames gsm8kdataset.csv and gsm8kdataset.json are hardcoded. This restricts this method to only work with the gsm8k dataset. To make this runner more flexible and reusable for other datasets, the dataset name should be derived dynamically, for instance from self.dataset_conf.

dataset_name = self.dataset_conf.split('/')[0] result_csv_file = os.path.join(result_dir, f"{dataset_name}dataset.csv") result_json_file = os.path.join(result_dir, f"{dataset_name}dataset.json")

This is validate review, pls resolve this. @jiangyunfan1 You should address this later

github-actions · 2025-10-15T07:20:31Z

👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:‌‌

A PR should do only one thing, smaller PRs enable faster reviews.
Every PR should include unit tests and end-to-end tests ‌to ensure it works and is not broken by other future PRs.
Write the commit message by fulfilling the PR description to help reviewer and future developers understand.

If CI fails, you can run linting and testing checks locally according Contributing and Testing.

Yikun

Please see comments inline, remove unrelated unready file.

tests/e2e/nightly/models/test_qwen2_5_vl_7b.py

tests/e2e/nightly/models/test_qwen3_32b_int8.py

tools/send_mm_request.py

tools/aisbench.py

Yikun · 2025-10-19T01:24:15Z

tools/aisbench.py

+        result_csv_file = os.path.join(result_dir, "gsm8kdataset.csv")
+        result_json_file = os.path.join(result_dir, "gsm8kdataset.json")


This is validate review, pls resolve this. @jiangyunfan1 You should address this later

Signed-off-by: Yikun Jiang <yikunkero@gmail.com>

Yikun · 2025-10-19T01:47:50Z

LGTM if @jiangyunfan1 you confirm below two comments should be addressed later:

…-project#3474) ### What this PR does / why we need it? This PR adds the first aisbench case for nightly test, it lays a foundation for following performance and accuracy tests in nightly test. ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? By running the test - vLLM version: v0.11.0rc3 - vLLM main: https://github.com/vllm-project/vllm/commit/v0.11.0 Signed-off-by: Yikun Jiang <yikunkero@gmail.com> Co-authored-by: jiangyunfan1 <jiangyunfan1@h-partners.com>

…-project#3474) ### What this PR does / why we need it? This PR adds the first aisbench case for nightly test, it lays a foundation for following performance and accuracy tests in nightly test. ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? By running the test - vLLM version: v0.11.0rc3 - vLLM main: https://github.com/vllm-project/vllm/commit/v0.11.0 Signed-off-by: Yikun Jiang <yikunkero@gmail.com> Co-authored-by: jiangyunfan1 <jiangyunfan1@h-partners.com> Signed-off-by: luolun <luolun1995@cmbchina.com>

…-project#3474) ### What this PR does / why we need it? This PR adds the first aisbench case for nightly test, it lays a foundation for following performance and accuracy tests in nightly test. ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? By running the test - vLLM version: v0.11.0rc3 - vLLM main: https://github.com/vllm-project/vllm/commit/v0.11.0 Signed-off-by: Yikun Jiang <yikunkero@gmail.com> Co-authored-by: jiangyunfan1 <jiangyunfan1@h-partners.com> Signed-off-by: hwhaokun <haokun0405@163.com>

…-project#3474) ### What this PR does / why we need it? This PR adds the first aisbench case for nightly test, it lays a foundation for following performance and accuracy tests in nightly test. ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? By running the test - vLLM version: v0.11.0rc3 - vLLM main: https://github.com/vllm-project/vllm/commit/v0.11.0 Signed-off-by: Yikun Jiang <yikunkero@gmail.com> Co-authored-by: jiangyunfan1 <jiangyunfan1@h-partners.com> Signed-off-by: nsdie <yeyifan@huawei.com>

…-project#3474) ### What this PR does / why we need it? This PR adds the first aisbench case for nightly test, it lays a foundation for following performance and accuracy tests in nightly test. ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? By running the test - vLLM version: v0.11.0rc3 - vLLM main: https://github.com/vllm-project/vllm/commit/v0.11.0 Signed-off-by: Yikun Jiang <yikunkero@gmail.com> Co-authored-by: jiangyunfan1 <jiangyunfan1@h-partners.com>

gemini-code-assist bot reviewed Oct 15, 2025

View reviewed changes

github-actions bot added module:tests module:tools labels Oct 15, 2025

wangxiyuan added the ready read for review label Oct 15, 2025

Yikun force-pushed the main branch from 95b14ff to a90f960 Compare October 17, 2025 07:14

Yikun reviewed Oct 19, 2025

View reviewed changes

Add initial aisbench and Qwen3 32B

54d2b09

Signed-off-by: Yikun Jiang <yikunkero@gmail.com>

Yikun force-pushed the main branch from 91f25f2 to 54d2b09 Compare October 19, 2025 01:39

Yikun changed the title ~~Add aisbench nightly test cases~~ [TEST] Add initial aisbench support and Qwen3 32B acc test Oct 19, 2025

Yikun changed the title ~~[TEST] Add initial aisbench support and Qwen3 32B acc test~~ [TEST] Add initial aisbench support and Qwen3 32B acc perf test Oct 19, 2025

Yikun changed the title ~~[TEST] Add initial aisbench support and Qwen3 32B acc perf test~~ [TEST] Add initial aisbench support and Qwen3 32B acc/perf test Oct 19, 2025

Yikun approved these changes Oct 20, 2025

View reviewed changes

Yikun merged commit 9e59fc1 into vllm-project:main Oct 20, 2025
14 checks passed

MrZ20 mentioned this pull request Mar 2, 2026

[Nightly][Refactor]Migrate nightly single-node model tests from .py to .yaml #6503

Merged

		result_csv_file = os.path.join(result_dir, "gsm8kdataset.csv")
		result_json_file = os.path.join(result_dir, "gsm8kdataset.json")

Conversation

jiangyunfan1 commented Oct 15, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What this PR does / why we need it?

Does this PR introduce any user-facing change?

How was this patch tested?

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist bot Oct 15, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

gemini-code-assist bot Oct 15, 2025

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Oct 15, 2025

Choose a reason for hiding this comment

Uh oh!

Yikun Oct 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented Oct 15, 2025

Uh oh!

Yikun left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Yikun Oct 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Yikun commented Oct 19, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

jiangyunfan1 commented Oct 15, 2025 •

edited by github-actions bot

Loading

Yikun Oct 19, 2025 •

edited

Loading

Yikun Oct 19, 2025 •

edited

Loading