[TEST] Add initial multi-modal cases of Qwen2.5-VL-32B-Instruct for nightly test #3707
wangxiyuan merged 5 commits into vllm-project:main
Conversation
Signed-off-by: wangyu31577 <wangyu31577@hundsun.com>
Code Review
This pull request adds a new nightly end-to-end test for the Qwen/Qwen2.5-VL-32B-Instruct multi-modal model and includes a fix in aisbench.py for configuration generation. The new test is comprehensive, covering an API check, a multi-modal request, and benchmarks. My review focuses on the correctness of the new test: I suggest using the chat completions API, which is more appropriate for the model under test, to make the test more robust.
```python
batch = await client.completions.create(
    model=model,
    prompt=prompts,
    **request_keyword_args,
)
choices: list[openai.types.CompletionChoice] = batch.choices
assert choices[0].text, "empty response"
```
The model Qwen/Qwen2.5-VL-32B-Instruct is an instruction-tuned chat model. For consistency with other parts of the test (such as send_image_request and the aisbench cases, which use chat endpoints) and to follow best practice, the initial smoke test should use the chat completions API (client.chat.completions.create) instead of the legacy completions API (client.completions.create). Using the correct API for the model type makes the test more robust and maintainable.
```diff
-batch = await client.completions.create(
-    model=model,
-    prompt=prompts,
-    **request_keyword_args,
-)
-choices: list[openai.types.CompletionChoice] = batch.choices
-assert choices[0].text, "empty response"
+chat_response = await client.chat.completions.create(
+    model=model,
+    messages=[{
+        "role": "user",
+        "content": prompts[0]
+    }],
+    **request_keyword_args,
+)
+choices = chat_response.choices
+assert choices[0].message.content, "empty response"
```
👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:
If CI fails, you can run the linting and testing checks locally according to Contributing and Testing.
…ghtly test (vllm-project#3707)

### What this PR does / why we need it?
This PR adds the initial multi-modal model for the nightly test, including 2 cases for Qwen2.5-VL-32B acc/perf testing on A3, which need to run daily.

### Does this PR introduce _any_ user-facing change?
No

### How was this patch tested?
By running the test.

- vLLM version: v0.11.0rc3
- vLLM main: https://github.com/vllm-project/vllm/commit/v0.11.0

Signed-off-by: wangyu31577 <wangyu31577@hundsun.com>
Co-authored-by: wangyu31577 <wangyu31577@hundsun.com>
Signed-off-by: luolun <luolun1995@cmbchina.com>
… to `.yaml` (#6503)

### What this PR does / why we need it?
This PR refactors the nightly single-node model test by migrating test configurations from Python scripts to a more maintainable YAML-based format.

| Original PR | Python (`.py`) | YAML (`.yaml`) |
| :--- | :--- | :--- |
| [#3568](#3568) | `test_deepseek_r1_0528_w8a8_eplb.py` | `DeepSeek-R1-0528-W8A8.yaml` |
| [#3631](#3631) | `test_deepseek_r1_0528_w8a8.py` | `DeepSeek-R1-0528-W8A8.yaml` |
| [#5874](#5874) | `test_deepseek_r1_w8a8_hbm.py` | `DeepSeek-R1-W8A8-HBM.yaml` |
| [#3908](#3908) | `test_deepseek_v3_2_w8a8.py` | `DeepSeek-V3.2-W8A8.yaml` |
| [#5682](#5682) | `test_kimi_k2_thinking.py` | `Kimi-K2-Thinking.yaml` |
| [#4111](#4111) | `test_mtpx_deepseek_r1_0528_w8a8.py` | `MTPX-DeepSeek-R1-0528-W8A8.yaml` |
| [#3733](#3733) | `test_prefix_cache_deepseek_r1_0528_w8a8.py` | `Prefix-Cache-DeepSeek-R1-0528-W8A8.yaml` |
| [#6543](#6543) | `test_qwen3_235b_w8a8.py` | `Qwen3-235B-A22B-W8A8.yaml` |
| [#6543](#6543) | `test_qwen3_235b_a22b_w8a8_eplb.py` | `Qwen3-235B-A22B-W8A8.yaml` |
| [#3973](#3973) | `test_qwen3_30b_w8a8.py` | `Qwen3-30B-A3B-W8A8.yaml` |
| [#3541](#3541) | `test_qwen3_32b_int8.py` | `Qwen3-32B-Int8.yaml` |
| [#3757](#3757) | `test_qwq_32b.py` | `QwQ-32B.yaml` |
| [#5616](#5616) | `test_qwen3_next_w8a8.py` | `Qwen3-Next-80B-A3B-Instruct-W8A8.yaml` |
| [#3541](#3541) | `test_qwen2_5_vl_7b.py` | `Qwen2.5-VL-7B-Instruct.yaml` |
| [#5301](#5301) | `test_qwen2_5_vl_7b_epd.py` | `Qwen2.5-VL-7B-Instruct-EPD.yaml` |
| [#3707](#3707) | `test_qwen2_5_vl_32b.py` | `Qwen2.5-VL-32B-Instruct.yaml` |
| [#3676](#3676) | `test_qwen3_32b_int8_a3_feature_stack3.py` | `Qwen3-32B-Int8-A3-Feature-Stack3.yaml` |
| [#3709](#3709) | `test_prefix_cache_qwen3_32b_int8.py` | `Prefix-Cache-Qwen3-32B-Int8.yaml` |
| [#5395](#5395) | `test_qwen3_next.py` | `Qwen3-Next-80B-A3B-Instruct-A2.yaml` |
| [#3474](#3474) | `test_qwen3_32b.py` | `Qwen3-32B.yaml` |
| [#3541](#3541) | `test_qwen3_32b_int8.py` | `Qwen3-32B-Int8-A2.yaml` |

### Does this PR introduce _any_ user-facing change?

### How was this patch tested?
- vLLM version: v0.15.0
- vLLM main: https://github.com/vllm-project/vllm/commit/v0.15.0

Signed-off-by: MrZ20 <2609716663@qq.com>
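A migrated per-model config might look roughly like the sketch below. All field names here are hypothetical illustrations of the idea (one declarative file per model instead of a test script); the real schema is defined by the nightly test harness and is not shown in this PR description.

```yaml
# Hypothetical sketch of a migrated nightly-test config; the field
# names are illustrative only and do not reflect the real schema.
model: Qwen/Qwen2.5-VL-32B-Instruct
hardware: A3
cases:
  - name: accuracy
    endpoint: /v1/chat/completions
  - name: performance
    endpoint: /v1/chat/completions
```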
What this PR does / why we need it?
This PR adds the initial multi-modal model for the nightly test, including 2 cases for Qwen2.5-VL-32B acc/perf testing on A3, which need to run daily.
Does this PR introduce any user-facing change?
No
How was this patch tested?
By running the test.
vLLM version: v0.11.0rc3
vLLM main: https://github.com/vllm-project/vllm/commit/v0.11.0