[Test] Add initial multi modal cases of Qwen2.5-VL-7B-Instruct for disaggregated encoder #5301
wangxiyuan merged 16 commits into vllm-project:main
Conversation
Code Review
This PR adds a disaggregated encoder proxy and corresponding e2e tests for Qwen2.5-VL. The changes are well structured, introducing a proxy server, test helpers in conftest.py, and a new test case. The implementation is mostly solid, but I found a few issues related to correctness and best practices. Specifically, the health check endpoint in the proxy has a bug, a test helper function uses blocking I/O in an async context, another helper class is not robust to its declared input types, and the way the test case constructs JSON strings could be improved for safety and readability. My detailed comments and suggestions are below.
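One of the issues flagged above, blocking I/O inside an async helper, has a standard fix: run the blocking call on a worker thread so the event loop stays responsive. Below is a minimal sketch of that pattern; the names `probe` and `wait_until_ready` are illustrative and not from the PR's actual code.

```python
import asyncio
import time

def probe() -> bool:
    # Stand-in for a blocking readiness check (e.g. a synchronous HTTP request).
    time.sleep(0.01)
    return True

async def wait_until_ready(timeout: float = 2.0) -> bool:
    # Poll until the probe succeeds or the deadline passes, without ever
    # blocking the event loop: the blocking probe runs in a worker thread.
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        if await asyncio.to_thread(probe):
            return True
        await asyncio.sleep(0.05)
    return False

print(asyncio.run(wait_until_ready()))  # → True
```

`asyncio.to_thread` (Python 3.9+) is the simplest way to retrofit an existing synchronous helper; the alternative is rewriting the check with an async HTTP client.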
tests/e2e/conftest.py
Outdated
```python
def __init__(self,
             proxy_args: Union[list[str], str] = None,
             env_dict: Optional[dict[str, str]] = None) -> None:
    self.proxy_args = proxy_args
    self.env_dict = env_dict
    self._proc_list = list()
```
The __init__ method of DisaggEpdProxy accepts proxy_args as Union[list[str], str], but it doesn't handle the str case. If a string is passed, _start_disagg_proxy will incorrectly unpack it character by character, and __aenter__ will fail on self.proxy_args.index("--port"). For robustness and consistency with RemoteOpenAIServer, you should handle the string case by splitting it into a list of arguments using shlex.split.
Suggested change:
```python
def __init__(self,
             proxy_args: Union[list[str], str] = None,
             env_dict: Optional[dict[str, str]] = None) -> None:
    if isinstance(proxy_args, str):
        self.proxy_args = shlex.split(proxy_args)
    else:
        self.proxy_args = proxy_args
    self.env_dict = env_dict
    self._proc_list = list()
```
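For reference, `shlex.split` tokenizes a command-line string using shell quoting rules, which is why it is safer than plain whitespace splitting here. The argument string below is illustrative, not from the PR:

```python
import shlex

# A quoted value stays one token; naive str.split would break it apart.
args = shlex.split("--port 8080 --model 'Qwen2.5-VL 7B'")
print(args)  # → ['--port', '8080', '--model', 'Qwen2.5-VL 7B']

# Flags can now be located positionally, as __aenter__ does with "--port".
port = args[args.index("--port") + 1]
print(port)  # → 8080
```

With a raw string instead, `self.proxy_args.index("--port")` would scan individual characters and raise `ValueError`, which is the failure mode the comment describes.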
```python
'{"ec_connector_extra_config":{"shared_storage_path":"' +
SHARED_STORAGE_PATH +
'"},"ec_connector":"ECSharedStorageConnector","ec_role": "ec_producer"}'
```
Constructing JSON strings via concatenation is error-prone and hard to read. It's better to define the structure as a Python dictionary and use json.dumps() to create a valid JSON string. This improves readability and correctness. Please also remember to import json at the top of the file.
Suggested change:
```python
json.dumps({
    "ec_connector_extra_config": {
        "shared_storage_path": SHARED_STORAGE_PATH
    },
    "ec_connector": "ECSharedStorageConnector",
    "ec_role": "ec_producer"
})
```

```python
'{"ec_connector_extra_config":{"shared_storage_path":"' +
SHARED_STORAGE_PATH +
'"},"ec_connector":"ECSharedStorageConnector","ec_role": "ec_consumer"}'
```
Constructing JSON strings via concatenation is error-prone and hard to read. It's better to define the structure as a Python dictionary and use json.dumps() to create a valid JSON string. This improves readability and correctness. Please also remember to import json at the top of the file.
Suggested change:
```python
json.dumps({
    "ec_connector_extra_config": {
        "shared_storage_path": SHARED_STORAGE_PATH
    },
    "ec_connector": "ECSharedStorageConnector",
    "ec_role": "ec_consumer"
})
```
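As a sanity check on the suggestion above, `json.dumps` produces a string that round-trips cleanly through `json.loads`, whereas hand-concatenated JSON silently breaks if the path ever contains a quote or backslash. The `SHARED_STORAGE_PATH` value below is illustrative, not the test's real path:

```python
import json

SHARED_STORAGE_PATH = "/tmp/ec_cache"  # illustrative value only

cfg = json.dumps({
    "ec_connector_extra_config": {"shared_storage_path": SHARED_STORAGE_PATH},
    "ec_connector": "ECSharedStorageConnector",
    "ec_role": "ec_consumer",
})

# Round-trip: the serialized string parses back into the same structure.
parsed = json.loads(cfg)
print(parsed["ec_role"])  # → ec_consumer
print(parsed["ec_connector_extra_config"]["shared_storage_path"])  # → /tmp/ec_cache
```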
👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:
If CI fails, you can run linting and testing checks locally according to the Contributing and Testing guides.
```python
@@ -0,0 +1,629 @@
#!/usr/bin/env python3
```
Is there any difference between this example and https://github.com/vllm-project/vllm/blob/main/examples/online_serving/disaggregated_encoder/disagg_epd_proxy.py?
tests/e2e/conftest.py
Outdated
```python
def _start_vllm_serve(self):
    self.env_dict['VLLM_ALLOW_LONG_MAX_MODEL_LEN'] = "1"
    self.env_dict['VLLM_USE_V1'] = "1"
```
```python
self.env_dict['VLLM_USE_V1'] = "1"
```
Any progress? If this PR is still alive, please rebase to main and make CI happy; otherwise you can close it. Thanks.
Signed-off-by: wangyu31577 <wangyu31577@hundsun.com>
This pull request has conflicts, please resolve those before we can evaluate the pull request.
Signed-off-by: wangyu <53896905+yenuo26@users.noreply.github.com>
```shell
--max-num-batched-tokens 114688 \
--max-num-seqs 128 \
--ec-transfer-config '{
    "ec_connector": "ECSharedStorageConnector",
```
…to qwen3next_rebase (merge of 'main' from https://github.com/vllm-project/vllm-ascend):
- [Patch] Remove the patch of MiniCPM (vllm-project#5975)
- [P/D] layerwise connector support recompute scheduler (vllm-project#5900)
- [CI] Add workflow support for lint image build (vllm-project#6489)
- [Bugfix] Fix problematic dummy_run & improper input_batch_size in eagle (vllm-project#6517)
- [Refactor] 310p_e2e test case update (vllm-project#6539)
- [Refactor] refactor p2p connector (vllm-project#6551)
- [Refactor] refactor 310p attention impl and add ut (vllm-project#6579)
- [Refactor] refactor 310p ops and add ut (vllm-project#6591)
- [Ops][Refactor] Remove custom rotary_embedding operator (vllm-project#6523)
- [Lint] Style: Convert `vllm-ascend/` to ruff format (new Batch vllm-project#8) (vllm-project#6604)
- [Test] Add initial multi modal cases of Qwen2.5-VL-7B-Instruct for disaggregated encoder (vllm-project#5301)
- [CI] Fix broken CI (vllm-project#6599)
- [Lint] Style: Convert `vllm-ascend/` to ruff format (Batch vllm-project#10) (vllm-project#6173)
- [Lint] Style: Convert `vllm-ascend/` to ruff format (Batch vllm-project#11) (vllm-project#6176)
- [Lint] Style: Convert `vllm-ascend/` to ruff format (Batch vllm-project#8) (vllm-project#6129)
- [Lint] Style: Convert `vllm-ascend/` to ruff format (Batch vllm-project#7) (vllm-project#6023)
- [CI][Misc] Some improvement for github action (vllm-project#6587)
- [Image] Bump mooncake version to v0.3.8.post1 (vllm-project#6428)
Hi! The nightly test failed; there seem to be some errors that you need to resolve.
OK, I will check it.
[Test] Add initial multi modal cases of Qwen2.5-VL-7B-Instruct for disaggregated encoder (vllm-project#5301)

### What this PR does / why we need it?
This PR adds disaggregated encoder tests for Qwen2.5-VL-7B-Instruct.

### Does this PR introduce _any_ user-facing change?
No

### How was this patch tested?
By running the test and by running CI.

- vLLM version: release/v0.12.0

Signed-off-by: wangyu31577 <wangyu31577@hundsun.com>
Signed-off-by: wangyu <53896905+yenuo26@users.noreply.github.com>
Co-authored-by: wangyu31577 <wangyu31577@hundsun.com>
Signed-off-by: momochenchuw <chenchuw@huawei.com>
… to `.yaml` (#6503)

### What this PR does / why we need it?
This PR refactors the nightly single-node model test by migrating test configurations from Python scripts to a more maintainable `YAML-based` format.

| Original PR | Python (`.py`) | YAML (`.yaml`) |
| :--- | :--- | :--- |
| #3568 | `test_deepseek_r1_0528_w8a8_eplb.py` | `DeepSeek-R1-0528-W8A8.yaml` |
| #3631 | `test_deepseek_r1_0528_w8a8.py` | `DeepSeek-R1-0528-W8A8.yaml` |
| #5874 | `test_deepseek_r1_w8a8_hbm.py` | `DeepSeek-R1-W8A8-HBM.yaml` |
| #3908 | `test_deepseek_v3_2_w8a8.py` | `DeepSeek-V3.2-W8A8.yaml` |
| #5682 | `test_kimi_k2_thinking.py` | `Kimi-K2-Thinking.yaml` |
| #4111 | `test_mtpx_deepseek_r1_0528_w8a8.py` | `MTPX-DeepSeek-R1-0528-W8A8.yaml` |
| #3733 | `test_prefix_cache_deepseek_r1_0528_w8a8.py` | `Prefix-Cache-DeepSeek-R1-0528-W8A8.yaml` |
| #6543 | `test_qwen3_235b_w8a8.py` | `Qwen3-235B-A22B-W8A8.yaml` |
| #6543 | `test_qwen3_235b_a22b_w8a8_eplb.py` | `Qwen3-235B-A22B-W8A8.yaml` |
| #3973 | `test_qwen3_30b_w8a8.py` | `Qwen3-30B-A3B-W8A8.yaml` |
| #3541 | `test_qwen3_32b_int8.py` | `Qwen3-32B-Int8.yaml` |
| #3757 | `test_qwq_32b.py` | `QwQ-32B.yaml` |
| #5616 | `test_qwen3_next_w8a8.py` | `Qwen3-Next-80B-A3B-Instruct-W8A8.yaml` |
| #3541 | `test_qwen2_5_vl_7b.py` | `Qwen2.5-VL-7B-Instruct.yaml` |
| #5301 | `test_qwen2_5_vl_7b_epd.py` | `Qwen2.5-VL-7B-Instruct-EPD.yaml` |
| #3707 | `test_qwen2_5_vl_32b.py` | `Qwen2.5-VL-32B-Instruct.yaml` |
| #3676 | `test_qwen3_32b_int8_a3_feature_stack3.py` | `Qwen3-32B-Int8-A3-Feature-Stack3.yaml` |
| #3709 | `test_prefix_cache_qwen3_32b_int8.py` | `Prefix-Cache-Qwen3-32B-Int8.yaml` |
| #5395 | `test_qwen3_next.py` | `Qwen3-Next-80B-A3B-Instruct-A2.yaml` |
| #3474 | `test_qwen3_32b.py` | `Qwen3-32B.yaml` |
| #3541 | `test_qwen3_32b_int8.py` | `Qwen3-32B-Int8-A2.yaml` |

### Does this PR introduce _any_ user-facing change?
### How was this patch tested?
- vLLM version: v0.15.0
- vLLM main: https://github.com/vllm-project/vllm/commit/v0.15.0

Signed-off-by: MrZ20 <2609716663@qq.com>
What this PR does / why we need it?
This PR adds disaggregated encoder tests for Qwen2.5-VL-7B-Instruct.
Does this PR introduce any user-facing change?
No
How was this patch tested?
By running the test locally and by running CI.
Test Result
test case: (screenshots omitted)
local:
ci:
examples: (screenshot omitted)