[CI][BugFix] Qwen3-Next nightly test fix. by InSec · Pull Request #6247 · vllm-project/vllm-ascend

InSec · 2026-01-26T03:25:31Z

What this PR does / why we need it?

Qwen3-Next nightly test fix. Temporarily avoid the accuracy issue in the full graph mode.

Does this PR introduce any user-facing change?

N/A

How was this patch tested?

vLLM version: v0.14.1
vLLM main: vllm-project/vllm@d682094

Signed-off-by: InSec <1790766300@qq.com>

github-actions · 2026-01-26T03:25:45Z

👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:‌‌

A PR should do only one thing, smaller PRs enable faster reviews.
Every PR should include unit tests and end-to-end tests ‌to ensure it works and is not broken by other future PRs.
Write the commit message by fulfilling the PR description to help reviewer and future developers understand.

If CI fails, you can run linting and testing checks locally according Contributing and Testing.

gemini-code-assist

Code Review

This pull request provides a temporary fix for a nightly test failure involving the Qwen3-Next model. The changes are focused on avoiding an accuracy issue that occurs in full graph mode. To achieve this, the FULL_DECODE_ONLY CUDA graph mode is disabled by removing the --compilation-config server argument. Additionally, new server arguments (--async-scheduling, --no-enable-prefix-caching, --enable-expert-parallel) are introduced as part of the workaround. The test's scope is also narrowed by reducing the range of MAX_NUM_BATCHED_TOKENS. These changes appear to be a clear and targeted approach for a temporary CI fix.

…to qwen3next_rebase * 'main' of https://github.com/vllm-project/vllm-ascend: (86 commits) [refactor] refactor excute_model and _dymmy_run method (vllm-project#6043) [Refactor] profiler config optimze (vllm-project#6141) [Graph][Fusion] Add MatmulAllReduceAddRMSNorm graph fusion for npugraph_ex. (vllm-project#6006) [UT]: refactoring 310p ops ut (vllm-project#6296) [Refact.]: refactoring 310p-kv cache allocator, align with main branch (vllm-project#6270) [Misc] Removes unnecessary graph size re-initialization (vllm-project#6280) [Main2Main] Upgrade vllm commit to 0123 (vllm-project#6169) [BugFix] Fix wheel package build workflow (vllm-project#6276) [CI][BugFix] Qwen3-Next nightly test fix. (vllm-project#6247) [Doc] quick fix for vllm-ascend version (vllm-project#6278) [Community] Nominate whx-sjtu as maintainer (vllm-project#6268) [Lint] Fix mypy issue to make CI happy (vllm-project#6272) BugFix: Fix moe_load accumulation error in ACL graph mode (vllm-project#6182) [Patch] Remove the patch of ECExampleConnector (vllm-project#5976) [Bugfix] Fix PP+PCP and PP+flashcomm1 bugs (vllm-project#5416) [Feat] proxy delay to remove instances (vllm-project#5934) [CI] Add workfolw_dispatch for nightly image build (vllm-project#6269) [bugfix][npugraph_ex]fix static kernel uninstall issue (vllm-project#6128) [Doc] 310P Documents update (vllm-project#6246) [Feature] Mooncake connector get remote ptp size (vllm-project#5822) ...

### What this PR does / why we need it? Qwen3-Next nightly test fix. Temporarily avoid the accuracy issue in the **full graph** mode. ### Does this PR introduce _any_ user-facing change? N/A ### How was this patch tested? - vLLM version: v0.14.1 - vLLM main: vllm-project/vllm@d682094 Signed-off-by: InSec <1790766300@qq.com>

### What this PR does / why we need it? Qwen3-Next nightly test fix. Temporarily avoid the accuracy issue in the **full graph** mode. ### Does this PR introduce _any_ user-facing change? N/A ### How was this patch tested? - vLLM version: v0.14.1 - vLLM main: vllm-project/vllm@d682094 Signed-off-by: InSec <1790766300@qq.com> Signed-off-by: momochenchuw <chenchuw@huawei.com>

### What this PR does / why we need it? Qwen3-Next nightly test fix. Temporarily avoid the accuracy issue in the **full graph** mode. ### Does this PR introduce _any_ user-facing change? N/A ### How was this patch tested? - vLLM version: v0.14.1 - vLLM main: vllm-project/vllm@d682094 Signed-off-by: InSec <1790766300@qq.com> Signed-off-by: zrj026 <zhangrunjiang026@gmail.com>

### What this PR does / why we need it? Qwen3-Next nightly test fix. Temporarily avoid the accuracy issue in the **full graph** mode. ### Does this PR introduce _any_ user-facing change? N/A ### How was this patch tested? - vLLM version: v0.14.1 - vLLM main: vllm-project/vllm@d682094 Signed-off-by: InSec <1790766300@qq.com>

### What this PR does / why we need it? Qwen3-Next nightly test fix. Temporarily avoid the accuracy issue in the **full graph** mode. ### Does this PR introduce _any_ user-facing change? N/A ### How was this patch tested? - vLLM version: v0.14.1 - vLLM main: vllm-project/vllm@d682094 Signed-off-by: InSec <1790766300@qq.com> Signed-off-by: zrj026 <zhangrunjiang026@gmail.com>

### What this PR does / why we need it? Qwen3-Next nightly test fix. Temporarily avoid the accuracy issue in the **full graph** mode. ### Does this PR introduce _any_ user-facing change? N/A ### How was this patch tested? - vLLM version: v0.14.1 - vLLM main: vllm-project/vllm@d682094 Signed-off-by: InSec <1790766300@qq.com>

[CI][BugFix] Qwen3-Next nightly test fix.

e425ed2

Signed-off-by: InSec <1790766300@qq.com>

InSec requested review from Yikun and wangxiyuan as code owners January 26, 2026 03:25

github-actions Bot added ci/build module:tests labels Jan 26, 2026

gemini-code-assist Bot reviewed Jan 26, 2026

View reviewed changes

wangxiyuan merged commit 595b57c into vllm-project:main Jan 26, 2026
18 checks passed

wangxiyuan mentioned this pull request Feb 24, 2026

[Misc]: test #6787

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[CI][BugFix] Qwen3-Next nightly test fix.#6247

[CI][BugFix] Qwen3-Next nightly test fix.#6247
wangxiyuan merged 1 commit intovllm-project:mainfrom
InSec:qwen3_next_ci_bugfix_3

InSec commented Jan 26, 2026 •

edited by github-actions Bot

Loading

Uh oh!

github-actions Bot commented Jan 26, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

InSec commented Jan 26, 2026 • edited by github-actions Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What this PR does / why we need it?

Does this PR introduce any user-facing change?

How was this patch tested?

Uh oh!

github-actions Bot commented Jan 26, 2026

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

InSec commented Jan 26, 2026 •

edited by github-actions Bot

Loading