[doc] Update Qwen3-235B doc for reproducing latest performance #5323
MengqingCao merged 9 commits into vllm-project:main from
Conversation
Code Review
This pull request adds a new tutorial section for achieving the best performance with the Qwen3-235B model. The documentation includes scripts for both single-node and multi-node (prefill-decode disaggregation) setups. I've found several critical configuration errors in the provided scripts that would prevent users from successfully running the tutorial. My review includes corrections for these issues to ensure the documentation is accurate and the scripts are functional.
👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:
If CI fails, you can run linting and testing checks locally according to Contributing and Testing.
Please fix lint and update the PR message.
…to eplb_refactor

* 'main' of https://github.com/vllm-project/vllm-ascend: (46 commits)
  * [Feature] Support to use fullgraph with eagle (vllm-project#5118)
  * [EPLB][refactor] Modification of the initialization logic for expert_map and log2phy (depends on pr5285) (vllm-project#5311)
  * [Refactor] 6/N Extract common code of class AscendMLAImpl (vllm-project#5314)
  * [Refactor] cache cos/sin in mla & remove parameter model in builder (vllm-project#5277)
  * update vllm pin to 12.27 (vllm-project#5412)
  * [ReleaseNote] Add release note for v0.13.0rc1 (vllm-project#5334)
  * [Bugfix] Correctly handle the output shape in multimodal attention (vllm-project#5443)
  * Fix nightly (vllm-project#5413)
  * [bugfix] fix typo of _skip_all_reduce_across_dp_group (vllm-project#5435)
  * [Doc] modify pcp tutorial doc (vllm-project#5440)
  * [Misc] fast fail for exiting if tools/install_flash_infer_attention_score_ops_a2.sh (vllm-project#5422)
  * [Doc] Update DeepSeek V3.1/R1 2P1D doc (vllm-project#5387)
  * [DOC] Fix model weight download links (vllm-project#5436)
  * [Doc] Modify DeepSeek-R1/V3.1 documentation (vllm-project#5426)
  * Revert "[feat] enable hierarchical mc2 ops on A2 by default (vllm-project#5300)" (vllm-project#5434)
  * [Bugfix] fix greedy temperature detection (vllm-project#5417)
  * [doc] Update Qwen3-235B doc for reproducing latest performance (vllm-project#5323)
  * [feat] enable hierarchical mc2 ops on A2 by default (vllm-project#5300)
  * [Doc] delete environment variable HCCL_OP_EXPANSION_MODE in DeepSeek V3.1/R1 (vllm-project#5419)
  * [Doc] add long_sequence feature user guide (vllm-project#5343)
  * ...
…project#5323)

### What this PR does / why we need it?

This PR updates the Qwen3-235B doc to give a simple recipe for reproducing our latest performance on Atlas A3 servers.

- vLLM version: release/v0.13.0
- vLLM main: vllm-project/vllm@5fbfa8d

Signed-off-by: Angazenn <supperccell@163.com>
Signed-off-by: zrj026 <zhangrunjiang026@gmail.com>
What this PR does / why we need it?
This PR updates the Qwen3-235B doc to give a simple recipe for reproducing our latest performance on Atlas A3 servers.
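For context, a single-node launch for a large MoE model like Qwen3-235B typically looks like the sketch below. This is an illustrative assumption, not the tutorial's actual script: the model path, parallel sizes, and context length are placeholders, and the authoritative flags live in the updated doc itself.

```shell
# Illustrative sketch only -- see the updated Qwen3-235B tutorial for the real recipe.
# Assumes 8 NPUs on a single Atlas A3 node; sizes below are placeholder assumptions.
vllm serve Qwen/Qwen3-235B-A22B \
    --tensor-parallel-size 8 \
    --enable-expert-parallel \
    --max-model-len 32768
```

The multi-node prefill-decode disaggregation setup in the tutorial splits prefill and decode across separate node groups and requires additional connector configuration beyond this single-node sketch.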
Does this PR introduce any user-facing change?
How was this patch tested?