[Doc] Refact benchmark doc by Potabk · Pull Request #5173 · vllm-project/vllm-ascend

Potabk · 2025-12-18T12:22:08Z

What this PR does / why we need it?

Refactors the outdated performance benchmark documentation.

Does this PR introduce any user-facing change?

How was this patch tested?

vLLM version: v0.12.0
vLLM main: vllm-project/vllm@ad32e3e

gemini-code-assist

Code Review

This pull request significantly refactors and improves the performance benchmark documentation, making it much more comprehensive and easier for users to follow. The new structure with detailed sections for datasets and various benchmark types is a great improvement. I've identified a few minor issues in the example commands, such as typos in arguments and a hardcoded path, which could cause problems for users. Correcting these will enhance the quality and usability of the documentation.

Signed-off-by: wangli <wangli858794774@gmail.com>

github-actions · 2025-12-18T12:27:40Z

👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:

A PR should do only one thing; smaller PRs enable faster reviews.
Every PR should include unit tests and end-to-end tests to ensure it works and is not broken by future PRs.
Write the commit message by filling out the PR description to help reviewers and future developers understand the change.

If CI fails, you can run linting and testing checks locally according to Contributing and Testing.

…to eplb_refactor

* 'main' of https://github.com/vllm-project/vllm-ascend: (52 commits)
[Doc]Add the user_guide doc file regarding fine-grained TP. (vllm-project#5084)
[pref] qwen3_next add triton ops : fused_sigmoid_gating_delta_rule_update (vllm-project#4818)
[Feature] Add token mask for DispatchGmmCombineDecode operator (vllm-project#5171)
[CI] Improve CI (vllm-project#5078)
[Refactor] remove some metadata variables in attention_v1. (vllm-project#5160)
Add Qwen3-VL-235B-A22B-Instruct tutorials (vllm-project#5167)
[Doc] Add a perf tune section (vllm-project#5127)
[Image] Refactor image build (vllm-project#5175)
[refactor] refactor weight trans nz and transpose (vllm-project#4878)
[BugFix]Fix precision issue for LoRA feature (vllm-project#4141)
【Doc】Deepseekv3.1/R1 doc enhancement (vllm-project#4827)
support basic long_seq feature st (vllm-project#5140)
[Bugfix] install trition for test_custom_op (vllm-project#5112)
[2/N][Pangu][MoE] Remove Pangu Related Code (vllm-project#5130)
[bugfix] Use FUSED_MC2 MoE comm path for the op `dispatch_ffn_combine` (vllm-project#5156)
[BugFix] Fix top_p,top_k issue with EAGLE and add top_p,top_k in EAGLE e2e (vllm-project#5131)
[Doc][P/D] Fix MooncakeConnector's name (vllm-project#5172)
[Bugfix] Fix in_profile_run in mtp_proposer dummy_run (vllm-project#5165)
[Doc] Refact benchmark doc (vllm-project#5173)
[Nightly] Avoid max_model_len being smaller than the decoder prompt to prevent single-node-accuray-tests from failing (vllm-project#5174)
...

Signed-off-by: 白永斌 <baiyongbin3@h-partners.com>

### What this PR does / why we need it?
Refactor some outdated doc
- vLLM version: v0.12.0
- vLLM main: vllm-project/vllm@ad32e3e

Signed-off-by: wangli <wangli858794774@gmail.com>
Signed-off-by: zrj026 <zhangrunjiang026@gmail.com>

gemini-code-assist bot reviewed Dec 18, 2025

Comment thread docs/source/developer_guide/performance_and_debug/performance_benchmark.md Outdated

Comment thread docs/source/developer_guide/performance_and_debug/performance_benchmark.md

Comment thread docs/source/developer_guide/performance_and_debug/performance_benchmark.md Outdated

Potabk force-pushed the doc branch from 1f7cf39 to 5534486 Compare December 18, 2025 12:26

refact benchmark doc

b444999

Signed-off-by: wangli <wangli858794774@gmail.com>

Potabk force-pushed the doc branch from 5534486 to b444999 Compare December 18, 2025 12:27

github-actions bot added the documentation label (Improvements or additions to documentation) Dec 18, 2025

wangxiyuan approved these changes Dec 18, 2025

wangxiyuan merged commit 7d32371 into vllm-project:main Dec 18, 2025
16 checks passed

wangxiyuan merged 1 commit into vllm-project:main from Potabk:doc
