[CI/Build][Doc] Fully deprecate old bench scripts for serving / throughput / latency #24411

yeqcharlotte · 2025-09-08T04:39:51Z

Purpose

Follow up of #21355. Delete old benchmarks/benchmark_(latency|throughput|serving).py to avoid confusion for contributing into these scripts.

Given CI has been running on the new script for 1 month so far, it should be fine to delete these scripts.

Test Plan

Make sure no more references in: https://github.com/search?q=repo%3Avllm-project%2Fvllm+%2Fbenchmark_%28latency%7Cserving%7Cthroughput%29.py%2F&type=code

Test current output:

python benchmarks/benchmark_latency.py

Test Result

terminal shows it prints deprecation and exits 1

Essential Elements of an Effective PR Description Checklist

The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
The test plan, such as providing test command.
The test results, such as pasting the results comparison before and after, or e2e results
(Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
(Optional) Release notes update. If your change is user facing, please update the release notes draft in the Google Doc.

gemini-code-assist

Code Review

This pull request effectively deprecates the old benchmark scripts (benchmark_latency.py, benchmark_serving.py, benchmark_throughput.py) by replacing their contents with a helpful message pointing to the new vLLM CLI commands. The documentation in benchmarks/README.md is also updated accordingly. My main feedback is to improve the deprecation scripts to exit with a non-zero status code and print to stderr, which is crucial for automation and CI/CD pipelines to correctly detect that the old scripts are no longer functional.

benchmarks/benchmark_latency.py

benchmarks/benchmark_serving.py

benchmarks/benchmark_throughput.py

benchmarks/benchmark_latency.py

.claude/settings.local.json

mergify · 2025-09-08T07:33:41Z

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @yeqcharlotte.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

docs/contributing/profiling.md

.gitignore

hmellor · 2025-09-08T08:25:49Z

benchmarks/README.md

Maybe something for a follow up PR but could the content of this README be moved to docs/contributing/benchmarks.md? Right now the information about benchmarking in the docs is quite sparse and this README contains loads of useful information

that's a good idea. possibly i can follow up with @ywang96 to see if we can take a more systematic approach about all these random benchmark scripts. :D

Signed-off-by: Ye (Charlotte) Qi <[email protected]>

hmellor

LGTM!

…ghput / latency (vllm-project#24411) Signed-off-by: Ye (Charlotte) Qi <[email protected]>

…ghput / latency (vllm-project#24411) Signed-off-by: Ye (Charlotte) Qi <[email protected]> Signed-off-by: xuebwang-amd <[email protected]>

yeqcharlotte requested a review from hmellor as a code owner September 8, 2025 04:39

yeqcharlotte requested review from ywang96 and removed request for hmellor September 8, 2025 04:39

mergify bot added documentation Improvements or additions to documentation performance Performance-related issues labels Sep 8, 2025

gemini-code-assist bot reviewed Sep 8, 2025

View reviewed changes

benchmarks/benchmark_latency.py Outdated Show resolved Hide resolved

benchmarks/benchmark_serving.py Outdated Show resolved Hide resolved

benchmarks/benchmark_throughput.py Outdated Show resolved Hide resolved

yeqcharlotte requested a review from DarkLight1337 September 8, 2025 04:43

yeqcharlotte added the ready ONLY add when PR is ready to merge/full CI is needed label Sep 8, 2025

DarkLight1337 reviewed Sep 8, 2025

View reviewed changes

benchmarks/benchmark_latency.py Outdated Show resolved Hide resolved

DarkLight1337 reviewed Sep 8, 2025

View reviewed changes

.claude/settings.local.json Outdated Show resolved Hide resolved

mergify bot added the needs-rebase label Sep 8, 2025

hmellor reviewed Sep 8, 2025

View reviewed changes

docs/contributing/profiling.md Outdated Show resolved Hide resolved

hmellor reviewed Sep 8, 2025

View reviewed changes

.gitignore Outdated Show resolved Hide resolved

hmellor reviewed Sep 8, 2025

View reviewed changes

yeqcharlotte force-pushed the delete_old_bench branch 2 times, most recently from 7c5cf6e to 77f9f70 Compare September 9, 2025 08:10

yeqcharlotte added 5 commits September 9, 2025 01:11

clean up messages

708ce4c

Signed-off-by: Ye (Charlotte) Qi <[email protected]>

doc fix

8f1b3a9

Signed-off-by: Ye (Charlotte) Qi <[email protected]>

2 more doc fixes

d65b411

Signed-off-by: Ye (Charlotte) Qi <[email protected]>

exit 1

4d14437

Signed-off-by: Ye (Charlotte) Qi <[email protected]>

remove claude local settings and add them to .gitignore

cefc4da

Signed-off-by: Ye (Charlotte) Qi <[email protected]>

yeqcharlotte force-pushed the delete_old_bench branch from 77f9f70 to cefc4da Compare September 9, 2025 08:11

mergify bot removed the needs-rebase label Sep 9, 2025

hmellor approved these changes Sep 9, 2025

View reviewed changes

hmellor enabled auto-merge (squash) September 9, 2025 08:15

hmellor merged commit 6fb2788 into vllm-project:main Sep 9, 2025
17 checks passed

eicherseiji pushed a commit to eicherseiji/vllm that referenced this pull request Sep 9, 2025

[CI/Build][Doc] Fully deprecate old bench scripts for serving / throu…

f83e6a2

…ghput / latency (vllm-project#24411) Signed-off-by: Ye (Charlotte) Qi <[email protected]>

gshtras mentioned this pull request Sep 11, 2025

[Bug]: vllm bench serve isn't an exact replacement of benchmark_serving.py #24684

Open

1 task

skyloevil pushed a commit to skyloevil/vllm that referenced this pull request Sep 13, 2025

[CI/Build][Doc] Fully deprecate old bench scripts for serving / throu…

a35ecf7

…ghput / latency (vllm-project#24411) Signed-off-by: Ye (Charlotte) Qi <[email protected]>

yeqcharlotte mentioned this pull request Sep 14, 2025

[Docs] move benchmarks README to contributing guides #24820

Merged

5 tasks

FeiDaLI pushed a commit to FeiDaLI/vllm that referenced this pull request Sep 25, 2025

[CI/Build][Doc] Fully deprecate old bench scripts for serving / throu…

1e6c954

…ghput / latency (vllm-project#24411) Signed-off-by: Ye (Charlotte) Qi <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[CI/Build][Doc] Fully deprecate old bench scripts for serving / throughput / latency #24411

[CI/Build][Doc] Fully deprecate old bench scripts for serving / throughput / latency #24411

Uh oh!

yeqcharlotte commented Sep 8, 2025 •

edited by github-actions bot

Loading

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

mergify bot commented Sep 8, 2025

Uh oh!

Uh oh!

Uh oh!

hmellor Sep 8, 2025 •

edited

Loading

Uh oh!

yeqcharlotte Sep 9, 2025

Uh oh!

hmellor left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

[CI/Build][Doc] Fully deprecate old bench scripts for serving / throughput / latency #24411

[CI/Build][Doc] Fully deprecate old bench scripts for serving / throughput / latency #24411

Uh oh!

Conversation

yeqcharlotte commented Sep 8, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

Test Plan

Test Result

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

mergify bot commented Sep 8, 2025

Uh oh!

Uh oh!

Uh oh!

hmellor Sep 8, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

yeqcharlotte Sep 9, 2025

Choose a reason for hiding this comment

Uh oh!

hmellor left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

yeqcharlotte commented Sep 8, 2025 •

edited by github-actions bot

Loading

hmellor Sep 8, 2025 •

edited

Loading