[Benchmark] Cleanup deprecated nightly benchmark and adjust the docstring for performance benchmark#25786

Merged
yeqcharlotte merged 12 commits into vllm-project:main from KuntaiDu:kuntai-remove-nightly-bench
Oct 30, 2025

Conversation

@KuntaiDu
Collaborator

@KuntaiDu KuntaiDu commented Sep 26, 2025

Purpose

Clean up the nightly benchmark suite, for two reasons:

  • The code path for comparing vLLM against other engines is no longer maintained.
  • The performance benchmarking CI is now triggered under PyTorch Hub instead of in vLLM's CI pipeline.

Test Plan

Test Result


Essential Elements of an Effective PR Description Checklist
  • The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
  • The test plan, such as providing test command.
  • The test results, such as pasting the results comparison before and after, or e2e results
  • (Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
  • (Optional) Release notes update. If your change is user facing, please update the release notes draft in the Google Doc.

Signed-off-by: KuntaiDu <kuntai@uchicago.edu>
Signed-off-by: KuntaiDu <kuntai@uchicago.edu>
@mergify mergify bot added the documentation, ci/build, and performance labels Sep 26, 2025
Contributor

@gemini-code-assist gemini-code-assist bot left a comment


Code Review

This pull request effectively cleans up the deprecated nightly benchmark suite by removing numerous files and reorganizing the directory structure. The changes align well with the stated purpose. However, I've identified a few inconsistencies in the documentation that were likely overlooked during the refactoring. These include incorrect paths in commands, broken links, and descriptions of functionality that has been removed. I've provided specific comments to help resolve these documentation issues.

@KuntaiDu KuntaiDu requested a review from bigPYJ1151 September 26, 2025 21:46
Signed-off-by: KuntaiDu <kuntai@uchicago.edu>
Signed-off-by: KuntaiDu <kuntai@uchicago.edu>
@yeqcharlotte
Collaborator

Could you fix the markdown pre-commit?

## Performance benchmark quick overview

Old: **Benchmarking Coverage**: latency, throughput and fix-qps serving on A100 (the support for FP8 benchmark on H100 is coming!) and Intel® Xeon® Processors, with different models.
New: **Benchmarking Coverage**: latency, throughput and fix-qps serving on A100, H100 and Intel® Xeon® Processors, with different models.
Collaborator

@yeqcharlotte yeqcharlotte Sep 27, 2025


We have B200 too~

Not blocking this PR. cc: @huydhn @linzebing -- we need to add some new docs for vLLM's continuous profiling and benchmarking runs, especially if these are locally runnable: https://github.com/pytorch/pytorch-integration-testing/tree/main/vllm-benchmarks/benchmarks

Contributor


@namanlalitnyu: can you take this one?

Contributor

@namanlalitnyu namanlalitnyu Sep 27, 2025


Sure, I'll take this up.

But I have a small concern with these changes that we might have to consider: we are migrating the file run-performance-benchmarks.sh from the nightly-benchmarks directory to the performance-benchmarks directory.
For our continuous vLLM benchmarking CI in PyTorch infra to keep working, I think we will also have to update the vllm-benchmark.yml workflow to match these changes once they are merged, since it references the old directory location in a couple of places.
This draft PR (84) is ready and can be merged once these changes are verified on the vLLM side.

cc: @huydhn @linzebing
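The workflow-side change described above amounts to a path rename. A minimal sketch of what the update could look like (the helper name is illustrative, and the two directory paths are taken from this discussion, not from the actual vllm-benchmark.yml):

```python
# Hypothetical helper: rewrite references to the old benchmark directory so a
# workflow file points at the new location introduced by this PR. The paths
# below come from the discussion; the function name is illustrative only.
OLD_DIR = ".buildkite/nightly-benchmarks"
NEW_DIR = ".buildkite/performance-benchmarks"

def migrate_benchmark_paths(workflow_text: str) -> str:
    """Replace every reference to the old directory with the new one."""
    return workflow_text.replace(OLD_DIR, NEW_DIR)

if __name__ == "__main__":
    sample = "bash .buildkite/nightly-benchmarks/scripts/run-performance-benchmarks.sh"
    print(migrate_benchmark_paths(sample))
```

In practice this could just as easily be a one-off `sed` over the workflow file; the point is only that every occurrence of the old prefix needs the same rewrite.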

Contributor

@namanlalitnyu namanlalitnyu Sep 28, 2025


@yeqcharlotte @linzebing
Added the documentation for vLLM continuous profiling and benchmarking in this PR: 25819. But I am not able to explicitly add reviewers to it 😅

Collaborator Author


Thank you @namanlalitnyu and @yeqcharlotte for checking. Please let me know if you would prefer to put the renaming of the nightly-benchmarks folder in a follow-up PR.

Collaborator Author


Also, B200 has been added.

Collaborator

@yeqcharlotte yeqcharlotte left a comment


Thanks for cleaning this up! Please fix the broken links and markdown lints.

@huydhn
Contributor

huydhn commented Sep 29, 2025

For the sake of completeness, here is a benchmark run using the change from this PR: https://github.com/pytorch/pytorch-integration-testing/actions/runs/18112289981

It looks like we need to tweak the workflows on the PyTorch side a bit to handle the missing .buildkite/nightly-benchmarks/tests directory. Also, I don't think we can delete .buildkite/nightly-benchmarks/scripts/run-nightly-benchmarks.sh without moving that script to PyTorch and calling the new script here: https://github.com/pytorch/pytorch-integration-testing/blob/main/.github/workflows/vllm-benchmark.yml#L295

@mergify

mergify bot commented Oct 3, 2025

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @KuntaiDu.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

@mergify mergify bot added the needs-rebase label Oct 3, 2025
@mergify

mergify bot commented Oct 8, 2025

Documentation preview: https://vllm--25786.org.readthedocs.build/en/25786/

@KuntaiDu
Collaborator Author

KuntaiDu commented Oct 8, 2025

Thank you for all the feedback. I am traveling and will be back soon.

@KuntaiDu
Collaborator Author

Now back to work

Signed-off-by: KuntaiDu <kuntai@uchicago.edu>
Signed-off-by: KuntaiDu <kuntai@uchicago.edu>
Signed-off-by: KuntaiDu <kuntai@uchicago.edu>
@mergify mergify bot removed the needs-rebase label Oct 24, 2025
Signed-off-by: KuntaiDu <kuntai@uchicago.edu>
@KuntaiDu KuntaiDu requested a review from yeqcharlotte October 24, 2025 21:32
@KuntaiDu
Collaborator Author

@yeqcharlotte I have updated this PR; it would be great if you could take another look.

@yeqcharlotte
Collaborator

@KuntaiDu could you fix the markdown lints?

Error: .buildkite/performance-benchmarks/README.md:8 MD012/no-multiple-blanks Multiple consecutive blank lines [Expected: 1; Actual: 2]
Error: .buildkite/performance-benchmarks/README.md:19 MD012/no-multiple-blanks Multiple consecutive blank lines [Expected: 1; Actual: 2]
Error: .buildkite/performance-benchmarks/README.md:137 MD012/no-multiple-blanks Multiple consecutive blank lines [Expected: 1; Actual: 2]
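All three errors above are the same MD012 rule (no multiple consecutive blank lines); markdownlint can usually auto-fix these with its `--fix` flag. For illustration, a minimal sketch of what the fix does, collapsing runs of blank lines in the README text (the function name is hypothetical):

```python
import re

# Minimal sketch of an MD012 fix: collapse two or more consecutive blank
# lines into a single blank line, which is what markdownlint expects.
def collapse_blank_lines(text: str) -> str:
    # Three or more newlines in a row means at least two blank lines.
    return re.sub(r"\n{3,}", "\n\n", text)

if __name__ == "__main__":
    readme = "# Title\n\n\nBody paragraph.\n"
    print(repr(collapse_blank_lines(readme)))
```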

Collaborator

@yeqcharlotte yeqcharlotte left a comment


thanks for cleaning this up!

@yeqcharlotte yeqcharlotte enabled auto-merge (squash) October 28, 2025 03:19
@github-actions github-actions bot added the ready label (ONLY add when PR is ready to merge/full CI is needed) Oct 28, 2025
@yeqcharlotte yeqcharlotte merged commit 8bff831 into vllm-project:main Oct 30, 2025
19 checks passed
@KuntaiDu KuntaiDu deleted the kuntai-remove-nightly-bench branch October 30, 2025 20:05
@KuntaiDu
Collaborator Author

@namanlalitnyu It would be nice if we could also start the process of changing the PyTorch CI correspondingly (like this draft PR: pytorch/pytorch-integration-testing#84)

@namanlalitnyu
Contributor

> @namanlalitnyu It would be nice if we could also start the process of changing the PyTorch CI correspondingly (like this draft PR: pytorch/pytorch-integration-testing#84)

@KuntaiDu thanks for the update and the heads up. Yes, we have triggered the CI pipeline with these changes, and once they pass successfully, we will merge the updates there as well.

ilmarkov pushed a commit to neuralmagic/vllm that referenced this pull request Nov 7, 2025
…ring for performance benchmark (vllm-project#25786)

Signed-off-by: KuntaiDu <kuntai@uchicago.edu>
ZhengHongming888 pushed a commit to ZhengHongming888/vllm that referenced this pull request Nov 8, 2025
…ring for performance benchmark (vllm-project#25786)

Signed-off-by: KuntaiDu <kuntai@uchicago.edu>
rtourgeman pushed a commit to rtourgeman/vllm that referenced this pull request Nov 10, 2025
…ring for performance benchmark (vllm-project#25786)

Signed-off-by: KuntaiDu <kuntai@uchicago.edu>
devpatelio pushed a commit to SumanthRH/vllm that referenced this pull request Nov 29, 2025
…ring for performance benchmark (vllm-project#25786)

Signed-off-by: KuntaiDu <kuntai@uchicago.edu>

Labels

ci/build, documentation, performance, ready


5 participants