[Benchmark] Cleanup deprecated nightly benchmark and adjust the docstring for performance benchmark#25786
Conversation
Signed-off-by: KuntaiDu <kuntai@uchicago.edu>
Signed-off-by: KuntaiDu <kuntai@uchicago.edu>
There was a problem hiding this comment.
Code Review
This pull request effectively cleans up the deprecated nightly benchmark suite by removing numerous files and reorganizing the directory structure. The changes align well with the stated purpose. However, I've identified a few inconsistencies in the documentation that were likely overlooked during the refactoring. These include incorrect paths in commands, broken links, and descriptions of functionality that has been removed. I've provided specific comments to help resolve these documentation issues.
Signed-off-by: KuntaiDu <kuntai@uchicago.edu>
Signed-off-by: KuntaiDu <kuntai@uchicago.edu>
|
could you fix markdown precommit? |
| ## Performance benchmark quick overview | ||
|
|
||
| **Benchmarking Coverage**: latency, throughput and fix-qps serving on A100 (the support for FP8 benchmark on H100 is coming!) and Intel® Xeon® Processors, with different models. | ||
| **Benchmarking Coverage**: latency, throughput and fix-qps serving on A100, H100 and Intel® Xeon® Processors, with different models. |
There was a problem hiding this comment.
we have b200 too~
not blocking this PR. cc: @huydhn @linzebing -- we need to add some new docs for vllm's continuous profiling and benchmarking runs! especially if these are locally runnable https://github.com/pytorch/pytorch-integration-testing/tree/main/vllm-benchmarks/benchmarks
There was a problem hiding this comment.
Sure, I'll take this up.
But, I have a small concern with these changes, that we might have to consider. Here, we are migrating the file run-performance-benchmarks.sh from the nightly-benchmarks directory to the performance-benchmarks directory.
In order for our continuous vLLM benchmarking CI to work successfully in Pytorch-infra, I think we might have to update the vllm-benchmark.yml workflow as well, to adapt to these new changes, once they are merged. As, we would be referencing the incorrect directory location at a couple of places there.
This Draft PR (84) is ready and can be merged once these changes are verified from vLLM.
cc: @huydhn @linzebing
There was a problem hiding this comment.
@yeqcharlotte @linzebing
Added the documentation for vLLM continuous profiling and benchmarking in this PR: 25819. But, I am not able to explicitly add reviewers to it 😅
There was a problem hiding this comment.
Thank you @namanlalitnyu and @yeqcharlotte for checking. Please let me know if you prefer to put the renaming of nightly-benchmarks folder in a follow-up PR.
yeqcharlotte
left a comment
There was a problem hiding this comment.
thanks for cleaning this up! please fix some broken links and mkdown lints.
|
For the sake of completeness, here is a round of benchmark https://github.com/pytorch/pytorch-integration-testing/actions/runs/18112289981 running with the change from this PR It looks like we need to tweak the workflows on PyTorch side a bit to handle the missing |
|
This pull request has merge conflicts that must be resolved before it can be |
|
Documentation preview: https://vllm--25786.org.readthedocs.build/en/25786/ |
|
Thank you for all the feedbacks. I am travelling and will be back soon. |
|
Now back to work |
Signed-off-by: KuntaiDu <kuntai@uchicago.edu>
Signed-off-by: KuntaiDu <kuntai@uchicago.edu>
Signed-off-by: KuntaiDu <kuntai@uchicago.edu>
Signed-off-by: KuntaiDu <kuntai@uchicago.edu>
|
@yeqcharlotte I have updated this PR, would be really good if you can take a look again. |
|
@KuntaiDu fix the markdown lints? |
Signed-off-by: KuntaiDu <kuntai@uchicago.edu>
…aiDu/vllm into kuntai-remove-nightly-bench
yeqcharlotte
left a comment
There was a problem hiding this comment.
thanks for cleaning this up!
|
@namanlalitnyu It would be nice if we could also start the process of changing the pytorch CI correspondingly (like this draft PR (pytorch/pytorch-integration-testing#84)) |
|
…ring for performance benchmark (vllm-project#25786) Signed-off-by: KuntaiDu <kuntai@uchicago.edu>
…ring for performance benchmark (vllm-project#25786) Signed-off-by: KuntaiDu <kuntai@uchicago.edu>
…ring for performance benchmark (vllm-project#25786) Signed-off-by: KuntaiDu <kuntai@uchicago.edu>
…ring for performance benchmark (vllm-project#25786) Signed-off-by: KuntaiDu <kuntai@uchicago.edu>
Purpose
Cleanup the nightly benchmark suite because of two reasons:
Test Plan
Test Result
Essential Elements of an Effective PR Description Checklist
supported_models.mdandexamplesfor a new model.