[Benchmark] Add plot utility for parameter sweep #27168
vllm-bot merged 50 commits into vllm-project:main
Conversation
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
Documentation preview: https://vllm--27168.org.readthedocs.build/en/27168/
```bash
python -m vllm.benchmarks.sweep.serve_sla \
    --serve-cmd 'vllm serve meta-llama/Llama-2-7b-chat-hf' \
```
If we want to use an existing vllm serve API address, how should we configure that? Maybe we should add a --serve-host param so the user can point at a vllm server that is already online; in that case the --serve-params param would be ignored.
You can set the server's host via --serve-cmd. And for resetting the server cache after each benchmark run, you can use --after-bench-cmd.
If you mean that the benchmark should not be responsible for launching the server, you can just use a dummy command that sleeps infinitely and adjust --bench-cmd to access the real server. Of course, you should also set --after-bench-cmd in this case.
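To make the dummy-command approach concrete, here is a minimal sketch of what such an invocation might look like. The flag names (--serve-cmd, --bench-cmd, --after-bench-cmd) come from the discussion above; the host/port values and the cache-reset script name are assumptions for illustration only, not part of this PR.

```shell
# Hypothetical sketch: benchmark a server that is already running elsewhere.
# The sweep still requires a serve command, so we give it a no-op that
# sleeps forever, and point the bench command at the real server instead.
python -m vllm.benchmarks.sweep.serve_sla \
    --serve-cmd 'sleep infinity' \
    --bench-cmd 'vllm bench serve --model meta-llama/Llama-2-7b-chat-hf --backend openai --host my-server --port 8000' \
    --after-bench-cmd './reset_server_cache.sh'  # hypothetical script to clear server state between runs
```

Since the sweep never actually manages the remote server's lifecycle here, resetting its cache between runs (via --after-bench-cmd) is the user's responsibility, as noted above.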
I see, so maybe I don't need to set the --serve-cmd param at all; using the --bench-cmd param to run `vllm bench serve --model meta-llama/Llama-2-7b-chat-hf --backend openai` is enough.
Fixed now


Purpose
Follow-up to #27085
Move the parameter sweep scripts into vllm/benchmarks/sweep, abstracting away common code. Add a plot utility in vllm/benchmarks/sweep/plot.py.
cc @lengrongfu
Test Plan
Test Result
Essential Elements of an Effective PR Description Checklist
(Update) supported_models.md and examples for a new model.