Add vllm benchmark #10600

copybara-service · 2024-07-01T20:48:02Z

Add vllm benchmark

Hi,

I was playing around with a vLLM benchmark and wanted to get your opinion on how to best deal with large files in benchmarks. The vLLM benchmark serves a model (facebook/opt-125m) and uses vLLM's own benchmark tooling to query it with a widely-used dataset (..). For development I just mounted the model and dataset into the container-under-test. But that seems impractical for CI/other devs.
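A minimal sketch of the development setup described above, where the model and dataset are bind-mounted into the container instead of being baked into the image. The image name, host paths, and in-container mount points are all placeholders chosen for illustration and are not taken from the PR; only the `docker run -v host:container:ro` bind-mount syntax is standard Docker.

```python
from pathlib import Path


def docker_run_args(image, model_dir, dataset_file,
                    container_model_dir="/model",
                    container_dataset_file="/dataset.json"):
    """Build a `docker run` argument list with read-only bind mounts.

    The mount points are hypothetical defaults, not paths used by the PR.
    """
    return [
        "docker", "run", "--rm",
        # Bind-mount the locally downloaded model directory, read-only.
        "-v", f"{Path(model_dir).resolve()}:{container_model_dir}:ro",
        # Bind-mount the dataset file, read-only.
        "-v", f"{Path(dataset_file).resolve()}:{container_dataset_file}:ro",
        image,
    ]


if __name__ == "__main__":
    # Placeholder image name and paths, for illustration only.
    print(" ".join(docker_run_args("vllm-benchmark:dev", "./opt-125m", "./dataset.json")))
```

This works for local development because the files never enter the image, but every CI runner or developer machine would need the same files at the same host paths, which is the impracticality raised above.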

Should I embed the model files into the benchmark image? Is there another way you would prefer?

Please don't do an actual review yet. Just wanted to get some early feedback. I still have to understand metricsviz and how to incorporate it.

FUTURE_COPYBARA_INTEGRATE_REVIEW=#10512 from derpsteb:vllm-benchmark 1b9f8f3

PiperOrigin-RevId: 648472759

copybara-service bot added the "exported" label (issue was exported automatically) on Jul 1, 2024

copybara-service bot force-pushed the test/cl648472759 branch 2 times, most recently from 59aaeae to 6362a8a, on July 3, 2024 19:52

copybara-service bot force-pushed the test/cl648472759 branch from 6362a8a to ba624b6 on July 3, 2024 20:14
