[LLM Batch][4/N] vLLM engine stage #50270

comaniac · 2025-02-05T23:22:18Z

Why are these changes needed?

This PR introduces vLLM engine stage with the following features:

Decoding models.
Embedding models.
vLLM v0 and vLLM v1.
TP and PP.

This PR doesn't support:

Vision models.
End to end vLLM processor.

Checks

I've signed off every commit(by using the -s flag, i.e., git commit -s) in this PR.
I've run scripts/format.sh to lint the changes in this PR.
I've included any doc changes needed for https://docs.ray.io/en/master/.
- I've added any new APIs to the API Reference. For example, if I added a
  method in Tune, I've added it in doc/source/tune/api/ under the
  corresponding .rst file.
I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
Testing Strategy
- Unit tests
- Release tests
- This PR is not tested :(

ci/docker/llm.build.Dockerfile

GeneDer

LGTM!

python/ray/llm/_internal/batch/stages/vllm_engine_stage.py

comaniac · 2025-02-07T02:07:58Z

@kouroshHakha CI green. PTAL

kouroshHakha

Just small comments around test org + some nits:

python/ray/llm/_internal/batch/stages/vllm_engine_stage.py

kouroshHakha · 2025-02-07T16:54:28Z

python/ray/llm/tests/batch/stages/BUILD

@@ -1,8 +1,15 @@
 load("//bazel:python.bzl", "py_test_module_list")

 py_test_module_list(
-  files = glob(["test_*.py"]),
+  files = glob(["test_*.py"], exclude=["test_vllm_engine_stage.py"]),


so what I am saying is we need to a bit of file reorg to make the BUILD files cleaner. What do you think about this suggestion:

Make the folder structure as following:

python/ray/llm/tests -- batch BUILD ----- gpu -------- stages ----------- vllm_engine_stage_gpu.py ----- cpu ------- stages ...

in the BUILD do sth like

py_test_module_list( files = glob(["cpu/*/test_*.py"]), size = "small", tags = ["exclusive", "team:llm"], deps = ["//:ray_lib"], ) py_test_module_list( files = glob(["gpu/*/test_*.py"]), size = "large", tags = ["exclusive", "gpu", "team:llm"], deps = ["//:ray_lib"], )

Signed-off-by: Cody Yu <[email protected]>

kouroshHakha

Let's goooooo

pcmoritz

I'll approve this to get unblocked :)

Let's make sure we have a docs page and good documentation once this is ready :)

comaniac requested review from a team as code owners February 5, 2025 23:22

comaniac requested review from gvspraveen, raulchen, richardliaw, GeneDer and kouroshHakha February 5, 2025 23:22

comaniac force-pushed the llm-batch-vllm branch from ec4e3ce to 4c2161d Compare February 5, 2025 23:24

aslonnie reviewed Feb 5, 2025

View reviewed changes

ci/docker/llm.build.Dockerfile Outdated Show resolved Hide resolved

GeneDer approved these changes Feb 6, 2025

View reviewed changes

kouroshHakha reviewed Feb 6, 2025

View reviewed changes

comaniac requested a review from a team as a code owner February 7, 2025 00:12

comaniac added go add ONLY when ready to merge, run all tests and removed go add ONLY when ready to merge, run all tests labels Feb 7, 2025

kouroshHakha reviewed Feb 7, 2025

View reviewed changes

comaniac mentioned this pull request Feb 8, 2025

[llm] change llm base from ml base to test base #50355

Merged

comaniac force-pushed the llm-batch-vllm branch from b45d962 to 14450b9 Compare February 10, 2025 17:19

comaniac added 11 commits February 11, 2025 10:10

wip

7205c9c

Signed-off-by: Cody Yu <[email protected]>

wip, test not working

5fdea28

Signed-off-by: Cody Yu <[email protected]>

wip

a407ea5

Signed-off-by: Cody Yu <[email protected]>

wip

93ca581

Signed-off-by: Cody Yu <[email protected]>

done

de89f73

Signed-off-by: Cody Yu <[email protected]>

test

7340fd4

Signed-off-by: Cody Yu <[email protected]>

add init

f421606

Signed-off-by: Cody Yu <[email protected]>

gpu

be9e8b0

Signed-off-by: Cody Yu <[email protected]>

comments

7df1627

Signed-off-by: Cody Yu <[email protected]>

docker

fe5c9f1

Signed-off-by: Cody Yu <[email protected]>

fix

cae77d9

Signed-off-by: Cody Yu <[email protected]>

comaniac added 5 commits February 11, 2025 10:10

fix

d57b39a

Signed-off-by: Cody Yu <[email protected]>

try

f89451d

Signed-off-by: Cody Yu <[email protected]>

test

8d2c32c

Signed-off-by: Cody Yu <[email protected]>

test

7021afe

Signed-off-by: Cody Yu <[email protected]>

revert

37beaf0

Signed-off-by: Cody Yu <[email protected]>

comaniac force-pushed the llm-batch-vllm branch from 498eb7d to 37beaf0 Compare February 11, 2025 18:12

revert

ebd9fc7

Signed-off-by: Cody Yu <[email protected]>

kouroshHakha approved these changes Feb 11, 2025

View reviewed changes

kouroshHakha enabled auto-merge (squash) February 11, 2025 22:28

pcmoritz approved these changes Feb 11, 2025

View reviewed changes

kouroshHakha merged commit 5c53b9e into ray-project:master Feb 11, 2025
6 checks passed

comaniac deleted the llm-batch-vllm branch February 11, 2025 22:36

richardliaw mentioned this pull request Feb 18, 2025

[RFC] LLM APIs for Ray Data and Ray Serve #50639

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[LLM Batch][4/N] vLLM engine stage #50270

[LLM Batch][4/N] vLLM engine stage #50270

comaniac commented Feb 5, 2025

GeneDer left a comment

comaniac commented Feb 7, 2025

kouroshHakha left a comment

kouroshHakha Feb 7, 2025

kouroshHakha left a comment

pcmoritz left a comment

[LLM Batch][4/N] vLLM engine stage #50270

[LLM Batch][4/N] vLLM engine stage #50270

Conversation

comaniac commented Feb 5, 2025

Why are these changes needed?

Checks

GeneDer left a comment

Choose a reason for hiding this comment

comaniac commented Feb 7, 2025

kouroshHakha left a comment

Choose a reason for hiding this comment

kouroshHakha Feb 7, 2025

Choose a reason for hiding this comment

kouroshHakha left a comment

Choose a reason for hiding this comment

pcmoritz left a comment

Choose a reason for hiding this comment