[serve.llm][Fix] retry with model_config arg #52991
Merged: kouroshHakha merged 2 commits into ray-project:master from lk-chen:vllm_chat_utils_api_change on May 15, 2025.
Conversation
Signed-off-by: Linkun <[email protected]>
GeneDer (Member) approved these changes on May 15, 2025, leaving a comment:
Some nits and suggestions, but thanks for fixing it!
python/ray/llm/_internal/serve/deployments/llm/vllm/vllm_engine.py (six review threads, all outdated and resolved)
Signed-off-by: Linkun <[email protected]>
kouroshHakha approved these changes on May 15, 2025.
GeneDer (Member) approved these changes on May 15, 2025, leaving a comment:
Thanks for addressing the comments!
Labels
- community-backlog
- community-contribution (Contributed by the community)
- go (add ONLY when ready to merge, run all tests)
- llm
- serve (Ray Serve Related Issue)
Why are these changes needed?
vLLM changed a (non-external-facing) API in chat_utils.py; we need to adapt to the argument change. See vllm-project/vllm#18098.

Related issue number
Fix #52975
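For context, here is a minimal sketch of the retry pattern the PR title describes: call the vLLM chat-template helper with the pre-change arguments, and if the installed vLLM rejects them, retry with the model_config argument. The helper apply_hf_chat_template lives in vllm.entrypoints.chat_utils, but the exact keyword names and the wrapper render_prompt below are assumptions for illustration, not the PR's verbatim diff.

```python
# Illustrative sketch only: adapting to vLLM's chat_utils argument change
# (vllm-project/vllm#18098). Exact keyword names vary across vLLM versions;
# treat every signature below as an assumption.
from vllm.entrypoints.chat_utils import apply_hf_chat_template


def render_prompt(tokenizer, model_config, conversation, chat_template):
    """Hypothetical wrapper: try the old call shape, retry with model_config."""
    try:
        # Older vLLM expected trust_remote_code to be passed directly.
        return apply_hf_chat_template(
            tokenizer,
            conversation=conversation,
            chat_template=chat_template,
            add_generation_prompt=True,
            trust_remote_code=model_config.trust_remote_code,
        )
    except TypeError:
        # Newer vLLM replaced that argument with the full model_config,
        # so retry with the model_config arg (the fix in this PR's title).
        return apply_hf_chat_template(
            tokenizer,
            conversation=conversation,
            chat_template=chat_template,
            add_generation_prompt=True,
            model_config=model_config,
        )
```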
Checks
- I've signed off every commit (by using the -s flag, i.e., git commit -s) in this PR.
- I've run scripts/format.sh to lint the changes in this PR.
- I've added any new APIs to the API Reference. For example, if I added a method in Tune, I've added it in doc/source/tune/api/ under the corresponding .rst file.