Skip to content

Conversation

@afierka-intel
Copy link

@afierka-intel afierka-intel commented Apr 16, 2025

Original PR #897

synchronize 12 vLLM flags to non-driver workers in Ray executor

FIX "not warmed-up" bucket issue in cross-node vLLM inference.

Root cause: the issue is caused by not synchronizing the 12 vLLM flags
to all the non-driver workers within the Ray cluster


![image](https://github.com/user-attachments/assets/fb51cefc-b23a-434d-a641-493592d896a6)

---------

Co-authored-by: Michał Kuligowski <[email protected]>
@michalkuligowski
Copy link

/run-gaudi-tests

@michalkuligowski michalkuligowski merged commit 5d30a8f into v1.21.0_next Apr 16, 2025
42 checks passed
@michalkuligowski michalkuligowski deleted the dev/afierka/1.21-fix-corss-nodes-flags branch April 16, 2025 16:10
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants