Skip to content

ci/build/feat: bump vLLM libs to v0.4.2 and other deps in Dockerfile.ubi#23

Merged
njhill merged 3 commits intomainfrom
vllm-0.4.2
May 8, 2024
Merged

ci/build/feat: bump vLLM libs to v0.4.2 and other deps in Dockerfile.ubi#23
njhill merged 3 commits intomainfrom
vllm-0.4.2

Conversation

@tjohnson31415
Copy link
Member

Changes:

  • vLLM v0.4.2 was published today, update our build to use pre-built libs from their wheel
  • bump other dependencies in the image build (base UBI image, miniforge, flash attention, grpcio-tools, accelerate)
  • little cleanup to remove PYTORCH_ args that are no longer used

@tjohnson31415 tjohnson31415 changed the title ci/build/feat: bump versions in Dockerfile.ubi build ci/build/feat: bump vLLM to v0.4.2 and other deps in Dockerfile.ubi May 6, 2024
@tjohnson31415 tjohnson31415 changed the title ci/build/feat: bump vLLM to v0.4.2 and other deps in Dockerfile.ubi ci/build/feat: bump vLLM libs to v0.4.2 and other deps in Dockerfile.ubi May 6, 2024
Copy link
Contributor

@njhill njhill left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @tjohnson31415 this looks great!

Signed-off-by: Travis Johnson <tsjohnso@us.ibm.com>
Signed-off-by: Travis Johnson <tsjohnso@us.ibm.com>
Signed-off-by: Travis Johnson <tsjohnso@us.ibm.com>
@njhill njhill merged commit c737a7a into main May 8, 2024
@njhill njhill deleted the vllm-0.4.2 branch May 8, 2024 22:30
tdoublep pushed a commit that referenced this pull request Jan 20, 2025
This PR tries to implement (and tracks the progress of) the warmup of
multiple different `prompt-length/max-decode` shapes.

---------

Co-authored-by: Yannick Schnider <Yannick.Schnider1@ibm.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants