chore: vllm 0.10.1.1 #2641
Conversation
/ok to test 43d9217
Walkthrough
Bumps vLLM from 0.10.1 to 0.10.1.1 across the container build, install script, and Python optional dependency. Updates the VLLM_REF commit hash and the corresponding precompiled wheel URL. No logic or control-flow changes.
Estimated code review effort: 🎯 2 (Simple) | ⏱️ ~8 minutes
Actionable comments posted: 0
Caution
Some comments are outside the diff and can’t be posted inline due to platform limitations.
⚠️ Outside diff range comments (1)
container/deps/vllm/install_vllm.sh (1)
89-96: Bash usage help prints a literal "f" due to Python-style f-strings.
The stray f prefix is printed verbatim and makes the CLI help look unpolished. Replace these lines with standard bash echo.
Apply this diff:
```diff
-    echo f"  --vllm-ref REF            Git reference to checkout (default: ${VLLM_REF})"
-    echo f"  --max-jobs NUM            Maximum number of parallel jobs (default: ${MAX_JOBS})"
+    echo "  --vllm-ref REF            Git reference to checkout (default: ${VLLM_REF})"
+    echo "  --max-jobs NUM            Maximum number of parallel jobs (default: ${MAX_JOBS})"
     echo "  --arch ARCH               Architecture (amd64|arm64, default: auto-detect)"
-    echo f"  --installation-dir DIR    Directory to install vllm (default: ${INSTALLATION_DIR})"
-    echo f"  --deepgemm-ref REF        Git reference for DeepGEMM (default: ${DEEPGEMM_REF})"
-    echo f"  --flashinf-ref REF        Git reference for Flash Infer (default: ${FLASHINF_REF})"
-    echo f"  --torch-backend BACKEND   Torch backend to use (default: ${TORCH_BACKEND})"
+    echo "  --installation-dir DIR    Directory to install vllm (default: ${INSTALLATION_DIR})"
+    echo "  --deepgemm-ref REF        Git reference for DeepGEMM (default: ${DEEPGEMM_REF})"
+    echo "  --flashinf-ref REF        Git reference for Flash Infer (default: ${FLASHINF_REF})"
+    echo "  --torch-backend BACKEND   Torch backend to use (default: ${TORCH_BACKEND})"
```
🧹 Nitpick comments (2)
container/deps/vllm/install_vllm.sh (2)
23-27: Add a guard for precompiled vLLM wheel availability
We've verified that the current VLLM_PRECOMPILED_WHEEL_LOCATION resolves successfully (HTTP 200), but to avoid lengthy source builds if a future wheel URL goes missing, it's still worthwhile to add a fast HEAD check with a fallback.
• File: container/deps/vllm/install_vllm.sh
• Insert immediately after the VLLM_PRECOMPILED_WHEEL_LOCATION=... line:
```diff
 VLLM_PRECOMPILED_WHEEL_LOCATION="https://vllm-wheels.s3.us-west-2.amazonaws.com/${VLLM_REF}/vllm-0.10.1.1-cp38-abi3-manylinux1_x86_64.whl"
 VLLM_GIT_URL="https://github.com/vllm-project/vllm.git"
+
+# Validate the precompiled wheel URL early to avoid long source builds if it's missing
+if command -v curl >/dev/null 2>&1; then
+    if ! curl -fsI "${VLLM_PRECOMPILED_WHEEL_LOCATION}" >/dev/null; then
+        echo "Warning: Precompiled vLLM wheel not found at ${VLLM_PRECOMPILED_WHEEL_LOCATION}. Falling back to build-from-source."
+        unset VLLM_PRECOMPILED_WHEEL_LOCATION
+    fi
+fi
```
137-141: ARM64 compatibility – build vLLM from source
On aarch64, vLLM's prebuilt wheels target x86_64 and won't work with torch==2.7.1+cu128/torchvision==0.22.1. To ensure the pinned versions function:
• Detect the ARM64 architecture in container/deps/vllm/install_vllm.sh (around lines 137-141).
• After installing torch==2.7.1+cu128 and torchvision==0.22.1, invoke vLLM's source-build flow (e.g. use_existing_torch.py or pip install -e .) so vLLM compiles against the installed PyTorch.
• Add a clear comment or conditional branch that points users to the vLLM ARM64 build docs: https://docs.vllm.ai/en/stable/getting_started/installation.html#build-from-source
Example snippet update:
```diff
 if ! uv pip install torch==2.7.1+cu128 torchaudio==2.7.1 torchvision==0.22.1 --index-url https://download.pytorch.org/whl; then
     echo "Pinned PyTorch install failed"
     exit 1
 fi
+# On aarch64, compile vLLM from source to link against this torch install:
+# python use_existing_torch.py  # see vLLM ARM64 build docs
```
This ensures anyone running on ARM64 knows to rebuild vLLM for compatibility rather than relying on unavailable prebuilt wheels.
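Building on that suggestion, here is a minimal sketch of such a conditional branch, assuming uname-based architecture detection and a checked-out vllm source tree; the exact variable handling and install flags are illustrative, not the script's actual ones:
```bash
# Illustrative only: on aarch64, rebuild vLLM against the already-installed PyTorch
# instead of relying on the x86_64-only precompiled wheel.
if [ "$(uname -m)" = "aarch64" ]; then
    echo "aarch64 detected: prebuilt vLLM wheels target x86_64; building from source."
    echo "See https://docs.vllm.ai/en/stable/getting_started/installation.html#build-from-source"
    unset VLLM_PRECOMPILED_WHEEL_LOCATION   # make sure the x86_64 wheel is not picked up
    python use_existing_torch.py            # reuse the pinned torch==2.7.1+cu128 install
    uv pip install -e . --no-build-isolation
fi
```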
📜 Review details
Configuration used: Path: .coderabbit.yaml
Review profile: CHILL
Plan: Pro
💡 Knowledge Base configuration:
- MCP integration is disabled by default for public repositories
- Jira integration is disabled by default for public repositories
- Linear integration is disabled by default for public repositories
You can enable these sources in your CodeRabbit configuration.
📒 Files selected for processing (3)
- container/Dockerfile.vllm (1 hunks)
- container/deps/vllm/install_vllm.sh (1 hunks)
- pyproject.toml (1 hunks)
🧰 Additional context used
🧠 Learnings (2)
📚 Learning: 2025-08-18T16:52:15.659Z
Learnt from: nnshah1
PR: ai-dynamo/dynamo#2489
File: container/deps/vllm/install_vllm.sh:151-152
Timestamp: 2025-08-18T16:52:15.659Z
Learning: The VLLM_PRECOMPILED_WHEEL_LOCATION environment variable, when exported, automatically triggers vLLM's build system to use the precompiled wheel instead of building from source, even when using standard `uv pip install .` commands in container/deps/vllm/install_vllm.sh.
Applied to files:
container/deps/vllm/install_vllm.sh
📚 Learning: 2025-08-18T16:52:15.659Z
Learnt from: nnshah1
PR: ai-dynamo/dynamo#2489
File: container/deps/vllm/install_vllm.sh:151-152
Timestamp: 2025-08-18T16:52:15.659Z
Learning: The VLLM_PRECOMPILED_WHEEL_LOCATION environment variable is an official vLLM environment variable that, when exported, automatically triggers vLLM's build system to use the specified precompiled wheel instead of building from source. This works even with standard `uv pip install .` commands without requiring explicit reference to the variable in the install command. The vLLM build system internally detects and uses this environment variable.
Applied to files:
container/deps/vllm/install_vllm.sh
⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (1)
- GitHub Check: Build and Test - dynamo
🔇 Additional comments (5)
pyproject.toml (1)
55-59: Consistency check passed – all vLLM references updated to 0.10.1.1
No stale vllm==0.10.1 entries remain; all references point to 0.10.1.1:
- pyproject.toml (line 58)
- container/deps/vllm/install_vllm.sh (line 26)
- container/Dockerfile.vllm (line 20)
No further action required.
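One way to re-run this consistency check locally (a sketch; assumes a POSIX shell at the repository root):
```bash
# Any line printed here would be a stale 0.10.1 pin that still needs updating.
grep -rn "0\.10\.1" pyproject.toml container/Dockerfile.vllm container/deps/vllm/install_vllm.sh \
  | grep -v "0\.10\.1\.1"
```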
container/Dockerfile.vllm (2)
19-21: Release notes comment updated — LGTM. The comment and URL now reference v0.10.1.1 and align with the ARG. No functional impact.
16-16: Confirmed: VLLM_REF correctly pins v0.10.1.1
The ARG VLLM_REF="1da94e673c257373280026f75ceb4effac80e892" on line 16 of container/Dockerfile.vllm exactly matches the v0.10.1.1 tag (pointing to commit 1da94e673c257373280026f75ceb4effac80e892, tagged Aug 20, 2025). No changes needed.
• container/Dockerfile.vllm: line 16 — ARG VLLM_REF="1da94e673c257373280026f75ceb4effac80e892"
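For anyone re-verifying the pin, a quick check (a sketch; requires network access to github.com):
```bash
# Resolve the v0.10.1.1 tag on the vLLM remote; the dereferenced ^{} entry is the
# commit the tag points at and should equal the VLLM_REF value in the Dockerfile.
git ls-remote https://github.com/vllm-project/vllm.git "refs/tags/v0.10.1.1*"
```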
container/deps/vllm/install_vllm.sh (2)
160-166: Relying on VLLM_PRECOMPILED_WHEEL_LOCATION is correct per vLLM behavior. Exporting the env var before uv pip install is sufficient; the vLLM build picks it up automatically. Thanks for keeping this aligned with the documented behavior.
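A minimal sketch of that pattern, assuming VLLM_REF is already set and the vllm source tree is checked out (the exact invocation in install_vllm.sh may differ):
```bash
# vLLM's build system detects this official env var and installs the precompiled
# wheel instead of compiling from source, even with a plain `uv pip install .`.
export VLLM_PRECOMPILED_WHEEL_LOCATION="https://vllm-wheels.s3.us-west-2.amazonaws.com/${VLLM_REF}/vllm-0.10.1.1-cp38-abi3-manylinux1_x86_64.whl"
cd vllm
uv pip install .
```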
154-159: Confirmed: openai==1.99.9 is available on PyPI
openai version 1.99.9 was published on PyPI on August 12, 2025 and can be installed directly, so pinning this exact version in container/deps/vllm/install_vllm.sh is safe. (pypi.org)
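A quick way to re-check the pin without installing anything (a sketch; `pip index` requires a reasonably recent pip, and the PyPI JSON API can be queried with plain curl):
```bash
# List the versions PyPI currently offers for openai; 1.99.9 should be present.
pip index versions openai
# Or hit the release's JSON endpoint directly; a 200 response means the exact version exists.
curl -fsS https://pypi.org/pypi/openai/1.99.9/json >/dev/null && echo "openai 1.99.9 is on PyPI"
```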
/ok to test 30a70d0
Signed-off-by: Hannah Zhang <[email protected]>
Signed-off-by: Jason Zhou <[email protected]>
Signed-off-by: Krishnan Prashanth <[email protected]>
Signed-off-by: nnshah1 <[email protected]>
Overview:
Details:
Where should the reviewer start?
Related Issues: (use one of the action keywords Closes / Fixes / Resolves / Relates to)
Summary by CodeRabbit