Skip to content

[CI] refine more logic when generating and using nightly wheels & indices, add cuda130 build for aarch64, specify correct manylinux version#30341

Merged
youkaichao merged 5 commits into
vllm-project:mainfrom
Harry-Chen:nightly-wheel-warning
Dec 11, 2025
Merged

Conversation

@Harry-Chen
Copy link
Copy Markdown
Member

@Harry-Chen Harry-Chen commented Dec 9, 2025

Purpose

This PR further fixes some corner cases when generating and using nightly wheels / indices, including:

  • Automatic retry & friendly error messages in python-only installation test script
  • Exclusion of non-release wheels in a release wheel index.

It also:

  • enable cuda130 build on aarch64 for upcoming hardware support
  • allow specifying manylinux version upon wheel upload

Test Plan

Tested by CI.


Essential Elements of an Effective PR Description Checklist
  • The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
  • The test plan, such as providing test command.
  • The test results, such as pasting the results comparison before and after, or e2e results
  • (Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
  • (Optional) Release notes update. If your change is user facing, please update the release notes draft in the Google Doc.

Copilot AI review requested due to automatic review settings December 9, 2025 14:17
@chatgpt-codex-connector
Copy link
Copy Markdown

Codex usage limits have been reached for code reviews. Please check with the admins of this repo to increase the limits by adding credits.

@mergify mergify Bot added the ci/build label Dec 9, 2025
Copy link
Copy Markdown
Contributor

@gemini-code-assist gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request refines the logic for handling nightly and release wheels. The changes in .buildkite/scripts/generate-nightly-index.py to filter wheels for non-nightly versions are a good step, but the implementation has a flaw that could lead to including development wheels in release indices. I've suggested a more robust filtering logic. Additionally, the shell script for testing python-only compilation has been improved with a retry mechanism. However, the JSON validation step unnecessarily prints the entire file to the logs, which I've provided a fix for to improve log cleanliness and avoid potential issues with log size.

Comment thread .buildkite/scripts/generate-nightly-index.py Outdated
Comment thread tests/standalone_tests/python_only_compile.sh Outdated
Copy link
Copy Markdown
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

This PR refines the CI logic for generating and using nightly wheels and indices, focusing on improved error handling and proper filtering of release wheels.

Key Changes:

  • Added automatic retry logic (5 attempts, 3-minute intervals) with user-friendly error messages for metadata.json URL validation in the python-only installation test
  • Implemented filtering to exclude dev wheels from release wheel indices, ensuring only official release wheels are included
  • Enhanced error messages throughout with consistent severity prefixes (INFO, WARNING, ERROR, CRITICAL)

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 4 comments.

File Description
tests/standalone_tests/python_only_compile.sh Added retry logic with metadata.json URL validation and improved error messages for python-only compilation tests
.buildkite/scripts/upload-wheels.sh Added re-generation of indices specifically for release versions to exclude non-release wheels
.buildkite/scripts/generate-nightly-index.py Implemented version detection and filtering logic to exclude dev wheels from release indices

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Comment thread .buildkite/scripts/upload-wheels.sh Outdated
Comment thread tests/standalone_tests/python_only_compile.sh
Comment thread .buildkite/scripts/generate-nightly-index.py Outdated
Comment thread .buildkite/scripts/generate-nightly-index.py Outdated
…cript

Signed-off-by: Shengqi Chen <harry-chen@outlook.com>
…ating indices

Signed-off-by: Shengqi Chen <harry-chen@outlook.com>
Signed-off-by: Shengqi Chen <harry-chen@outlook.com>
Signed-off-by: Shengqi Chen <harry-chen@outlook.com>
Signed-off-by: Shengqi Chen <harry-chen@outlook.com>
@Harry-Chen Harry-Chen force-pushed the nightly-wheel-warning branch from 0d2dfe6 to b588306 Compare December 11, 2025 02:34
@Harry-Chen Harry-Chen changed the title [CI] refine more logic when generating and using nightly wheels & indices [CI] refine more logic when generating and using nightly wheels & indices, add cuda130 build for aarch64, specify correct manylinux version Dec 11, 2025
@mergify mergify Bot added the nvidia label Dec 11, 2025
@Harry-Chen Harry-Chen requested a review from Copilot December 11, 2025 02:38
Copy link
Copy Markdown
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Copilot encountered an error and was unable to review this pull request. You can try again by re-requesting a review.

@github-project-automation github-project-automation Bot moved this to In review in NVIDIA Dec 11, 2025
@youkaichao youkaichao enabled auto-merge (squash) December 11, 2025 13:50
@github-actions github-actions Bot added the ready ONLY add when PR is ready to merge/full CI is needed label Dec 11, 2025
@youkaichao youkaichao disabled auto-merge December 11, 2025 16:42
@youkaichao youkaichao merged commit 305b168 into vllm-project:main Dec 11, 2025
25 of 28 checks passed
@github-project-automation github-project-automation Bot moved this from In review to Done in NVIDIA Dec 11, 2025
@geerlingguy
Copy link
Copy Markdown

This is also useful for running on a Raspberry Pi with multiple GPUs, so thank you!

@Harry-Chen Harry-Chen deleted the nightly-wheel-warning branch December 22, 2025 05:00
Majid-Taheri pushed a commit to Majid-Taheri/vllm that referenced this pull request Dec 23, 2025
…ices, add cuda130 build for aarch64, specify correct manylinux version (vllm-project#30341)

Signed-off-by: Shengqi Chen <harry-chen@outlook.com>
Signed-off-by: Ubuntu <mjtaheri68@gmail.com>
mystous pushed a commit to mystous/vllm_hybrid that referenced this pull request May 10, 2026
…ices, add cuda130 build for aarch64, specify correct manylinux version (vllm-project#30341)

Signed-off-by: Shengqi Chen <harry-chen@outlook.com>
my-other-github-account pushed a commit to my-other-github-account/vllm that referenced this pull request May 15, 2026
…ices, add cuda130 build for aarch64, specify correct manylinux version (vllm-project#30341)

Signed-off-by: Shengqi Chen <harry-chen@outlook.com>
my-other-github-account pushed a commit to my-other-github-account/vllm that referenced this pull request May 15, 2026
…ices, add cuda130 build for aarch64, specify correct manylinux version (vllm-project#30341)

Signed-off-by: Shengqi Chen <harry-chen@outlook.com>
0826joyce pushed a commit to 0826joyce/vllm-serving-optimization that referenced this pull request May 19, 2026
…ices, add cuda130 build for aarch64, specify correct manylinux version (vllm-project#30341)

Signed-off-by: Shengqi Chen <harry-chen@outlook.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ci/build nvidia ready ONLY add when PR is ready to merge/full CI is needed

Projects

Status: Done

Development

Successfully merging this pull request may close these issues.

4 participants