
[build] fix cu130 related release pipeline steps and publish as nightly image #32522

Merged

khluu merged 2 commits into vllm-project:main from Harry-Chen:cuda13-image-fix on Jan 18, 2026

Conversation

@Harry-Chen (Member) commented Jan 17, 2026

Purpose

After #31032 was merged, we received some feedback from NVIDIA, mainly #31032 (review):

> @Harry-Chen if you are going to take #31822 and merge it as your own, could you at least check out the differences and consult us (NVIDIA) about why those differences are there?

This was indeed my oversight, so I have cherry-picked some fixes from that comment and #31822. I have also extracted push-nightly-builds.sh to avoid duplicating commands when uploading to Docker Hub.

Credit: @csahithi (script modifications), @wangshangsam.
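The extracted script is not shown in this thread; as an illustration only, here is a minimal sketch of what a shared push script might look like. The script name comes from the PR, but the `nightly_tag`/`push_nightly` helper names and the exact tag scheme are assumptions:

```shell
#!/usr/bin/env bash
# Hypothetical sketch of a shared push-nightly-builds.sh -- helper names
# and the tag scheme are assumptions, not the real script's contents.
set -euo pipefail

# Compose a nightly tag for a commit, with an optional variant infix
# such as "cu130" (pure string logic, testable without Docker).
nightly_tag() {
  local commit="$1" variant="${2:-}"
  if [ -n "$variant" ]; then
    printf 'nightly-%s-%s\n' "$variant" "$commit"
  else
    printf 'nightly-%s\n' "$commit"
  fi
}

# Print the docker commands that would tag and push the image; they are
# echoed rather than executed so the sketch is safe outside the pipeline.
push_nightly() {
  local repo="$1" commit="$2" variant="${3:-}"
  local tag
  tag="$(nightly_tag "$commit" "$variant")"
  echo "docker tag ${repo}:${commit} ${repo}:${tag}"
  echo "docker push ${repo}:${tag}"
}
```

With this shape, the default nightly step and a cu130 step can share one script and differ only in the variant argument, which is the duplication the PR description says it removes.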

Test Plan

I will trigger a release pipeline run to see if everything works.

Test Result



…s.sh

Signed-off-by: Shengqi Chen <harry-chen@outlook.com>
Copilot AI review requested due to automatic review settings January 17, 2026 14:33
@mergify mergify bot added the ci/build label Jan 17, 2026
gemini-code-assist bot (Contributor) left a comment

Code Review

This pull request addresses feedback on the CUDA 13.0 release pipeline by updating the CUDA version and torch compute capabilities. It also refactors the nightly image publishing logic into a reusable shell script, which improves maintainability and adds a dedicated nightly build for CUDA 13.0. The changes are well-structured and align with the stated purpose. I've added a couple of suggestions to enhance the robustness of the new and modified shell scripts.

Copilot AI (Contributor) left a comment

Pull request overview

This PR fixes CUDA 13.0-related release pipeline steps based on feedback from NVIDIA. It addresses incorrect CUDA versions (13.0.2 → 13.0.1), extracts common Docker push logic into a reusable script, and adds support for publishing CUDA 13.0 as a separate nightly image variant.

Changes:

  • Corrected CUDA version from 13.0.2 to 13.0.1 for cu130 builds (aligning with existing codebase standards)
  • Removed FLASHINFER_AOT_COMPILE build argument for CUDA 13.0 builds
  • Added compute capability 12.1 support for DGX Spark in arm64 CUDA 13.0 builds
  • Extracted nightly build push logic into reusable push-nightly-builds.sh script
  • Made cleanup-nightly-builds.sh parameterizable to support different tag prefixes
  • Added new pipeline step to publish CUDA 13.0 variant as separate nightly image
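The parameterized cleanup described above could look roughly like this. This is a sketch only: the real cleanup-nightly-builds.sh talks to the Docker Hub registry, whereas this helper only filters an already-fetched tag list, and the function name is an assumption:

```shell
#!/usr/bin/env bash
# Hypothetical sketch: decide which tags a cleanup run should delete,
# given a tag prefix passed as a parameter (e.g. "nightly" or
# "nightly-cu130"). The real script would fetch tags from Docker Hub
# and issue delete requests; here we only do the prefix filtering.
set -euo pipefail

# Read tag names on stdin, print only those matching "<prefix>-*".
select_tags_for_cleanup() {
  local prefix="$1"
  grep -E "^${prefix}-" || true   # no matches is not an error
}
```

Passing the prefix as a parameter is what lets one script clean up both the default nightly tags and variant-specific ones such as cu130.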

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 4 comments.

File Description
.buildkite/scripts/push-nightly-builds.sh New script that consolidates Docker image tagging and pushing logic for nightly builds, supporting optional tag variants like "cu130"
.buildkite/scripts/cleanup-nightly-builds.sh Enhanced to accept tag prefix parameter for cleaning up variant-specific nightly builds
.buildkite/release-pipeline.yaml Updated CUDA 13.0 build configurations, refactored nightly build steps to use new script, added CUDA 13.0 nightly build step


Signed-off-by: Shengqi Chen <harry-chen@outlook.com>
@khluu khluu enabled auto-merge (squash) January 17, 2026 17:01
@github-actions github-actions bot added the ready ONLY add when PR is ready to merge/full CI is needed label Jan 17, 2026
@khluu khluu merged commit 965765a into vllm-project:main Jan 18, 2026
18 checks passed
khluu pushed a commit that referenced this pull request Jan 18, 2026
…ly image (#32522)

Signed-off-by: Shengqi Chen <harry-chen@outlook.com>
(cherry picked from commit 965765a)
@wangshangsam (Collaborator) commented

Thanks a lot, @Harry-Chen! I apologize for the intense expression of my frustration.

I see that the CI hasn't been uploading the nightly images to Docker Hub for a few days: https://hub.docker.com/r/vllm/vllm-openai/tags I'm wondering if you are aware of what's happening?

@Harry-Chen (Member, Author) commented Jan 20, 2026

> Thanks a lot, @Harry-Chen! I apologize for the intense expression of my frustration.
>
> I see that the CI hasn't been uploading the nightly images to Docker Hub for a few days: https://hub.docker.com/r/vllm/vllm-openai/tags I'm wondering if you are aware of what's happening?

Yes, this was due to a permissions issue, which @khluu has now fixed. Thanks for the reminder.

@wangshangsam (Collaborator) commented Jan 20, 2026

> Yes, this was due to a permissions issue, which @khluu has now fixed. Thanks for the reminder.

Thanks! I see that there are v0.14.0-aarch64-cu130 and v0.14.0-x86_64-cu130 tags now for the v0.14 release, but where are the nightly cu130- images?

@Harry-Chen (Member, Author) commented

> Thanks! I see that there are v0.14.0-aarch64-cu130 and v0.14.0-x86_64-cu130 tags now for the v0.14 release, but where are the nightly cu130- images?

The cu130 nightly pipelines need a manual trigger to run; I triggered one on yesterday's nightly run. @khluu do you think we should remove the block and let it run automatically?
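For context, the manual trigger being discussed here is a Buildkite `block` step, which pauses the pipeline until someone unblocks it; removing it would let the cu130 nightly publish run unconditionally. A rough sketch of the shape, purely illustrative (the keys, labels, and script argument are assumptions, not the actual release-pipeline.yaml contents):

```yaml
# Hypothetical fragment: a block step gating the cu130 nightly publish.
- block: "Publish cu130 nightly image"   # manual trigger; deleting this
  key: block-cu130-nightly               # step makes the publish automatic

- label: "Publish nightly image (cu130)"
  depends_on: block-cu130-nightly        # drop alongside the block step
  commands:
    - bash .buildkite/scripts/push-nightly-builds.sh cu130
```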

@Harry-Chen Harry-Chen deleted the cuda13-image-fix branch January 21, 2026 03:53
@wangshangsam (Collaborator) commented
It would be very nice if we could remove the block and let it run automatically, which makes our lives easier when testing the latest changes on all the Blackwell platforms.

dsuhinin pushed a commit to dsuhinin/vllm that referenced this pull request Jan 21, 2026
…ly image (vllm-project#32522)

Signed-off-by: Shengqi Chen <harry-chen@outlook.com>
Signed-off-by: dsuhinin <suhinin.dmitriy@gmail.com>
@simon-mo (Collaborator) commented

Running it automatically makes sense to me.

ItzDEXX pushed a commit to ItzDEXX/vllm that referenced this pull request Feb 19, 2026
…ly image (vllm-project#32522)

Signed-off-by: Shengqi Chen <harry-chen@outlook.com>
dtrifiro added a commit to dtrifiro/vllm that referenced this pull request Mar 9, 2026
- [build] fix cu130 related release pipeline steps and publish as
nightly image (vllm-project#32522)
- [Misc] Replace urllib's `urlparse` with urllib3's `parse_url`
(vllm-project#32746)
- [Misc] Bump opencv-python dependency version to 4.13
(vllm-project#32668)
- [Bugfix] Fix Whisper/encoder-decoder GPU memory leak
(vllm-project#32789)
- [CI] fix version comparison and exclusion patterns in
upload-release-wheels.sh (vllm-project#32971)
- tokenizers: mistral: fix merge conflict
- `Dockerfile.tpu.ubi`: add `git` to allow `pip install git+https`

Labels: ci/build, ready

5 participants