Add Ubuntu 24.04 support for Docker builds #35386
Conversation
Code Review
The pull request successfully adds support for Ubuntu 24.04 across Docker builds, including new build arguments, parameterized base images, and updated release pipeline steps. The fix for the EXTERNALLY-MANAGED pip issue is crucial for compatibility with newer Ubuntu versions. However, there's an inconsistency in the .buildkite/release-pipeline.yaml regarding the BUILD_BASE_IMAGE for CUDA 13.0 Ubuntu 24.04 builds.
docker/Dockerfile
&& rm -f /usr/lib/python${PYTHON_VERSION}/EXTERNALLY-MANAGED \
&& rm -rf /usr/lib/python3/dist-packages/pip /usr/lib/python3/dist-packages/pip-*.dist-info \
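The deleted marker is the PEP 668 "externally managed environment" flag: while it exists under the interpreter's lib directory, `pip install` into the system Python refuses to run on Ubuntu 23.04+. A minimal sketch of what the `rm -f` accomplishes, simulated in a scratch directory rather than the real system path:

```shell
# Simulate the Dockerfile fix in a temp dir (the real path is
# /usr/lib/python${PYTHON_VERSION}/EXTERNALLY-MANAGED).
sysroot="$(mktemp -d)"
touch "$sysroot/EXTERNALLY-MANAGED"   # Ubuntu 24.04 ships this PEP 668 marker

rm -f "$sysroot/EXTERNALLY-MANAGED"   # the fix: delete the marker

if [ -e "$sysroot/EXTERNALLY-MANAGED" ]; then
  echo "system pip installs blocked"
else
  echo "system pip installs allowed"
fi
```

`rm -f` also makes the step a no-op on older Ubuntu releases where the marker never existed.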
.buildkite/release-pipeline.yaml
queue: arm64_cpu_queue_postmerge
commands:
- "aws ecr-public get-login-password --region us-east-1 | docker login --username AWS --password-stdin public.ecr.aws/q9t5s3a7"
- "DOCKER_BUILDKIT=1 docker build --build-arg max_jobs=16 --build-arg USE_SCCACHE=1 --build-arg GIT_REPO_CHECK=1 --build-arg CUDA_VERSION=13.0.1 --build-arg UBUNTU_VERSION=24.04 --build-arg GDRCOPY_OS_VERSION=Ubuntu24_04 --build-arg torch_cuda_arch_list='8.7 8.9 9.0 10.0+PTX 12.0 12.1' --build-arg INSTALL_KV_CONNECTORS=true --build-arg BUILD_BASE_IMAGE=nvidia/cuda:13.0.1-devel-ubuntu22.04 --tag public.ecr.aws/q9t5s3a7/vllm-release-repo:$BUILDKITE_COMMIT-$(uname -m)-cu130-ubuntu2404 --target vllm-openai --progress plain -f docker/Dockerfile ."
Similar to the x86_64 build, the BUILD_BASE_IMAGE for the aarch64 CUDA 13.0 Ubuntu 24.04 build is explicitly set to nvidia/cuda:13.0.1-devel-ubuntu22.04. It should be updated to the ubuntu24.04 variant for consistency with the UBUNTU_VERSION=24.04 argument, or adjusted to match the intended base-image strategy.
- "DOCKER_BUILDKIT=1 docker build --build-arg max_jobs=16 --build-arg USE_SCCACHE=1 --build-arg GIT_REPO_CHECK=1 --build-arg CUDA_VERSION=13.0.1 --build-arg UBUNTU_VERSION=24.04 --build-arg GDRCOPY_OS_VERSION=Ubuntu24_04 --build-arg torch_cuda_arch_list='8.7 8.9 9.0 10.0+PTX 12.0 12.1' --build-arg INSTALL_KV_CONNECTORS=true --build-arg BUILD_BASE_IMAGE=nvidia/cuda:13.0.1-devel-ubuntu24.04 --tag public.ecr.aws/q9t5s3a7/vllm-release-repo:$BUILDKITE_COMMIT-$(uname -m)-cu130-ubuntu2404 --target vllm-openai --progress plain -f docker/Dockerfile ."
- Add UBUNTU_VERSION build arg to Dockerfile, defaulting to 22.04
- Parameterize FINAL_BASE_IMAGE to use UBUNTU_VERSION
- Fix pip EXTERNALLY-MANAGED issue for newer Ubuntu versions
- Add Ubuntu 24.04 build targets in docker-bake.hcl
- Add Ubuntu 24.04 release pipeline steps for x86_64 and aarch64 (CUDA 12.9 and 13.0) with multi-arch manifests

Signed-off-by: aasgaonkar <aasgaonkar@nvidia.com>
Update BUILD_BASE_IMAGE from ubuntu22.04 to ubuntu24.04 for the CUDA 13.0 + Ubuntu 24.04 release pipeline steps to be consistent with the UBUNTU_VERSION=24.04 build arg. Signed-off-by: aasgaonkar <aasgaonkar@nvidia.com>
Three bug fixes found during local build testing of Ubuntu 24.04 images:
1. docker/Dockerfile (base stage): Install python${PYTHON_VERSION}-dev before
apt cache cleanup. Ubuntu 24.04 ships cmake 3.28 which requires the
Development.SABIModule component; without Python headers the csrc-build
stage fails with "Could NOT find Python (missing: Python_INCLUDE_DIRS
Development.SABIModule)". The install is best-effort (|| true) so it
silently no-ops on Ubuntu 20.04/22.04 where the package is not in the
default repos.
2. docker/Dockerfile (vllm-base stage): Remove python3-pip from apt deps.
On Ubuntu 24.04, apt installs pip 24.0 without a RECORD file, causing
get-pip.py to fail with "Cannot uninstall pip 24.0: no RECORD file was
found". Removing python3-pip from apt lets get-pip.py install pip fresh
with no conflict.
3. .buildkite/release-pipeline.yaml: Add FLASHINFER_AOT_COMPILE=true to
the CUDA 13.0 + Ubuntu 24.04 build steps (x86_64 and aarch64). It was
already set on the CUDA 12.9 + Ubuntu 24.04 steps; without it the
CUDA 13.0 Ubuntu 24.04 images silently fall back to slow JIT compilation
at runtime.
Signed-off-by: aasgaonkar <aasgaonkar@nvidia.com>
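The best-effort install described in fix 1 can be sketched in plain shell (the version value below is only an example; in the Dockerfile `PYTHON_VERSION` is a build arg):

```shell
# Best-effort install: `|| true` turns a failed install (package absent from
# the release's default repos, as on Ubuntu 20.04/22.04) into a no-op, so the
# subsequent apt cache cleanup step still runs.
PYTHON_VERSION=3.12   # example value; a build arg in the real Dockerfile
apt-get install -y "python${PYTHON_VERSION}-dev" || true
install_rc=$?         # always 0 thanks to the || true guard
echo "apt cache cleanup can proceed"
```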
- Add torch_cuda_arch_list='8.7 8.9 9.0 10.0+PTX 12.0' to x86_64 CUDA 12.9 Ubuntu 24.04 release build, matching aarch64 and existing Ubuntu 22.04 builds
- Add torch_cuda_arch_list='8.7 8.9 9.0 10.0+PTX 12.0 12.1' to x86_64 CUDA 13.0 Ubuntu 24.04 release build, matching aarch64 counterpart
- Add FLASHINFER_AOT_COMPILE=true to test-ubuntu2404 and openai-ubuntu2404 docker-bake.hcl targets to match CI pipeline and avoid silent JIT fallback

Signed-off-by: aasgaonkar <aasgaonkar@nvidia.com>
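The docker-bake.hcl change above could look roughly like the following sketch (target name and arg keys are taken from this PR; the surrounding target fields such as `inherits` are omitted and the overall shape is an assumption, not the actual file contents):

```hcl
# Hypothetical docker-bake.hcl fragment for one of the new targets.
target "openai-ubuntu2404" {
  # inherits/contexts/tags omitted; only args named in this PR are shown.
  args = {
    UBUNTU_VERSION         = "24.04"
    GDRCOPY_OS_VERSION     = "Ubuntu24_04"
    FLASHINFER_AOT_COMPILE = "true"   # avoids silent JIT fallback at runtime
  }
}
```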
Purpose
Adds Ubuntu 24.04 as an opt-in build target for vLLM Docker release images, closing #35118.
Changes:
- docker/Dockerfile: Add `UBUNTU_VERSION` ARG (default `22.04`) to parameterize `FINAL_BASE_IMAGE`. Install `python${PYTHON_VERSION}-dev` before apt cache cleanup (required by cmake 3.28 on Ubuntu 24.04 for `Development.SABIModule`). Remove `python3-pip` from the final stage's apt deps to avoid a conflict with `get-pip.py` on Ubuntu 24.04 (pip 24.0 ships without a RECORD file). Remove the `EXTERNALLY-MANAGED` marker to allow pip installs into the system Python.
- docker/docker-bake.hcl: Add `test-ubuntu2404` and `openai-ubuntu2404` targets with `UBUNTU_VERSION=24.04`, `GDRCOPY_OS_VERSION=Ubuntu24_04`, and `FLASHINFER_AOT_COMPILE=true`.
- docker/versions.json: Add `UBUNTU_VERSION` with default `"22.04"`.
- .buildkite/release-pipeline.yaml: Add 4 new release pipeline steps (x86_64 + aarch64 for CUDA 12.9 and 13.0, all with Ubuntu 24.04) and 2 multi-arch manifest steps. CUDA 13.0 steps explicitly pass `BUILD_BASE_IMAGE=nvidia/cuda:13.0.1-devel-ubuntu24.04` since NVIDIA does not publish CUDA 13.x devel images for Ubuntu 20.04 (the Dockerfile default).
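The final-stage pip change can be sketched as a Dockerfile fragment (the apt package list here is abridged and illustrative; only the absence of `python3-pip` and the `get-pip.py` step reflect this PR):

```dockerfile
# vllm-base sketch: python3-pip is deliberately NOT in the apt list, so the
# only pip on the system is the one get-pip.py installs. Installing the apt
# copy first would make get-pip.py fail on Ubuntu 24.04 with
# "Cannot uninstall pip 24.0: no RECORD file was found".
RUN apt-get update -y \
    && apt-get install -y python3 curl \
    && curl -sS https://bootstrap.pypa.io/get-pip.py | python3
```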
Test Plan
Built and tested all 4 Ubuntu 24.04 image variants locally:
CUDA 12.9 + Ubuntu 24.04
docker build --build-arg CUDA_VERSION=12.9.1 --build-arg UBUNTU_VERSION=24.04 \
--build-arg GDRCOPY_OS_VERSION=Ubuntu24_04 --build-arg FLASHINFER_AOT_COMPILE=true \
--target vllm-openai -f docker/Dockerfile .
CUDA 13.0 + Ubuntu 24.04
docker build --build-arg CUDA_VERSION=13.0.1 --build-arg UBUNTU_VERSION=24.04 \
--build-arg GDRCOPY_OS_VERSION=Ubuntu24_04 --build-arg FLASHINFER_AOT_COMPILE=true \
--build-arg BUILD_BASE_IMAGE=nvidia/cuda:13.0.1-devel-ubuntu24.04 \
--target vllm-openai -f docker/Dockerfile .
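For reference, the release image tag the pipeline pushes is composed from the Buildkite commit, the machine architecture, the CUDA series, and the Ubuntu version. A small sketch with placeholder values (the commit hash below is made up; in CI `BUILDKITE_COMMIT` is set by Buildkite and the architecture comes from `uname -m`):

```shell
# Compose the release tag the same way the pipeline commands do:
#   $BUILDKITE_COMMIT-$(uname -m)-cu130-ubuntu2404
BUILDKITE_COMMIT=abc1234   # placeholder value
ARCH=x86_64                # the pipeline uses $(uname -m)
TAG="public.ecr.aws/q9t5s3a7/vllm-release-repo:${BUILDKITE_COMMIT}-${ARCH}-cu130-ubuntu2404"
echo "$TAG"
```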
Test Result
Test: `vllm.entrypoints.openai.api_server` with `facebook/opt-125m`, prompt `"The capital of France is"`, `max_tokens=20`, `temperature=0`. Multi-GPU test uses `--tensor-parallel-size 4` across 4×A100. Ubuntu 22.04 rows verify no regression in existing builds.
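The smoke test described above can be driven with a request like the following against vLLM's OpenAI-compatible completions endpoint (the host/port are assumptions; the curl call is commented out because it needs a running server):

```shell
# Build the request body mirroring the parameters used in the test.
PAYLOAD='{"model": "facebook/opt-125m", "prompt": "The capital of France is", "max_tokens": 20, "temperature": 0}'
echo "$PAYLOAD"
# curl -s http://localhost:8000/v1/completions \
#   -H 'Content-Type: application/json' -d "$PAYLOAD"
```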