[Core] Update PyTorch to 2.12.0, torchvision to 0.27.0, triton to 3.7.0 by atalman · Pull Request #42848 · vllm-project/vllm

atalman · 2026-05-16T18:46:15Z

Purpose

Update the PyTorch ecosystem to the released versions:

torch: 2.11.0 → 2.12.0
torchvision: 0.26.0 → 0.27.0
triton: 3.6.0 → 3.7.0
torchaudio: stays at 2.11.0

This PR supersedes #40077. Now that the torch 2.12.0 release is published on PyPI / download.pytorch.org/whl/, no temporary whl/test/ index URLs are needed — wheels resolve from the regular indexes.

CUDA 13 transitive deps

Bumped to match torch==2.12.0+cu130:

nvidia-cudnn-cu13: 9.19.0.56 → 9.20.0.48
nvidia-cusparselt-cu13: 0.8.0 → 0.8.1
nvidia-nccl-cu13: 2.28.9 → 2.29.7
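
A minimal sketch of how the pins listed above could be checked against a local environment. It is not part of this PR: the helper names (`base_version`, `installed_versions`, `find_mismatches`) are hypothetical, and only the package names and version strings come from the description. The local build tag (for example `+cu130`) is stripped before comparing.

```python
from importlib import metadata

# Pins copied from the PR description above.
EXPECTED_PINS = {
    "torch": "2.12.0",
    "torchvision": "0.27.0",
    "triton": "3.7.0",
    "torchaudio": "2.11.0",
    "nvidia-cudnn-cu13": "9.20.0.48",
    "nvidia-cusparselt-cu13": "0.8.1",
    "nvidia-nccl-cu13": "2.29.7",
}


def base_version(version):
    """Strip a local build tag, so '2.12.0+cu130' compares equal to '2.12.0'."""
    return version.split("+", 1)[0]


def installed_versions(names):
    """Look up each distribution; None means it is not installed."""
    found = {}
    for name in names:
        try:
            found[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            found[name] = None
    return found


def find_mismatches(installed, expected=EXPECTED_PINS):
    """Return {name: (expected, installed)} for every pin that does not match."""
    mismatches = {}
    for name, want in expected.items():
        have = installed.get(name)
        if have is None or base_version(have) != want:
            mismatches[name] = (want, have)
    return mismatches
```

Calling `find_mismatches(installed_versions(EXPECTED_PINS))` returns an empty dict when every pin matches. On non-CUDA builds the `nvidia-*-cu13` entries would be reported as missing, so a real check would likely filter the pin set per platform.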

CPU compatibility test fix

.buildkite/scripts/hardware_ci/run-cpu-compatibility-test.sh previously set TORCH_COMPILE_DISABLE=1 to skip torch.compile (slow under SDE). On torch 2.11 this turned every torch.compile call site into a silent no-op. On torch 2.12, call sites that pass fullgraph=True now raise:

RuntimeError: Worker failed with error 'torch.compile with fullgraph=True
found no compiled frames. The frame was likely skipped (...).'

Engine init goes through vLLM's piecewise-compile path (which uses fullgraph=True), so init crashes inside determine_available_memory. Switched to vLLM's canonical --enforce-eager engine flag, which never constructs a torch.compile wrapper at all — same SDE speedup, no contract violation, works on both torch 2.11 and 2.12.

Tracked upstream as pytorch/pytorch#181247 (under umbrella pytorch/pytorch#180899).
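
The switch described above can be sketched as a small launcher helper. This is an illustration only, not the contents of run-cpu-compatibility-test.sh: the function name `eager_launch` and the model name in the usage note are hypothetical placeholders, while `--enforce-eager` and `TORCH_COMPILE_DISABLE` are the flag and variable named in this PR.

```python
import os


def eager_launch(cmd, env=None):
    """Return (cmd, env) for an eager-mode vLLM launch.

    TORCH_COMPILE_DISABLE is dropped from the environment and
    --enforce-eager is appended to the command if it is not already there.
    """
    # Copy so the caller's environment and argument list are left untouched.
    env = dict(os.environ if env is None else env)
    env.pop("TORCH_COMPILE_DISABLE", None)

    cmd = list(cmd)
    if "--enforce-eager" not in cmd:
        cmd.append("--enforce-eager")
    return cmd, env
```

For example, `eager_launch(["vllm", "serve", "facebook/opt-125m"])` yields a command ending in `--enforce-eager` and an environment without `TORCH_COMPILE_DISABLE`, which can then be passed to `subprocess.run(cmd, env=env)`.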

Test Plan

CI sign-off — Buildkite full daily / full nightly runs on this branch:

CUDA build + tests (CUDA 13)
CPU build + tests (x86_64, aarch64, s390x)
ROCm build
CPU SDE compatibility test (Skylake / Cascade Lake / Cooper Lake)

Test Result

To be filled in once CI completes on this branch.

Duplicate-work check

[WIP][Core] Update PyTorch to 2.12.0, torchvision to 0.27.0, triton to 3.7.0 #40077 — same author's WIP draft for the same upgrade, kept open while torch 2.12 was on whl/test/. This PR is the clean release-channel version and supersedes it.
[WIP][XPU] upgrade torch-xpu to 2.12 #42262 — XPU-only torch-xpu==2.12 upgrade. Different scope; this PR doesn't touch XPU.
Update gpu.xpu.inc.md to use triton-xpu 3.7.0 #39715 — XPU docs change for triton-xpu 3.7. Unrelated.

AI-assistance disclosure

AI assistance (Claude) was used to draft the changes. Every changed line was reviewed and the test plan above was constructed and run by a human submitter.

Essential Elements of an Effective PR Description Checklist

The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
The test plan, such as providing test command.
The test results, such as pasting the results comparison before and after, or e2e results
(Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.

gemini-code-assist

Code Review

This pull request updates the project's dependencies to support PyTorch 2.12.0 across various build and environment configurations, including CUDA, ROCm, CPU, and s390x. Key changes include version bumps for torch, torchvision, triton, and several NVIDIA libraries. Additionally, the hardware CI script for CPU compatibility was updated to use the --enforce-eager flag, replacing the TORCH_COMPILE_DISABLE environment variable to prevent crashes during engine initialization with the new PyTorch version. I have no further feedback to provide.

Update PyTorch ecosystem versions:
- torch: 2.11.0 -> 2.12.0
- torchvision: 0.26.0 -> 0.27.0
- triton: 3.6.0 -> 3.7.0
- torchaudio: stays at 2.11.0

Bump CUDA 13 deps to match torch 2.12.0+cu130:
- nvidia-cudnn-cu13: 9.19.0.56 -> 9.20.0.48
- nvidia-cusparselt-cu13: 0.8.0 -> 0.8.1
- nvidia-nccl-cu13: 2.28.9 -> 2.29.7

Use --enforce-eager instead of TORCH_COMPILE_DISABLE=1 in the CPU SDE compat test. On torch 2.11 TORCH_COMPILE_DISABLE turned torch.compile call sites into silent no-ops; on torch 2.12 sites that pass fullgraph=True now raise "found no compiled frames", which crashes engine init via vLLM's piecewise-compile path. --enforce-eager skips the wrapper entirely on both versions.

Supersedes vllm-project#40077 (release wheels are now published, so the download.pytorch.org/whl/test/ indexes are no longer needed).

Co-authored-by: Claude <noreply@anthropic.com>
Signed-off-by: atalman <atalman@meta.com>

mergify Bot added ci/build nvidia labels May 16, 2026

github-project-automation Bot added this to NVIDIA May 16, 2026

mergify Bot added the cpu Related to CPU backends label May 16, 2026

gemini-code-assist Bot reviewed May 16, 2026

atalman mentioned this pull request May 19, 2026

[vllm] [2.12 regression][multimodal] Qwen2-Audio text-then-audio_embeds: prompt_embeds vs raw-text outputs diverge under --enforce-eager pytorch/pytorch#184431

Open

atalman force-pushed the fix_release_212 branch from 47af9e1 to d2792bf Compare May 19, 2026 23:10

atalman force-pushed the fix_release_212 branch from d2792bf to 3fa7787 Compare May 20, 2026 20:49

atalman force-pushed the fix_release_212 branch from 3fa7787 to f64937a Compare May 21, 2026 22:38

Harry-Chen mentioned this pull request May 27, 2026

[WIP][Core] Update PyTorch to 2.12.0, torchvision to 0.27.0, triton to 3.7.0 #40077

Closed

4 tasks

Merge branch 'main' into fix_release_212

efed1a3

Harry-Chen mentioned this pull request Jun 2, 2026

[10/n] Migrate cuda_view and silu_and_mul_per_block_quant kernels to torch stable ABI. #44334

Merged

5 tasks

tdoublep mentioned this pull request Jun 3, 2026

Upgrade to PT 2.12 torch-spyre/torch-spyre#2218

Open

4 tasks

atalman added 2 commits June 3, 2026 14:36

Merge branch 'main' into fix_release_212

c35e6d2

Merge branch 'main' into fix_release_212

38227f3

atalman wants to merge 4 commits into vllm-project:main from atalman:fix_release_212