[Core] Update PyTorch to 2.12.0, torchvision to 0.27.0, triton to 3.7.0 #42848
Draft
atalman wants to merge 4 commits into
Code Review
This pull request updates the project's dependencies to support PyTorch 2.12.0 across various build and environment configurations, including CUDA, ROCm, CPU, and s390x. Key changes include version bumps for torch, torchvision, triton, and several NVIDIA libraries. Additionally, the hardware CI script for CPU compatibility was updated to use the --enforce-eager flag, replacing the TORCH_COMPILE_DISABLE environment variable to prevent crashes during engine initialization with the new PyTorch version. I have no further feedback to provide.
This was referenced May 20, 2026
Update PyTorch ecosystem versions:
- torch: 2.11.0 -> 2.12.0
- torchvision: 0.26.0 -> 0.27.0
- triton: 3.6.0 -> 3.7.0
- torchaudio: stays at 2.11.0

Bump CUDA 13 deps to match torch 2.12.0+cu130:
- nvidia-cudnn-cu13: 9.19.0.56 -> 9.20.0.48
- nvidia-cusparselt-cu13: 0.8.0 -> 0.8.1
- nvidia-nccl-cu13: 2.28.9 -> 2.29.7

Use --enforce-eager instead of TORCH_COMPILE_DISABLE=1 in the CPU SDE compat test. On torch 2.11 TORCH_COMPILE_DISABLE turned torch.compile call sites into silent no-ops; on torch 2.12 sites that pass fullgraph=True now raise "found no compiled frames", which crashes engine init via vLLM's piecewise-compile path. --enforce-eager skips the wrapper entirely on both versions.

Supersedes vllm-project#40077 (release wheels are now published, so the download.pytorch.org/whl/test/ indexes are no longer needed).

Co-authored-by: Claude <noreply@anthropic.com>
Signed-off-by: atalman <atalman@meta.com>
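The version bumps listed in the commit message can be sanity-checked against a requirements file with a short script. The sketch below is an illustration only: the function name `find_mismatches`, the `SAMPLE` text, and the assumption that pins use the `name==version` form are all hypothetical and not taken from the repository.

```python
import re

# Versions this PR pins, copied from the commit message.
EXPECTED_PINS = {
    "torch": "2.12.0",
    "torchvision": "0.27.0",
    "triton": "3.7.0",
    "torchaudio": "2.11.0",
}

# Matches "name==version"; stops before a local tag (+cu130), marker (;) or comment (#).
_PIN_RE = re.compile(r"^\s*([A-Za-z0-9_.-]+)\s*==\s*([^\s;#+]+)")


def find_mismatches(requirements_text, expected=EXPECTED_PINS):
    """Return {package: (found, wanted)} for pins that differ from `expected`."""
    mismatches = {}
    for line in requirements_text.splitlines():
        match = _PIN_RE.match(line)
        if match is None:
            continue  # comments, blank lines, unpinned or range requirements
        name, version = match.group(1).lower(), match.group(2)
        wanted = expected.get(name)
        if wanted is not None and version != wanted:
            mismatches[name] = (version, wanted)
    return mismatches


# Hypothetical file contents, not copied from the repository.
SAMPLE = """\
# build deps
torch==2.12.0+cu130
torchvision==0.27.0
triton==3.6.0  # stale pin
torchaudio==2.11.0
"""

if __name__ == "__main__":
    print(find_mismatches(SAMPLE))
```

An empty dictionary means every pin the script recognizes agrees with the versions listed here; packages that are unpinned or use range specifiers are skipped rather than reported.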
Purpose
Update the PyTorch ecosystem to the released versions:
- torch: 2.11.0 → 2.12.0
- torchvision: 0.26.0 → 0.27.0
- triton: 3.6.0 → 3.7.0
- torchaudio: stays at 2.11.0

This PR supersedes #40077. Now that the torch 2.12.0 release is published on PyPI / download.pytorch.org/whl/, no temporary whl/test/ index URLs are needed; wheels resolve from the regular indexes.

CUDA 13 transitive deps
Bumped to match torch==2.12.0+cu130:
- nvidia-cudnn-cu13: 9.19.0.56 → 9.20.0.48
- nvidia-cusparselt-cu13: 0.8.0 → 0.8.1
- nvidia-nccl-cu13: 2.28.9 → 2.29.7

CPU compatibility test fix
.buildkite/scripts/hardware_ci/run-cpu-compatibility-test.sh previously set TORCH_COMPILE_DISABLE=1 to skip torch.compile (slow under SDE). On torch 2.11 this turned every torch.compile call site into a silent no-op. On torch 2.12, call sites that pass fullgraph=True now raise a "found no compiled frames" error.

Engine init goes through vLLM's piecewise-compile path (which uses fullgraph=True), so init crashes inside determine_available_memory. Switched to vLLM's canonical --enforce-eager engine flag, which never constructs a torch.compile wrapper at all: same SDE speedup, no contract violation, works on both torch 2.11 and 2.12.

Tracked upstream as pytorch/pytorch#181247 (under umbrella pytorch/pytorch#180899).

Test Plan
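Before relying on CI, the reasoning behind the CPU compatibility test fix can be checked against a toy model of the control flow it describes. The sketch below is not PyTorch or vLLM code: every name in it (`toy_compile`, `init_engine`, `NoCompiledFramesError`, the `strict` switch) is hypothetical, and it simplifies when the error is raised (here at wrap time) purely for illustration.

```python
# Toy model of the behavior described in this PR; none of this is PyTorch or vLLM code.


class NoCompiledFramesError(RuntimeError):
    """Stands in for the "found no compiled frames" error quoted in the PR."""


def toy_compile(fn, *, fullgraph=False, compile_disabled=False, strict=True):
    """Mimic a compile wrapper under a "compile disabled" switch.

    strict=False models the torch 2.11 behavior described in the PR
    (a disabled compile is a silent no-op); strict=True models torch 2.12
    (fullgraph=True call sites raise when compilation is disabled).
    """
    if compile_disabled:
        if strict and fullgraph:
            raise NoCompiledFramesError("found no compiled frames")
        return fn  # silent no-op: the original function is returned unchanged

    def compiled(*args, **kwargs):
        # A real compiler would optimize here; the toy just forwards the call.
        return fn(*args, **kwargs)

    return compiled


def init_engine(forward, *, enforce_eager, compile_disabled=False, strict=True):
    """Return the callable a (hypothetical) engine would run."""
    if enforce_eager:
        return forward  # the compile wrapper is never constructed
    return toy_compile(
        forward, fullgraph=True, compile_disabled=compile_disabled, strict=strict
    )


def forward(x):
    return x * 2


if __name__ == "__main__":
    # Eager mode works regardless of the disable switch or the strictness level.
    print(init_engine(forward, enforce_eager=True, compile_disabled=True)(3))
    try:
        init_engine(forward, enforce_eager=False, compile_disabled=True, strict=True)
    except NoCompiledFramesError as exc:
        print("init failed:", exc)
```

In this model the eager path is the only one that behaves identically under both the lenient and the strict setting, which mirrors the PR's argument for preferring --enforce-eager over the environment variable.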
CI sign-off — Buildkite full daily / full nightly runs on this branch:
Test Result
To be filled in once CI completes on this branch.
Duplicate-work check
- … whl/test/. This PR is the clean release-channel version and supersedes it.
- … torch-xpu==2.12 upgrade. Different scope; this PR doesn't touch XPU.

AI-assistance disclosure
AI assistance (Claude) was used to draft the changes. Every changed line was reviewed and the test plan above was constructed and run by a human submitter.
Essential Elements of an Effective PR Description Checklist
… supported_models.md and examples for a new model.