Skip to content

[CUDA] Upgrade build pipelines to use CUDA 12.8 + cuDNN 9.8#26267

Merged
tianleiwu merged 14 commits intomainfrom
cuda128
Oct 13, 2025
Merged

[CUDA] Upgrade build pipelines to use CUDA 12.8 + cuDNN 9.8#26267
tianleiwu merged 14 commits intomainfrom
cuda128

Conversation

@tianleiwu
Copy link
Copy Markdown
Contributor

@tianleiwu tianleiwu commented Oct 9, 2025

This upgrades CUDA 12.2 + cuDNN 9.5 to CUDA 12.8 + cuDNN 9.8 in CI pipelines, so that we can build 120-real to support Blackwell GPU.

To speed up build, we also disable relocatable-device-code.

MSVC is updated to latest for some windows build pipelines.

Known issues

Some onnx models (yolo v3, yolo v4, mobilenet v1) failed to run due to cudnn frontend failed to find engine plan. We will try upgrade cudnn frontend later. Related failed tests are disabled for now.

@tianleiwu tianleiwu requested a review from snnn October 10, 2025 23:01
@tianleiwu tianleiwu merged commit 311b4a6 into main Oct 13, 2025
102 of 104 checks passed
@tianleiwu tianleiwu deleted the cuda128 branch October 13, 2025 21:16
fs-eire pushed a commit that referenced this pull request Oct 24, 2025
This upgrades CUDA 12.2 + cuDNN 9.5 to CUDA 12.8 + cuDNN 9.8 in CI
pipelines, so that we can build 120-real to support Blackwell GPU.

To speed up build, we also disable relocatable-device-code.

MSVC is updated to latest for some windows build pipelines.

#### Known issues

Some onnx models (yolo v3, yolo v4, mobilenet v1) failed to run due to
cudnn frontend failed to find engine plan. We will try upgrade cudnn
frontend later. Related failed tests are disabled for now.

---------

Co-authored-by: Changming Sun <chasun@microsoft.com>
naomiOvad pushed a commit to naomiOvad/onnxruntime that referenced this pull request Nov 2, 2025
…t#26267)

This upgrades CUDA 12.2 + cuDNN 9.5 to CUDA 12.8 + cuDNN 9.8 in CI
pipelines, so that we can build 120-real to support Blackwell GPU.

To speed up build, we also disable relocatable-device-code.

MSVC is updated to latest for some windows build pipelines.

#### Known issues

Some onnx models (yolo v3, yolo v4, mobilenet v1) failed to run due to
cudnn frontend failed to find engine plan. We will try upgrade cudnn
frontend later. Related failed tests are disabled for now.

---------

Co-authored-by: Changming Sun <chasun@microsoft.com>
adrastogi added a commit that referenced this pull request Mar 30, 2026
1. Linux CPU Minimal Build: Upgrade setup-build-tools from v0.0.9 to
   v0.0.12 (SHA 8bad63a3) with ccache support. The v0.0.12 build actions
   (build-and-prep-ort-files, build-minimal-ort-and-run-tests,
   run-build-script-in-docker) all hardcode ccache invocations internally,
   so ccache MUST be installed by setup-build-tools. Added actions/cache
   steps for ccache and vcpkg directories across all jobs.

2. Web CI Pipeline (WASM): Same setup-build-tools upgrade plus added
   --use_cache to common_build_args. Without ccache, WASM builds compile
   from scratch each time, causing 30+ min timeouts.

3. CUDA Builds: Cherry-pick test/build fixes from main commit 311b4a6
   (PR #26267) needed for CUDA 12.8 compatibility:
   - Disable YOLO v3/v4 and MobilenetV1 model tests (cuDNN frontend
     cannot find engine plan with cuDNN 9.8)
   - Switch from --relocatable-device-code=true to
     --static-global-template-stub=false (faster builds)
   - Fix typeid(T).name() build error in gather_block_quantized test

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants