[CUDA] Upgrade build pipelines to use CUDA 12.8 + cuDNN 9.8 by tianleiwu · Pull Request #26267 · microsoft/onnxruntime

tianleiwu · 2025-10-09T20:25:26Z

This upgrades the CI pipelines from CUDA 12.2 + cuDNN 9.5 to CUDA 12.8 + cuDNN 9.8, so that we can build with 120-real to support Blackwell GPUs.
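For context, `120-real` follows the CMake `CUDA_ARCHITECTURES` syntax, where a `-real` suffix requests device code for that architecture and `-virtual` requests PTX only. The sketch below parses such a list and checks whether a given architecture gets native code. The architecture list and the helper names are illustrative assumptions, not the values used by these pipelines, and only plain numeric entries are handled.

```python
from typing import NamedTuple


class ArchEntry(NamedTuple):
    sm: int        # e.g. 120 for compute capability 12.0
    real: bool     # device code is generated for this architecture
    virtual: bool  # PTX is embedded for this architecture


def parse_cuda_architectures(value: str) -> list[ArchEntry]:
    """Parse a CMAKE_CUDA_ARCHITECTURES-style string such as '86-real;120-real'."""
    entries = []
    for item in filter(None, (part.strip() for part in value.split(";"))):
        number, _, suffix = item.partition("-")
        if suffix not in ("", "real", "virtual"):
            raise ValueError(f"unsupported architecture entry: {item!r}")
        # With no suffix, CMake generates both real and virtual code.
        entries.append(
            ArchEntry(int(number), suffix in ("", "real"), suffix in ("", "virtual"))
        )
    return entries


def has_native_code(value: str, sm: int) -> bool:
    """True if the list requests device code for exactly this architecture."""
    return any(e.sm == sm and e.real for e in parse_cuda_architectures(value))


# Hypothetical architecture list; the real one lives in the pipeline configuration.
ARCHS = "75-real;86-real;90-real;120-real"
print(has_native_code(ARCHS, 120))                 # True
print(has_native_code("75-real;90-virtual", 120))  # False
```

A check like this could guard a packaging step so that a build without `120-real` is not published as Blackwell-ready.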

To speed up the build, we also disable relocatable-device-code.

MSVC is updated to the latest version for some Windows build pipelines.

Known issues

Some ONNX models (YOLO v3, YOLO v4, MobileNet v1) fail to run because the cuDNN frontend fails to find an engine plan. We will try upgrading the cuDNN frontend later. The related failing tests are disabled for now.
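As a rough illustration of what disabling the affected tests amounts to, the sketch below filters a test list against a skip set. All test identifiers here are hypothetical stand-ins for the models named above; the actual disabled test names are the ones changed in this PR.

```python
# Hypothetical identifiers for the affected models; the real test names differ.
DISABLED_MODEL_TESTS = frozenset({"yolov3", "yolov4", "mobilenet_v1"})


def select_model_tests(all_tests, disabled=DISABLED_MODEL_TESTS):
    """Return the tests to run, preserving order and dropping disabled ones."""
    return [name for name in all_tests if name not in disabled]


# Hypothetical test inventory, for demonstration only.
tests = ["resnet50", "yolov3", "mobilenet_v1", "squeezenet"]
print(select_model_tests(tests))  # ['resnet50', 'squeezenet']
```

Keeping the skip set in one place makes it easy to re-enable the tests once the cuDNN frontend is upgraded.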


Co-authored-by: Changming Sun <chasun@microsoft.com>


1. Linux CPU Minimal Build: Upgrade setup-build-tools from v0.0.9 to v0.0.12 (SHA 8bad63a3) with ccache support. The v0.0.12 build actions (build-and-prep-ort-files, build-minimal-ort-and-run-tests, run-build-script-in-docker) all hardcode ccache invocations internally, so ccache MUST be installed by setup-build-tools. Added actions/cache steps for ccache and vcpkg directories across all jobs.
2. Web CI Pipeline (WASM): Same setup-build-tools upgrade plus added --use_cache to common_build_args. Without ccache, WASM builds compile from scratch each time, causing 30+ min timeouts.
3. CUDA Builds: Cherry-pick test/build fixes from main commit 311b4a6 (PR #26267) needed for CUDA 12.8 compatibility:
   - Disable YOLO v3/v4 and MobilenetV1 model tests (cuDNN frontend cannot find engine plan with cuDNN 9.8)
   - Switch from --relocatable-device-code=true to --static-global-template-stub=false (faster builds)
   - Fix typeid(T).name() build error in gather_block_quantized test

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
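The flag switch in the CUDA item can be pictured as a one-for-one substitution in the nvcc flag list. This is only a sketch: the function name and the sample flag list are assumptions, and how the build scripts actually assemble their flags is not shown here.

```python
OLD_FLAG = "--relocatable-device-code=true"
NEW_FLAG = "--static-global-template-stub=false"


def swap_rdc_flag(nvcc_flags):
    """Replace the RDC flag with the template-stub flag; other flags are untouched."""
    return [NEW_FLAG if flag == OLD_FLAG else flag for flag in nvcc_flags]


# Hypothetical flag list, for demonstration only.
flags = ["-O3", "--relocatable-device-code=true", "--expt-relaxed-constexpr"]
print(swap_rdc_flag(flags))
# ['-O3', '--static-global-template-stub=false', '--expt-relaxed-constexpr']
```

The substitution returns a new list, so the original flags stay available for comparison when diffing build configurations.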

Changming Sun and others added 11 commits October 8, 2025 12:30

CUDA build

6245011

update

218724f

change base images

c165a83

remove rdc to speed up build

5a58b39

Merge remote-tracking branch 'origin/main' into cuda128

77c5b46

Replace the Windows machine from onnxruntime-github-vs2022-mms to onn…

5085600

…xruntime-github-vs2022-latest

Update custom-nuget-packaging-pipeline.yml for Azure Pipelines

90a6a35

Merge remote-tracking branch 'origin/main' into cuda128

e851f63

update

1951b84

Merge remote-tracking branch 'origin/snnn/fl_pipeline' into cuda128

507ccc9

update

2e54006

tianleiwu commented Oct 9, 2025


tools/ci_build/github/azure-pipelines/nuget/templates/test_linux.yml (outdated)

disable failed onnx model tests

fa9484f

tianleiwu mentioned this pull request Oct 10, 2025

[CUDA] cudnn frontend failed for yolov3, yolov4 and mobilenet_v1 #26274

Closed

update

824554b

tianleiwu requested a review from snnn October 10, 2025 23:01

updated disabled test names

d01a3e5

snnn approved these changes Oct 13, 2025


tianleiwu merged commit 311b4a6 into main Oct 13, 2025
102 of 104 checks passed

tianleiwu deleted the cuda128 branch October 13, 2025 21:16

tianleiwu mentioned this pull request Oct 14, 2025

[CUDA] Adjust package build settings for Blackwell GPU #26307

Closed


tianleiwu merged 14 commits into main from cuda128


2 participants
