[STF] Fix CUDA graph API calls for CUDA 13 #5636

Merged
caugonnet merged 3 commits into NVIDIA:main from caugonnet:stf_cuda13 on Aug 23, 2025

@caugonnet
Contributor

Description

CUDASTF updates for CUDA graph API changes in CUDA 13

closes

Checklist

  • New or existing tests cover these changes.
  • The documentation is up to date with these changes.

caugonnet requested a review from a team as a code owner Aug 22, 2025 21:51
caugonnet requested a review from pciolkosz Aug 22, 2025 21:51
github-project-automation bot moved this to Todo in CCCL Aug 22, 2025
copy-pr-bot bot commented Aug 22, 2025

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

caugonnet self-assigned this Aug 22, 2025
caugonnet added the stf (Sequential Task Flow programming model) label Aug 22, 2025
cccl-authenticator-app bot moved this from Todo to In Review in CCCL Aug 22, 2025
@caugonnet
Contributor Author

/ok to test 5a8fa16

@caugonnet
Contributor Author

/ok to test 7c7d71c

@caugonnet
Contributor Author

/ok to test 6ae6a09

caugonnet enabled auto-merge (squash) Aug 22, 2025 23:00
@github-actions
Contributor

🟩 CI finished in 1h 27m: Pass: 100%/32 | Total: 7h 46m | Avg: 14m 35s | Max: 39m 56s | Hits: 78%/15446
  • 🟩 cudax: Pass: 100%/28 | Total: 7h 08m | Avg: 15m 17s | Max: 39m 56s | Hits: 78%/15446

    🟩 cpu
      🟩 amd64              Pass: 100%/24  | Total:  6h 20m | Avg: 15m 50s | Max: 39m 56s | Hits:  79%/13066 
      🟩 arm64              Pass: 100%/4   | Total: 47m 43s | Avg: 11m 55s | Max: 12m 35s | Hits:  75%/2380  
    🟩 ctk
      🟩 12.0               Pass: 100%/3   | Total: 34m 40s | Avg: 11m 33s | Max: 12m 56s | Hits:  79%/1480  
      🟩 12.9               Pass: 100%/25  | Total:  6h 33m | Avg: 15m 44s | Max: 39m 56s | Hits:  78%/13966 
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/3   | Total: 34m 40s | Avg: 11m 33s | Max: 12m 56s | Hits:  79%/1480  
      🟩 nvcc12.9           Pass: 100%/25  | Total:  6h 33m | Avg: 15m 44s | Max: 39m 56s | Hits:  78%/13966 
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/28  | Total:  7h 08m | Avg: 15m 17s | Max: 39m 56s | Hits:  78%/15446 
    🟩 cxx
      🟩 Clang14            Pass: 100%/2   | Total: 21m 41s | Avg: 10m 50s | Max: 11m 41s | Hits:  75%/1192  
      🟩 Clang15            Pass: 100%/1   | Total: 12m 40s | Avg: 12m 40s | Max: 12m 40s | Hits:  75%/595   
      🟩 Clang16            Pass: 100%/1   | Total: 12m 21s | Avg: 12m 21s | Max: 12m 21s | Hits:  75%/595   
      🟩 Clang17            Pass: 100%/1   | Total: 13m 48s | Avg: 13m 48s | Max: 13m 48s | Hits:  75%/595   
      🟩 Clang18            Pass: 100%/1   | Total: 12m 54s | Avg: 12m 54s | Max: 12m 54s | Hits:  75%/595   
      🟩 Clang19            Pass: 100%/4   | Total:  1h 04m | Avg: 16m 02s | Max: 28m 28s | Hits:  81%/2380  
      🟩 GCC10              Pass: 100%/2   | Total: 26m 48s | Avg: 13m 24s | Max: 15m 04s | Hits:  75%/1192  
      🟩 GCC11              Pass: 100%/1   | Total: 14m 19s | Avg: 14m 19s | Max: 14m 19s | Hits:  75%/595   
      🟩 GCC12              Pass: 100%/1   | Total: 15m 45s | Avg: 15m 45s | Max: 15m 45s | Hits:  75%/595   
      🟩 GCC13              Pass: 100%/8   | Total:  2h 05m | Avg: 15m 38s | Max: 39m 56s | Hits:  80%/4760  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 12m 56s | Avg: 12m 56s | Max: 12m 56s | Hits:  95%/290   
      🟩 MSVC14.43          Pass: 100%/3   | Total: 40m 31s | Avg: 13m 30s | Max: 15m 24s | Hits:  92%/876   
      🟩 NVHPC25.5          Pass: 100%/2   | Total: 55m 08s | Avg: 27m 34s | Max: 28m 18s | Hits:  66%/1186  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/10  | Total:  2h 17m | Avg: 13m 45s | Max: 28m 28s | Hits:  77%/5952  
      🟩 GCC                Pass: 100%/12  | Total:  3h 01m | Avg: 15m 09s | Max: 39m 56s | Hits:  78%/7142  
      🟩 MSVC               Pass: 100%/4   | Total: 53m 27s | Avg: 13m 21s | Max: 15m 24s | Hits:  93%/1166  
      🟩 NVHPC              Pass: 100%/2   | Total: 55m 08s | Avg: 27m 34s | Max: 28m 18s | Hits:  66%/1186  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 19m 33s | Avg:  9m 46s | Max: 10m 45s | Hits:  87%/1190  
      🟩 rtx2080            Pass: 100%/26  | Total:  6h 48m | Avg: 15m 42s | Max: 39m 56s | Hits:  77%/14256 
    🟩 jobs
      🟩 Build              Pass: 100%/25  | Total:  5h 50m | Avg: 14m 02s | Max: 28m 18s | Hits:  75%/13661 
      🟩 Test               Pass: 100%/3   | Total:  1h 17m | Avg: 25m 44s | Max: 39m 56s | Hits:  99%/1785  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 19m 33s | Avg:  9m 46s | Max: 10m 45s | Hits:  87%/1190  
      🟩 90;90a             Pass: 100%/2   | Total: 26m 51s | Avg: 13m 25s | Max: 14m 08s | Hits:  74%/887   
      🟩 100;120            Pass: 100%/2   | Total: 24m 51s | Avg: 12m 25s | Max: 12m 27s | Hits:  80%/887   
    🟩 std
      🟩 17                 Pass: 100%/3   | Total: 50m 25s | Avg: 16m 48s | Max: 26m 50s | Hits:  72%/1783  
      🟩 20                 Pass: 100%/25  | Total:  6h 17m | Avg: 15m 06s | Max: 39m 56s | Hits:  79%/13663 
    
  • 🟩 packaging: Pass: 100%/4 | Total: 38m 44s | Avg: 9m 41s | Max: 10m 22s

    🟩 cpu
      🟩 amd64              Pass: 100%/4   | Total: 38m 44s | Avg:  9m 41s | Max: 10m 22s
    🟩 ctk
      🟩 12.0               Pass: 100%/2   | Total: 19m 00s | Avg:  9m 30s | Max: 10m 09s
      🟩 12.9               Pass: 100%/2   | Total: 19m 44s | Avg:  9m 52s | Max: 10m 22s
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/2   | Total: 19m 00s | Avg:  9m 30s | Max: 10m 09s
      🟩 nvcc12.9           Pass: 100%/2   | Total: 19m 44s | Avg:  9m 52s | Max: 10m 22s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 38m 44s | Avg:  9m 41s | Max: 10m 22s
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total: 10m 09s | Avg: 10m 09s | Max: 10m 09s
      🟩 Clang19            Pass: 100%/1   | Total: 10m 22s | Avg: 10m 22s | Max: 10m 22s
      🟩 GCC12              Pass: 100%/1   | Total:  8m 51s | Avg:  8m 51s | Max:  8m 51s
      🟩 GCC13              Pass: 100%/1   | Total:  9m 22s | Avg:  9m 22s | Max:  9m 22s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/2   | Total: 20m 31s | Avg: 10m 15s | Max: 10m 22s
      🟩 GCC                Pass: 100%/2   | Total: 18m 13s | Avg:  9m 06s | Max:  9m 22s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 38m 44s | Avg:  9m 41s | Max: 10m 22s
    🟩 jobs
      🟩 Test               Pass: 100%/4   | Total: 38m 44s | Avg:  9m 41s | Max: 10m 22s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
CCCL Packaging
libcu++
CUB
Thrust
+/- CUDA Experimental
stdpar
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- CCCL Packaging
libcu++
CUB
Thrust
+/- CUDA Experimental
stdpar
python
CCCL C Parallel Library
Catch2Helper

🏃‍ Runner counts (total jobs: 32)

# Runner
17 linux-amd64-cpu16
6 linux-amd64-gpu-rtx2080-latest-1
4 linux-arm64-cpu16
4 windows-amd64-cpu16
1 linux-amd64-gpu-h100-latest-1

caugonnet merged commit e98b6a5 into NVIDIA:main Aug 23, 2025
44 checks passed
github-project-automation bot moved this from In Review to Done in CCCL Aug 23, 2025
davebayer pushed a commit to davebayer/cccl that referenced this pull request Sep 23, 2025
* Fix CUDA graph API calls for CUDA 13

* safer way to update the API with macros

* safer way to update the API with macros