[STF] Add python bindings #5315

Open
caugonnet wants to merge 279 commits into NVIDIA:main from caugonnet:stf_c_api

@caugonnet
Contributor

caugonnet commented Jul 18, 2025

Description

Introduce Python bindings for CUDASTF.

closes

Checklist

  • New or existing tests cover these changes.
  • The documentation is up to date with these changes.

caugonnet self-assigned this Jul 18, 2025
caugonnet requested review from a team as code owners July 18, 2025 21:28
caugonnet added the "stf" (Sequential Task Flow programming model) label Jul 18, 2025
github-project-automation bot moved this to Todo in CCCL Jul 18, 2025
@copy-pr-bot
Contributor

copy-pr-bot bot commented Jul 18, 2025

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

cccl-authenticator-app bot moved this from Todo to In Review in CCCL Jul 18, 2025
@caugonnet
Contributor Author

/ok to test c9acb99

@caugonnet
Contributor Author

/ok to test 6971d8f

@caugonnet
Contributor Author

/ok to test 2f7299e

@github-actions
Contributor

🟨 CI finished in 1h 40m: Pass: 91%/205 | Total: 1d 11h | Avg: 10m 28s | Max: 43m 15s | Hits: 96%/340562
  • 🟥 python: Pass: 0%/18 | Total: 18m 23s | Avg: 1m 01s | Max: 9m 23s

    🟥 cpu
      🟥 amd64              Pass:   0%/18  | Total: 18m 23s | Avg:  1m 01s | Max:  9m 23s
    🟥 ctk
      🟥 12.9               Pass:   0%/18  | Total: 18m 23s | Avg:  1m 01s | Max:  9m 23s
    🟥 cudacxx
      🟥 nvcc12.9           Pass:   0%/18  | Total: 18m 23s | Avg:  1m 01s | Max:  9m 23s
    🟥 cudacxx_family
      🟥 nvcc               Pass:   0%/18  | Total: 18m 23s | Avg:  1m 01s | Max:  9m 23s
    🟥 cxx
      🟥 GCC13              Pass:   0%/18  | Total: 18m 23s | Avg:  1m 01s | Max:  9m 23s
    🟥 cxx_family
      🟥 GCC                Pass:   0%/18  | Total: 18m 23s | Avg:  1m 01s | Max:  9m 23s
    🟥 gpu
      🟥 h100               Pass:   0%/8  
      🟥 rtxa6000           Pass:   0%/10  | Total: 18m 23s | Avg:  1m 50s | Max:  9m 23s
    🟥 jobs
      🟥 Build cuda.cccl    Pass:   0%/2   | Total: 18m 23s | Avg:  9m 11s | Max:  9m 23s
      🟥 Test cuda.cccl.cooperative Pass:   0%/4  
      🟥 Test cuda.cccl.examples Pass:   0%/4  
      🟥 Test cuda.cccl.headers Pass:   0%/4  
      🟥 Test cuda.cccl.parallel Pass:   0%/4  
    🟥 py_version
      🟥 3.10               Pass:   0%/9   | Total:  9m 23s | Avg:  1m 02s | Max:  9m 23s
      🟥 3.13               Pass:   0%/9   | Total:  9m 00s | Avg:  1m 00s | Max:  9m 00s
    
  • 🟩 cub: Pass: 100%/50 | Total: 12h 24m | Avg: 14m 53s | Max: 43m 15s | Hits: 99%/61706

    🟩 cpu
      🟩 amd64              Pass: 100%/48  | Total: 12h 10m | Avg: 15m 12s | Max: 43m 15s | Hits:  99%/59190 
      🟩 arm64              Pass: 100%/2   | Total: 14m 34s | Avg:  7m 17s | Max:  8m 24s | Hits:  99%/2516  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  1h 00m | Avg: 12m 00s | Max: 30m 17s | Hits:  99%/6186  
      🟩 12.9               Pass: 100%/45  | Total: 11h 24m | Avg: 15m 12s | Max: 43m 15s | Hits:  99%/55520 
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 10m 29s | Avg:  5m 14s | Max:  5m 18s | Hits:  99%/2165  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  1h 00m | Avg: 12m 00s | Max: 30m 17s | Hits:  99%/6186  
      🟩 nvcc12.9           Pass: 100%/43  | Total: 11h 14m | Avg: 15m 40s | Max: 43m 15s | Hits:  99%/53355 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 29s | Avg:  5m 14s | Max:  5m 18s | Hits:  99%/2165  
      🟩 nvcc               Pass: 100%/48  | Total: 12h 14m | Avg: 15m 17s | Max: 43m 15s | Hits:  99%/59541 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 26m 17s | Avg:  6m 34s | Max:  7m 00s | Hits:  99%/5034  
      🟩 Clang15            Pass: 100%/2   | Total: 13m 48s | Avg:  6m 54s | Max:  6m 55s | Hits:  99%/2513  
      🟩 Clang16            Pass: 100%/2   | Total: 13m 27s | Avg:  6m 43s | Max:  6m 45s | Hits:  99%/2513  
      🟩 Clang17            Pass: 100%/2   | Total: 14m 22s | Avg:  7m 11s | Max:  7m 17s | Hits:  99%/2513  
      🟩 Clang18            Pass: 100%/2   | Total: 13m 10s | Avg:  6m 35s | Max:  6m 36s | Hits:  99%/2513  
      🟩 Clang19            Pass: 100%/7   | Total:  1h 39m | Avg: 14m 13s | Max: 35m 00s | Hits:  99%/8449  
      🟩 GCC7               Pass: 100%/2   | Total: 16m 58s | Avg:  8m 29s | Max:  8m 32s | Hits:  99%/2516  
      🟩 GCC8               Pass: 100%/1   | Total:  8m 47s | Avg:  8m 47s | Max:  8m 47s | Hits:  99%/1258  
      🟩 GCC9               Pass: 100%/2   | Total: 18m 43s | Avg:  9m 21s | Max:  9m 43s | Hits:  99%/2516  
      🟩 GCC10              Pass: 100%/2   | Total: 18m 01s | Avg:  9m 00s | Max:  9m 05s | Hits:  99%/2517  
      🟩 GCC11              Pass: 100%/2   | Total: 17m 44s | Avg:  8m 52s | Max:  9m 14s | Hits:  99%/2513  
      🟩 GCC12              Pass: 100%/2   | Total: 19m 10s | Avg:  9m 35s | Max: 10m 03s | Hits:  99%/2513  
      🟩 GCC13              Pass: 100%/12  | Total:  4h 17m | Avg: 21m 28s | Max: 43m 15s | Hits:  99%/15105 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 02m | Avg: 31m 28s | Max: 32m 40s | Hits:  99%/2306  
      🟩 MSVC14.43          Pass: 100%/4   | Total:  1h 57m | Avg: 29m 23s | Max: 32m 44s | Hits:  99%/4612  
      🟩 NVHPC25.5          Pass: 100%/2   | Total: 26m 34s | Avg: 13m 17s | Max: 13m 20s | Hits:  98%/2315  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  3h 00m | Avg:  9m 30s | Max: 35m 00s | Hits:  99%/23535 
      🟩 GCC                Pass: 100%/23  | Total:  5h 57m | Avg: 15m 31s | Max: 43m 15s | Hits:  99%/28938 
      🟩 MSVC               Pass: 100%/6   | Total:  3h 00m | Avg: 30m 05s | Max: 32m 44s | Hits:  99%/6918  
      🟩 NVHPC              Pass: 100%/2   | Total: 26m 34s | Avg: 13m 17s | Max: 13m 20s | Hits:  98%/2315  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total: 59m 25s | Avg: 19m 48s | Max: 29m 32s | Hits:  99%/3777  
      🟩 rtx2080            Pass: 100%/39  | Total:  7h 25m | Avg: 11m 25s | Max: 32m 44s | Hits:  99%/47863 
      🟩 rtxa6000           Pass: 100%/8   | Total:  3h 59m | Avg: 29m 59s | Max: 43m 15s | Hits:  99%/10066 
    🟩 jobs
      🟩 Build              Pass: 100%/42  | Total:  7h 47m | Avg: 11m 08s | Max: 32m 44s | Hits:  99%/51638 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 37m 41s | Avg: 37m 41s | Max: 37m 41s | Hits:  99%/1259  
      🟩 GraphCapture       Pass: 100%/1   | Total: 35m 39s | Avg: 35m 39s | Max: 35m 39s | Hits:  99%/1259  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 42m | Avg: 34m 12s | Max: 38m 05s | Hits:  99%/3775  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 40m | Avg: 33m 39s | Max: 43m 15s | Hits:  99%/3775  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 59m 25s | Avg: 19m 48s | Max: 29m 32s | Hits:  99%/3777  
      🟩 90;90a             Pass: 100%/2   | Total: 34m 58s | Avg: 17m 29s | Max: 26m 27s | Hits:  99%/2412  
      🟩 100;120            Pass: 100%/2   | Total: 35m 50s | Avg: 17m 55s | Max: 27m 10s | Hits:  99%/2412  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total:  4h 01m | Avg: 11m 30s | Max: 32m 44s | Hits:  99%/25810 
      🟩 20                 Pass: 100%/29  | Total:  8h 23m | Avg: 17m 20s | Max: 43m 15s | Hits:  99%/35896 
    
  • 🟩 thrust: Pass: 100%/50 | Total: 9h 24m | Avg: 11m 17s | Max: 36m 32s | Hits: 99%/95621

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 21m 03s | Avg: 10m 31s | Max: 13m 07s | Hits:  99%/3828  
    🟩 cpu
      🟩 amd64              Pass: 100%/48  | Total:  9h 12m | Avg: 11m 30s | Max: 36m 32s | Hits:  99%/91794 
      🟩 arm64              Pass: 100%/2   | Total: 12m 05s | Avg:  6m 02s | Max:  6m 59s | Hits:  99%/3827  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 52m 06s | Avg: 10m 25s | Max: 27m 17s | Hits:  99%/9560  
      🟩 12.9               Pass: 100%/45  | Total:  8h 32m | Avg: 11m 23s | Max: 36m 32s | Hits:  99%/86061 
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 11m 13s | Avg:  5m 36s | Max:  5m 53s | Hits: 100%/3826  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 52m 06s | Avg: 10m 25s | Max: 27m 17s | Hits:  99%/9560  
      🟩 nvcc12.9           Pass: 100%/43  | Total:  8h 21m | Avg: 11m 39s | Max: 36m 32s | Hits:  99%/82235 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 11m 13s | Avg:  5m 36s | Max:  5m 53s | Hits: 100%/3826  
      🟩 nvcc               Pass: 100%/48  | Total:  9h 13m | Avg: 11m 31s | Max: 36m 32s | Hits:  99%/91795 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 23m 42s | Avg:  5m 55s | Max:  6m 24s | Hits: 100%/7652  
      🟩 Clang15            Pass: 100%/2   | Total: 11m 34s | Avg:  5m 47s | Max:  5m 48s | Hits: 100%/3826  
      🟩 Clang16            Pass: 100%/2   | Total: 12m 01s | Avg:  6m 00s | Max:  6m 06s | Hits: 100%/3826  
      🟩 Clang17            Pass: 100%/2   | Total: 11m 33s | Avg:  5m 46s | Max:  5m 50s | Hits: 100%/3826  
      🟩 Clang18            Pass: 100%/2   | Total: 11m 27s | Avg:  5m 43s | Max:  5m 44s | Hits: 100%/3826  
      🟩 Clang19            Pass: 100%/7   | Total: 47m 25s | Avg:  6m 46s | Max: 10m 58s | Hits: 100%/13391 
      🟩 GCC7               Pass: 100%/2   | Total: 13m 47s | Avg:  6m 53s | Max:  7m 21s | Hits:  99%/3828  
      🟩 GCC8               Pass: 100%/1   | Total:  7m 08s | Avg:  7m 08s | Max:  7m 08s | Hits:  99%/1914  
      🟩 GCC9               Pass: 100%/2   | Total: 15m 06s | Avg:  7m 33s | Max:  7m 58s | Hits:  99%/3828  
      🟩 GCC10              Pass: 100%/2   | Total: 14m 42s | Avg:  7m 21s | Max:  7m 24s | Hits:  99%/3828  
      🟩 GCC11              Pass: 100%/2   | Total: 15m 42s | Avg:  7m 51s | Max:  8m 00s | Hits:  99%/3828  
      🟩 GCC12              Pass: 100%/2   | Total: 16m 47s | Avg:  8m 23s | Max:  8m 29s | Hits:  99%/3828  
      🟩 GCC13              Pass: 100%/11  | Total:  1h 41m | Avg:  9m 14s | Max: 14m 40s | Hits:  99%/21054 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 55m 13s | Avg: 27m 36s | Max: 27m 56s | Hits:  99%/3812  
      🟩 MSVC14.43          Pass: 100%/5   | Total:  2h 20m | Avg: 28m 05s | Max: 33m 15s | Hits:  99%/9530  
      🟩 NVHPC25.5          Pass: 100%/2   | Total:  1h 06m | Avg: 33m 17s | Max: 36m 32s | Hits:  99%/3824  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  1h 57m | Avg:  6m 11s | Max: 10m 58s | Hits: 100%/36347 
      🟩 GCC                Pass: 100%/22  | Total:  3h 04m | Avg:  8m 24s | Max: 14m 40s | Hits:  99%/42108 
      🟩 MSVC               Pass: 100%/7   | Total:  3h 15m | Avg: 27m 57s | Max: 33m 15s | Hits:  99%/13342 
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 06m | Avg: 33m 17s | Max: 36m 32s | Hits:  99%/3824  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 20m 25s | Avg: 10m 12s | Max: 14m 40s | Hits:  99%/3828  
      🟩 rtx2080            Pass: 100%/38  | Total:  6h 45m | Avg: 10m 39s | Max: 36m 32s | Hits:  99%/72672 
      🟩 rtx4090            Pass: 100%/10  | Total:  2h 19m | Avg: 13m 54s | Max: 33m 15s | Hits:  99%/19121 
    🟩 jobs
      🟩 Build              Pass: 100%/43  | Total:  7h 42m | Avg: 10m 44s | Max: 36m 32s | Hits:  99%/82233 
      🟩 TestCPU            Pass: 100%/3   | Total: 50m 42s | Avg: 16m 54s | Max: 33m 15s | Hits:  99%/5733  
      🟩 TestGPU            Pass: 100%/4   | Total: 52m 00s | Avg: 13m 00s | Max: 14m 40s | Hits:  99%/7655  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 20m 25s | Avg: 10m 12s | Max: 14m 40s | Hits:  99%/3828  
      🟩 90;90a             Pass: 100%/2   | Total: 31m 34s | Avg: 15m 47s | Max: 24m 24s | Hits:  99%/3820  
      🟩 100;120            Pass: 100%/2   | Total: 31m 29s | Avg: 15m 44s | Max: 24m 33s | Hits:  99%/3820  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total:  3h 56m | Avg: 11m 15s | Max: 36m 32s | Hits:  99%/40160 
      🟩 20                 Pass: 100%/27  | Total:  5h 07m | Avg: 11m 23s | Max: 33m 15s | Hits:  99%/51633 
    
  • 🟩 libcudacxx: Pass: 100%/48 | Total: 10h 07m | Avg: 12m 39s | Max: 34m 35s | Hits: 92%/166828

    🟩 cpu
      🟩 amd64              Pass: 100%/46  | Total:  9h 58m | Avg: 13m 00s | Max: 34m 35s | Hits:  92%/159367
      🟩 arm64              Pass: 100%/2   | Total:  9m 08s | Avg:  4m 34s | Max:  4m 40s | Hits:  99%/7461  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 47m 32s | Avg:  9m 30s | Max: 30m 11s | Hits:  99%/18287 
      🟩 12.9               Pass: 100%/43  | Total:  9h 19m | Avg: 13m 01s | Max: 34m 35s | Hits:  91%/148541
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 53m 46s | Avg: 26m 53s | Max: 27m 32s | Hits:  29%/7425  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 47m 32s | Avg:  9m 30s | Max: 30m 11s | Hits:  99%/18287 
      🟩 nvcc12.9           Pass: 100%/41  | Total:  8h 26m | Avg: 12m 20s | Max: 34m 35s | Hits:  94%/141116
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 53m 46s | Avg: 26m 53s | Max: 27m 32s | Hits:  29%/7425  
      🟩 nvcc               Pass: 100%/46  | Total:  9h 13m | Avg: 12m 02s | Max: 34m 35s | Hits:  95%/159403
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 18m 45s | Avg:  4m 41s | Max:  5m 03s | Hits:  99%/14806 
      🟩 Clang15            Pass: 100%/2   | Total: 36m 25s | Avg: 18m 12s | Max: 31m 20s | Hits:  68%/7421  
      🟩 Clang16            Pass: 100%/2   | Total: 11m 12s | Avg:  5m 36s | Max:  6m 10s | Hits:  98%/7421  
      🟩 Clang17            Pass: 100%/2   | Total: 39m 30s | Avg: 19m 45s | Max: 34m 35s | Hits:  68%/7421  
      🟩 Clang18            Pass: 100%/2   | Total:  9m 50s | Avg:  4m 55s | Max:  4m 58s | Hits:  99%/7421  
      🟩 Clang19            Pass: 100%/6   | Total:  1h 30m | Avg: 15m 01s | Max: 27m 32s | Hits:  75%/22306 
      🟩 GCC7               Pass: 100%/2   | Total:  8m 54s | Avg:  4m 27s | Max:  4m 44s | Hits:  99%/7357  
      🟩 GCC8               Pass: 100%/1   | Total:  4m 33s | Avg:  4m 33s | Max:  4m 33s | Hits:  99%/3689  
      🟩 GCC9               Pass: 100%/2   | Total:  9m 47s | Avg:  4m 53s | Max:  5m 20s | Hits:  99%/7369  
      🟩 GCC10              Pass: 100%/2   | Total:  9m 43s | Avg:  4m 51s | Max:  4m 58s | Hits:  99%/7423  
      🟩 GCC11              Pass: 100%/2   | Total: 18m 13s | Avg:  9m 06s | Max: 13m 24s | Hits:  89%/7419  
      🟩 GCC12              Pass: 100%/2   | Total: 10m 06s | Avg:  5m 03s | Max:  5m 09s | Hits:  99%/7423  
      🟩 GCC13              Pass: 100%/11  | Total:  2h 16m | Avg: 12m 22s | Max: 24m 39s | Hits:  96%/30183 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 01m | Avg: 30m 59s | Max: 31m 47s | Hits:  99%/7093  
      🟩 MSVC14.43          Pass: 100%/4   | Total:  2h 00m | Avg: 30m 05s | Max: 32m 19s | Hits:  99%/14669 
      🟩 NVHPC25.5          Pass: 100%/2   | Total: 21m 55s | Avg: 10m 57s | Max: 11m 10s | Hits:  98%/7407  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/18  | Total:  3h 25m | Avg: 11m 26s | Max: 34m 35s | Hits:  84%/66796 
      🟩 GCC                Pass: 100%/22  | Total:  3h 17m | Avg:  8m 58s | Max: 24m 39s | Hits:  97%/70863 
      🟩 MSVC               Pass: 100%/6   | Total:  3h 02m | Avg: 30m 23s | Max: 32m 19s | Hits:  99%/21762 
      🟩 NVHPC              Pass: 100%/2   | Total: 21m 55s | Avg: 10m 57s | Max: 11m 10s | Hits:  98%/7407  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 27m 15s | Avg: 13m 37s | Max: 22m 05s | Hits:  99%/7628  
      🟩 rtx2080            Pass: 100%/46  | Total:  9h 40m | Avg: 12m 36s | Max: 34m 35s | Hits:  92%/159200
    🟩 jobs
      🟩 Build              Pass: 100%/42  | Total:  8h 09m | Avg: 11m 39s | Max: 34m 35s | Hits:  92%/155511
      🟩 NVRTC              Pass: 100%/2   | Total: 48m 44s | Avg: 24m 22s | Max: 24m 39s | Hits:  90%/42    
      🟩 Test               Pass: 100%/3   | Total:  1h 07m | Avg: 22m 25s | Max: 23m 44s | Hits:  99%/11275 
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  1m 58s | Avg:  1m 58s | Max:  1m 58s
    🟩 sm
      🟩 75                 Pass: 100%/2   | Total: 48m 44s | Avg: 24m 22s | Max: 24m 39s | Hits:  90%/42    
      🟩 90                 Pass: 100%/2   | Total: 27m 15s | Avg: 13m 37s | Max: 22m 05s | Hits:  99%/7628  
      🟩 90;90a             Pass: 100%/2   | Total: 35m 18s | Avg: 17m 39s | Max: 30m 15s | Hits:  99%/7575  
      🟩 100;120            Pass: 100%/2   | Total: 34m 03s | Avg: 17m 01s | Max: 28m 28s | Hits:  99%/7575  
    🟩 std
      🟩 17                 Pass: 100%/22  | Total:  4h 34m | Avg: 12m 27s | Max: 31m 47s | Hits:  91%/77045 
      🟩 20                 Pass: 100%/25  | Total:  5h 31m | Avg: 13m 15s | Max: 34m 35s | Hits:  93%/89783 
    
  • 🟩 cudax: Pass: 100%/28 | Total: 2h 37m | Avg: 5m 37s | Max: 12m 36s | Hits: 99%/15906

    🟩 cpu
      🟩 amd64              Pass: 100%/24  | Total:  2h 24m | Avg:  6m 02s | Max: 12m 36s | Hits:  99%/13462 
      🟩 arm64              Pass: 100%/4   | Total: 12m 37s | Avg:  3m 09s | Max:  3m 27s | Hits:  99%/2444  
    🟩 ctk
      🟩 12.0               Pass: 100%/3   | Total: 18m 20s | Avg:  6m 06s | Max: 11m 39s | Hits:  99%/1531  
      🟩 12.9               Pass: 100%/25  | Total:  2h 19m | Avg:  5m 33s | Max: 12m 36s | Hits:  99%/14375 
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/3   | Total: 18m 20s | Avg:  6m 06s | Max: 11m 39s | Hits:  99%/1531  
      🟩 nvcc12.9           Pass: 100%/25  | Total:  2h 19m | Avg:  5m 33s | Max: 12m 36s | Hits:  99%/14375 
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/28  | Total:  2h 37m | Avg:  5m 37s | Max: 12m 36s | Hits:  99%/15906 
    🟩 cxx
      🟩 Clang14            Pass: 100%/2   | Total:  6m 35s | Avg:  3m 17s | Max:  3m 31s | Hits: 100%/1224  
      🟩 Clang15            Pass: 100%/1   | Total:  3m 20s | Avg:  3m 20s | Max:  3m 20s | Hits: 100%/611   
      🟩 Clang16            Pass: 100%/1   | Total:  3m 20s | Avg:  3m 20s | Max:  3m 20s | Hits: 100%/611   
      🟩 Clang17            Pass: 100%/1   | Total:  3m 19s | Avg:  3m 19s | Max:  3m 19s | Hits: 100%/611   
      🟩 Clang18            Pass: 100%/1   | Total:  3m 22s | Avg:  3m 22s | Max:  3m 22s | Hits: 100%/611   
      🟩 Clang19            Pass: 100%/4   | Total: 17m 42s | Avg:  4m 25s | Max:  8m 44s | Hits: 100%/2444  
      🟩 GCC10              Pass: 100%/2   | Total:  7m 15s | Avg:  3m 37s | Max:  3m 38s | Hits:  99%/1224  
      🟩 GCC11              Pass: 100%/1   | Total:  3m 48s | Avg:  3m 48s | Max:  3m 48s | Hits:  99%/611   
      🟩 GCC12              Pass: 100%/1   | Total:  3m 55s | Avg:  3m 55s | Max:  3m 55s | Hits:  99%/611   
      🟩 GCC13              Pass: 100%/8   | Total: 44m 42s | Avg:  5m 35s | Max: 12m 36s | Hits:  99%/4888  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 11m 39s | Avg: 11m 39s | Max: 11m 39s | Hits:  95%/309   
      🟩 MSVC14.43          Pass: 100%/3   | Total: 33m 15s | Avg: 11m 05s | Max: 11m 37s | Hits:  95%/933   
      🟩 NVHPC25.5          Pass: 100%/2   | Total: 15m 16s | Avg:  7m 38s | Max:  7m 50s | Hits:  97%/1218  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/10  | Total: 37m 38s | Avg:  3m 45s | Max:  8m 44s | Hits: 100%/6112  
      🟩 GCC                Pass: 100%/12  | Total: 59m 40s | Avg:  4m 58s | Max: 12m 36s | Hits:  99%/7334  
      🟩 MSVC               Pass: 100%/4   | Total: 44m 54s | Avg: 11m 13s | Max: 11m 39s | Hits:  95%/1242  
      🟩 NVHPC              Pass: 100%/2   | Total: 15m 16s | Avg:  7m 38s | Max:  7m 50s | Hits:  97%/1218  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 13m 23s | Avg:  6m 41s | Max: 10m 08s | Hits:  99%/1222  
      🟩 rtx2080            Pass: 100%/26  | Total:  2h 24m | Avg:  5m 32s | Max: 12m 36s | Hits:  99%/14684 
    🟩 jobs
      🟩 Build              Pass: 100%/25  | Total:  2h 06m | Avg:  5m 02s | Max: 11m 39s | Hits:  99%/14073 
      🟩 Test               Pass: 100%/3   | Total: 31m 28s | Avg: 10m 29s | Max: 12m 36s | Hits:  99%/1833  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 13m 23s | Avg:  6m 41s | Max: 10m 08s | Hits:  99%/1222  
      🟩 90;90a             Pass: 100%/2   | Total: 14m 56s | Avg:  7m 28s | Max: 11m 09s | Hits:  98%/922   
      🟩 100;120            Pass: 100%/2   | Total: 14m 12s | Avg:  7m 06s | Max: 10m 29s | Hits:  98%/922   
    🟩 std
      🟩 17                 Pass: 100%/3   | Total: 14m 10s | Avg:  4m 43s | Max:  7m 50s | Hits:  99%/1831  
      🟩 20                 Pass: 100%/25  | Total:  2h 23m | Avg:  5m 43s | Max: 12m 36s | Hits:  99%/14075 
    
  • 🟩 packaging: Pass: 100%/4 | Total: 12m 18s | Avg: 3m 04s | Max: 3m 22s

    🟩 cpu
      🟩 amd64              Pass: 100%/4   | Total: 12m 18s | Avg:  3m 04s | Max:  3m 22s
    🟩 ctk
      🟩 12.0               Pass: 100%/2   | Total:  5m 42s | Avg:  2m 51s | Max:  2m 55s
      🟩 12.9               Pass: 100%/2   | Total:  6m 36s | Avg:  3m 18s | Max:  3m 22s
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/2   | Total:  5m 42s | Avg:  2m 51s | Max:  2m 55s
      🟩 nvcc12.9           Pass: 100%/2   | Total:  6m 36s | Avg:  3m 18s | Max:  3m 22s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 12m 18s | Avg:  3m 04s | Max:  3m 22s
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  2m 47s | Avg:  2m 47s | Max:  2m 47s
      🟩 Clang19            Pass: 100%/1   | Total:  3m 14s | Avg:  3m 14s | Max:  3m 14s
      🟩 GCC12              Pass: 100%/1   | Total:  2m 55s | Avg:  2m 55s | Max:  2m 55s
      🟩 GCC13              Pass: 100%/1   | Total:  3m 22s | Avg:  3m 22s | Max:  3m 22s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/2   | Total:  6m 01s | Avg:  3m 00s | Max:  3m 14s
      🟩 GCC                Pass: 100%/2   | Total:  6m 17s | Avg:  3m 08s | Max:  3m 22s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 12m 18s | Avg:  3m 04s | Max:  3m 22s
    🟩 jobs
      🟩 Test               Pass: 100%/4   | Total: 12m 18s | Avg:  3m 04s | Max:  3m 22s
    
  • 🟩 stdpar: Pass: 100%/4 | Total: 17m 30s | Avg: 4m 22s | Max: 4m 48s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  9m 19s | Avg:  4m 39s | Max:  4m 48s
      🟩 arm64              Pass: 100%/2   | Total:  8m 11s | Avg:  4m 05s | Max:  4m 08s
    🟩 ctk
      🟩 12.9               Pass: 100%/4   | Total: 17m 30s | Avg:  4m 22s | Max:  4m 48s
    🟩 cudacxx
      🟩 nvcc12.9           Pass: 100%/4   | Total: 17m 30s | Avg:  4m 22s | Max:  4m 48s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 17m 30s | Avg:  4m 22s | Max:  4m 48s
    🟩 cxx
      🟩 NVHPC25.5          Pass: 100%/4   | Total: 17m 30s | Avg:  4m 22s | Max:  4m 48s
    🟩 cxx_family
      🟩 NVHPC              Pass: 100%/4   | Total: 17m 30s | Avg:  4m 22s | Max:  4m 48s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 17m 30s | Avg:  4m 22s | Max:  4m 48s
    🟩 jobs
      🟩 Build              Pass: 100%/4   | Total: 17m 30s | Avg:  4m 22s | Max:  4m 48s
    🟩 std
      🟩 17                 Pass: 100%/2   | Total:  8m 34s | Avg:  4m 17s | Max:  4m 31s
      🟩 20                 Pass: 100%/2   | Total:  8m 56s | Avg:  4m 28s | Max:  4m 48s
    
  • 🟩 cccl_c_parallel: Pass: 100%/3 | Total: 26m 19s | Avg: 8m 46s | Max: 14m 09s | Hits: 98%/501

    🟩 cpu
      🟩 amd64              Pass: 100%/3   | Total: 26m 19s | Avg:  8m 46s | Max: 14m 09s | Hits:  98%/501   
    🟩 ctk
      🟩 12.9               Pass: 100%/3   | Total: 26m 19s | Avg:  8m 46s | Max: 14m 09s | Hits:  98%/501   
    🟩 cudacxx
      🟩 nvcc12.9           Pass: 100%/3   | Total: 26m 19s | Avg:  8m 46s | Max: 14m 09s | Hits:  98%/501   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/3   | Total: 26m 19s | Avg:  8m 46s | Max: 14m 09s | Hits:  98%/501   
    🟩 cxx
      🟩 GCC13              Pass: 100%/3   | Total: 26m 19s | Avg:  8m 46s | Max: 14m 09s | Hits:  98%/501   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/3   | Total: 26m 19s | Avg:  8m 46s | Max: 14m 09s | Hits:  98%/501   
    🟩 gpu
      🟩 h100               Pass: 100%/1   | Total: 14m 09s | Avg: 14m 09s | Max: 14m 09s | Hits:  98%/167   
      🟩 rtx2080            Pass: 100%/2   | Total: 12m 10s | Avg:  6m 05s | Max:  9m 39s | Hits:  98%/334   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 31s | Avg:  2m 31s | Max:  2m 31s | Hits:  97%/167   
      🟩 Test               Pass: 100%/2   | Total: 23m 48s | Avg: 11m 54s | Max: 14m 09s | Hits:  98%/334   
    

👃 Inspect Changes

Modifications in project?

      Project
+/-   CCCL Infrastructure
      CCCL Packaging
      libcu++
      CUB
      Thrust
      CUDA Experimental
      stdpar
      python
      CCCL C Parallel Library
      Catch2Helper

Modifications in project or dependencies?

      Project
+/-   CCCL Infrastructure
+/-   CCCL Packaging
+/-   libcu++
+/-   CUB
+/-   Thrust
+/-   CUDA Experimental
+/-   stdpar
+/-   python
+/-   CCCL C Parallel Library
+/-   Catch2Helper

🏃‍ Runner counts (total jobs: 205)

# Runner
128 linux-amd64-cpu16
23 windows-amd64-cpu16
14 linux-amd64-gpu-h100-latest-1
14 linux-amd64-gpu-rtxa6000-latest-1
12 linux-arm64-cpu16
11 linux-amd64-gpu-rtx2080-latest-1
3 linux-amd64-gpu-rtx4090-latest-1

@caugonnet
Contributor Author

/ok to test fa8c960

@github-actions
Contributor

🟨 CI finished in 1h 10m: Pass: 90%/205 | Total: 1d 13h | Avg: 10m 59s | Max: 39m 22s | Hits: 96%/340547
  • 🟥 python: Pass: 0%/18 | Total: 19m 10s | Avg: 1m 03s | Max: 9m 56s

    🟥 cpu
      🟥 amd64              Pass:   0%/18  | Total: 19m 10s | Avg:  1m 03s | Max:  9m 56s
    🟥 ctk
      🟥 12.9               Pass:   0%/18  | Total: 19m 10s | Avg:  1m 03s | Max:  9m 56s
    🟥 cudacxx
      🟥 nvcc12.9           Pass:   0%/18  | Total: 19m 10s | Avg:  1m 03s | Max:  9m 56s
    🟥 cudacxx_family
      🟥 nvcc               Pass:   0%/18  | Total: 19m 10s | Avg:  1m 03s | Max:  9m 56s
    🟥 cxx
      🟥 GCC13              Pass:   0%/18  | Total: 19m 10s | Avg:  1m 03s | Max:  9m 56s
    🟥 cxx_family
      🟥 GCC                Pass:   0%/18  | Total: 19m 10s | Avg:  1m 03s | Max:  9m 56s
    🟥 gpu
      🟥 h100               Pass:   0%/8  
      🟥 rtxa6000           Pass:   0%/10  | Total: 19m 10s | Avg:  1m 55s | Max:  9m 56s
    🟥 jobs
      🟥 Build cuda.cccl    Pass:   0%/2   | Total: 19m 10s | Avg:  9m 35s | Max:  9m 56s
      🟥 Test cuda.cccl.cooperative Pass:   0%/4  
      🟥 Test cuda.cccl.examples Pass:   0%/4  
      🟥 Test cuda.cccl.headers Pass:   0%/4  
      🟥 Test cuda.cccl.parallel Pass:   0%/4  
    🟥 py_version
      🟥 3.10               Pass:   0%/9   | Total:  9m 56s | Avg:  1m 06s | Max:  9m 56s
      🟥 3.13               Pass:   0%/9   | Total:  9m 14s | Avg:  1m 01s | Max:  9m 14s
    
  • 🟨 libcudacxx: Pass: 97%/48 | Total: 9h 11m | Avg: 11m 28s | Max: 34m 17s | Hits: 94%/166807

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  97%/46  | Total:  9h 01m | Avg: 11m 46s | Max: 34m 17s | Hits:  94%/159346
      🟩 arm64              Pass: 100%/2   | Total:  9m 11s | Avg:  4m 35s | Max:  4m 43s | Hits:  99%/7461  
    🔍 ctk: 12.9 🔍
      🟩 12.0               Pass: 100%/5   | Total: 46m 43s | Avg:  9m 20s | Max: 29m 33s | Hits:  99%/18287 
      🔍 12.9               Pass:  97%/43  | Total:  8h 24m | Avg: 11m 43s | Max: 34m 17s | Hits:  94%/148520
    🔍 cudacxx: nvcc12.9 🔍
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 52m 45s | Avg: 26m 22s | Max: 27m 01s | Hits:  29%/7425  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 46m 43s | Avg:  9m 20s | Max: 29m 33s | Hits:  99%/18287 
      🔍 nvcc12.9           Pass:  97%/41  | Total:  7h 31m | Avg: 11m 01s | Max: 34m 17s | Hits:  97%/141095
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total: 52m 45s | Avg: 26m 22s | Max: 27m 01s | Hits:  29%/7425  
      🔍 nvcc               Pass:  97%/46  | Total:  8h 18m | Avg: 10m 50s | Max: 34m 17s | Hits:  97%/159382
    🔍 cxx: GCC13 🔍
      🟩 Clang14            Pass: 100%/4   | Total: 19m 25s | Avg:  4m 51s | Max:  5m 18s | Hits:  99%/14806 
      🟩 Clang15            Pass: 100%/2   | Total: 10m 13s | Avg:  5m 06s | Max:  5m 08s | Hits:  99%/7421  
      🟩 Clang16            Pass: 100%/2   | Total: 10m 05s | Avg:  5m 02s | Max:  5m 08s | Hits:  99%/7421  
      🟩 Clang17            Pass: 100%/2   | Total:  9m 43s | Avg:  4m 51s | Max:  4m 52s | Hits:  99%/7421  
      🟩 Clang18            Pass: 100%/2   | Total:  9m 42s | Avg:  4m 51s | Max:  4m 57s | Hits:  99%/7421  
      🟩 Clang19            Pass: 100%/6   | Total:  1h 26m | Avg: 14m 27s | Max: 27m 01s | Hits:  75%/22306 
      🟩 GCC7               Pass: 100%/2   | Total:  8m 39s | Avg:  4m 19s | Max:  4m 41s | Hits:  99%/7357  
      🟩 GCC8               Pass: 100%/1   | Total:  4m 59s | Avg:  4m 59s | Max:  4m 59s | Hits:  99%/3689  
      🟩 GCC9               Pass: 100%/2   | Total:  9m 17s | Avg:  4m 38s | Max:  5m 02s | Hits:  99%/7369  
      🟩 GCC10              Pass: 100%/2   | Total: 39m 20s | Avg: 19m 40s | Max: 34m 17s | Hits:  68%/7423  
      🟩 GCC11              Pass: 100%/2   | Total:  9m 37s | Avg:  4m 48s | Max:  4m 49s | Hits:  99%/7419  
      🟩 GCC12              Pass: 100%/2   | Total: 10m 20s | Avg:  5m 10s | Max:  5m 16s | Hits:  99%/7423  
      🔍 GCC13              Pass:  90%/11  | Total:  1h 59m | Avg: 10m 50s | Max: 22m 56s | Hits:  99%/30162 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 00m | Avg: 30m 07s | Max: 30m 42s | Hits:  99%/7093  
      🟩 MSVC14.43          Pass: 100%/4   | Total:  2h 02m | Avg: 30m 34s | Max: 31m 42s | Hits:  99%/14669 
      🟩 NVHPC25.5          Pass: 100%/2   | Total: 21m 23s | Avg: 10m 41s | Max: 10m 55s | Hits:  98%/7407  
    🔍 cxx_family: GCC 🔍
      🟩 Clang              Pass: 100%/18  | Total:  2h 25m | Avg:  8m 06s | Max: 27m 01s | Hits:  91%/66796 
      🔍 GCC                Pass:  95%/22  | Total:  3h 21m | Avg:  9m 09s | Max: 34m 17s | Hits:  96%/70842 
      🟩 MSVC               Pass: 100%/6   | Total:  3h 02m | Avg: 30m 25s | Max: 31m 42s | Hits:  99%/21762 
      🟩 NVHPC              Pass: 100%/2   | Total: 21m 23s | Avg: 10m 41s | Max: 10m 55s | Hits:  98%/7407  
    🔍 gpu: rtx2080 🔍
      🟩 h100               Pass: 100%/2   | Total: 27m 50s | Avg: 13m 55s | Max: 22m 56s | Hits:  99%/7628  
      🔍 rtx2080            Pass:  97%/46  | Total:  8h 43m | Avg: 11m 22s | Max: 34m 17s | Hits:  94%/159179
    🔍 jobs: NVRTC 🔍
      🟩 Build              Pass: 100%/42  | Total:  7h 23m | Avg: 10m 33s | Max: 34m 17s | Hits:  94%/155511
      🔍 NVRTC              Pass:  50%/2   | Total: 42m 52s | Avg: 21m 26s | Max: 22m 09s | Hits:  90%/21    
      🟩 Test               Pass: 100%/3   | Total:  1h 03m | Avg: 21m 04s | Max: 22m 56s | Hits:  99%/11275 
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  1m 52s | Avg:  1m 52s | Max:  1m 52s
    🔍 sm: 75 🔍
      🔍 75                 Pass:  50%/2   | Total: 42m 52s | Avg: 21m 26s | Max: 22m 09s | Hits:  90%/21    
      🟩 90                 Pass: 100%/2   | Total: 27m 50s | Avg: 13m 55s | Max: 22m 56s | Hits:  99%/7628  
      🟩 90;90a             Pass: 100%/2   | Total: 33m 44s | Avg: 16m 52s | Max: 28m 17s | Hits:  99%/7575  
      🟩 100;120            Pass: 100%/2   | Total: 36m 03s | Avg: 18m 01s | Max: 31m 07s | Hits:  99%/7575  
    🔍 std: 17 🔍
      🔍 17                 Pass:  95%/22  | Total:  3h 47m | Avg: 10m 21s | Max: 31m 11s | Hits:  96%/77024 
      🟩 20                 Pass: 100%/25  | Total:  5h 21m | Avg: 12m 51s | Max: 34m 17s | Hits:  93%/89783 
    
  • 🟩 cub: Pass: 100%/50 | Total: 12h 26m | Avg: 14m 55s | Max: 39m 22s | Hits: 99%/61706

    🟩 cpu
      🟩 amd64              Pass: 100%/48  | Total: 12h 12m | Avg: 15m 15s | Max: 39m 22s | Hits:  99%/59190 
      🟩 arm64              Pass: 100%/2   | Total: 14m 35s | Avg:  7m 17s | Max:  8m 33s | Hits:  99%/2516  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  1h 00m | Avg: 12m 11s | Max: 32m 09s | Hits:  99%/6186  
      🟩 12.9               Pass: 100%/45  | Total: 11h 25m | Avg: 15m 14s | Max: 39m 22s | Hits:  99%/55520 
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 10m 35s | Avg:  5m 17s | Max:  5m 18s | Hits:  99%/2165  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  1h 00m | Avg: 12m 11s | Max: 32m 09s | Hits:  99%/6186  
      🟩 nvcc12.9           Pass: 100%/43  | Total: 11h 15m | Avg: 15m 41s | Max: 39m 22s | Hits:  99%/53355 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 35s | Avg:  5m 17s | Max:  5m 18s | Hits:  99%/2165  
      🟩 nvcc               Pass: 100%/48  | Total: 12h 16m | Avg: 15m 20s | Max: 39m 22s | Hits:  99%/59541 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 26m 31s | Avg:  6m 37s | Max:  6m 59s | Hits:  99%/5034  
      🟩 Clang15            Pass: 100%/2   | Total: 14m 01s | Avg:  7m 00s | Max:  7m 01s | Hits:  99%/2513  
      🟩 Clang16            Pass: 100%/2   | Total: 13m 34s | Avg:  6m 47s | Max:  6m 55s | Hits:  99%/2513  
      🟩 Clang17            Pass: 100%/2   | Total: 14m 17s | Avg:  7m 08s | Max:  7m 21s | Hits:  99%/2513  
      🟩 Clang18            Pass: 100%/2   | Total: 13m 23s | Avg:  6m 41s | Max:  6m 45s | Hits:  99%/2513  
      🟩 Clang19            Pass: 100%/7   | Total:  1h 34m | Avg: 13m 31s | Max: 32m 06s | Hits:  99%/8449  
      🟩 GCC7               Pass: 100%/2   | Total: 16m 28s | Avg:  8m 14s | Max:  8m 27s | Hits:  99%/2516  
      🟩 GCC8               Pass: 100%/1   | Total:  8m 30s | Avg:  8m 30s | Max:  8m 30s | Hits:  99%/1258  
      🟩 GCC9               Pass: 100%/2   | Total: 17m 11s | Avg:  8m 35s | Max:  8m 56s | Hits:  99%/2516  
      🟩 GCC10              Pass: 100%/2   | Total: 17m 44s | Avg:  8m 52s | Max:  9m 02s | Hits:  99%/2517  
      🟩 GCC11              Pass: 100%/2   | Total: 17m 37s | Avg:  8m 48s | Max:  8m 56s | Hits:  99%/2513  
      🟩 GCC12              Pass: 100%/2   | Total: 19m 42s | Avg:  9m 51s | Max:  9m 55s | Hits:  99%/2513  
      🟩 GCC13              Pass: 100%/12  | Total:  4h 25m | Avg: 22m 08s | Max: 39m 22s | Hits:  99%/15105 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 01m | Avg: 30m 51s | Max: 32m 09s | Hits:  99%/2306  
      🟩 MSVC14.43          Pass: 100%/4   | Total:  1h 59m | Avg: 29m 51s | Max: 33m 28s | Hits:  99%/4612  
      🟩 NVHPC25.5          Pass: 100%/2   | Total: 26m 08s | Avg: 13m 04s | Max: 13m 19s | Hits:  98%/2315  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  2h 56m | Avg:  9m 17s | Max: 32m 06s | Hits:  99%/23535 
      🟩 GCC                Pass: 100%/23  | Total:  6h 02m | Avg: 15m 46s | Max: 39m 22s | Hits:  99%/28938 
      🟩 MSVC               Pass: 100%/6   | Total:  3h 01m | Avg: 30m 11s | Max: 33m 28s | Hits:  99%/6918  
      🟩 NVHPC              Pass: 100%/2   | Total: 26m 08s | Avg: 13m 04s | Max: 13m 19s | Hits:  98%/2315  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total:  1h 16m | Avg: 25m 37s | Max: 36m 05s | Hits:  99%/3777  
      🟩 rtx2080            Pass: 100%/39  | Total:  7h 23m | Avg: 11m 22s | Max: 33m 28s | Hits:  99%/47863 
      🟩 rtxa6000           Pass: 100%/8   | Total:  3h 46m | Avg: 28m 16s | Max: 39m 22s | Hits:  99%/10066 
    🟩 jobs
      🟩 Build              Pass: 100%/42  | Total:  7h 46m | Avg: 11m 06s | Max: 33m 28s | Hits:  99%/51638 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 38m 47s | Avg: 38m 47s | Max: 38m 47s | Hits:  99%/1259  
      🟩 GraphCapture       Pass: 100%/1   | Total: 30m 49s | Avg: 30m 49s | Max: 30m 49s | Hits:  99%/1259  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 47m | Avg: 35m 49s | Max: 39m 22s | Hits:  99%/3775  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 43m | Avg: 34m 20s | Max: 36m 34s | Hits:  99%/3775  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total:  1h 16m | Avg: 25m 37s | Max: 36m 05s | Hits:  99%/3777  
      🟩 90;90a             Pass: 100%/2   | Total: 34m 31s | Avg: 17m 15s | Max: 26m 45s | Hits:  99%/2412  
      🟩 100;120            Pass: 100%/2   | Total: 35m 18s | Avg: 17m 39s | Max: 27m 07s | Hits:  99%/2412  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total:  4h 01m | Avg: 11m 29s | Max: 33m 28s | Hits:  99%/25810 
      🟩 20                 Pass: 100%/29  | Total:  8h 25m | Avg: 17m 25s | Max: 39m 22s | Hits:  99%/35896 
    
  • 🟩 thrust: Pass: 100%/50 | Total: 9h 20m | Avg: 11m 12s | Max: 37m 02s | Hits: 99%/95621

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 21m 12s | Avg: 10m 36s | Max: 13m 15s | Hits:  99%/3828  
    🟩 cpu
      🟩 amd64              Pass: 100%/48  | Total:  9h 08m | Avg: 11m 25s | Max: 37m 02s | Hits:  99%/91794 
      🟩 arm64              Pass: 100%/2   | Total: 12m 06s | Avg:  6m 03s | Max:  6m 56s | Hits:  99%/3827  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 52m 47s | Avg: 10m 33s | Max: 28m 13s | Hits:  99%/9560  
      🟩 12.9               Pass: 100%/45  | Total:  8h 27m | Avg: 11m 17s | Max: 37m 02s | Hits:  99%/86061 
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 11m 25s | Avg:  5m 42s | Max:  5m 51s | Hits: 100%/3826  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 52m 47s | Avg: 10m 33s | Max: 28m 13s | Hits:  99%/9560  
      🟩 nvcc12.9           Pass: 100%/43  | Total:  8h 16m | Avg: 11m 32s | Max: 37m 02s | Hits:  99%/82235 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 11m 25s | Avg:  5m 42s | Max:  5m 51s | Hits: 100%/3826  
      🟩 nvcc               Pass: 100%/48  | Total:  9h 09m | Avg: 11m 26s | Max: 37m 02s | Hits:  99%/91795 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 22m 29s | Avg:  5m 37s | Max:  6m 08s | Hits: 100%/7652  
      🟩 Clang15            Pass: 100%/2   | Total: 11m 45s | Avg:  5m 52s | Max:  5m 54s | Hits: 100%/3826  
      🟩 Clang16            Pass: 100%/2   | Total: 12m 13s | Avg:  6m 06s | Max:  6m 23s | Hits: 100%/3826  
      🟩 Clang17            Pass: 100%/2   | Total: 11m 42s | Avg:  5m 51s | Max:  5m 58s | Hits: 100%/3826  
      🟩 Clang18            Pass: 100%/2   | Total: 11m 40s | Avg:  5m 50s | Max:  6m 02s | Hits: 100%/3826  
      🟩 Clang19            Pass: 100%/7   | Total: 47m 00s | Avg:  6m 42s | Max: 10m 02s | Hits: 100%/13391 
      🟩 GCC7               Pass: 100%/2   | Total: 14m 15s | Avg:  7m 07s | Max:  7m 12s | Hits:  99%/3828  
      🟩 GCC8               Pass: 100%/1   | Total:  7m 01s | Avg:  7m 01s | Max:  7m 01s | Hits:  99%/1914  
      🟩 GCC9               Pass: 100%/2   | Total: 14m 16s | Avg:  7m 08s | Max:  7m 17s | Hits:  99%/3828  
      🟩 GCC10              Pass: 100%/2   | Total: 15m 34s | Avg:  7m 47s | Max:  8m 17s | Hits:  99%/3828  
      🟩 GCC11              Pass: 100%/2   | Total: 16m 45s | Avg:  8m 22s | Max:  9m 11s | Hits:  99%/3828  
      🟩 GCC12              Pass: 100%/2   | Total: 15m 55s | Avg:  7m 57s | Max:  8m 04s | Hits:  99%/3828  
      🟩 GCC13              Pass: 100%/11  | Total:  1h 41m | Avg:  9m 12s | Max: 15m 07s | Hits:  99%/21054 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 55m 50s | Avg: 27m 55s | Max: 28m 13s | Hits:  99%/3812  
      🟩 MSVC14.43          Pass: 100%/5   | Total:  2h 27m | Avg: 29m 31s | Max: 37m 02s | Hits:  99%/9530  
      🟩 NVHPC25.5          Pass: 100%/2   | Total: 55m 24s | Avg: 27m 42s | Max: 28m 21s | Hits:  99%/3824  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  1h 56m | Avg:  6m 08s | Max: 10m 02s | Hits: 100%/36347 
      🟩 GCC                Pass: 100%/22  | Total:  3h 05m | Avg:  8m 24s | Max: 15m 07s | Hits:  99%/42108 
      🟩 MSVC               Pass: 100%/7   | Total:  3h 23m | Avg: 29m 04s | Max: 37m 02s | Hits:  99%/13342 
      🟩 NVHPC              Pass: 100%/2   | Total: 55m 24s | Avg: 27m 42s | Max: 28m 21s | Hits:  99%/3824  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 20m 53s | Avg: 10m 26s | Max: 15m 07s | Hits:  99%/3828  
      🟩 rtx2080            Pass: 100%/38  | Total:  6h 37m | Avg: 10m 28s | Max: 32m 11s | Hits:  99%/72672 
      🟩 rtx4090            Pass: 100%/10  | Total:  2h 22m | Avg: 14m 12s | Max: 37m 02s | Hits:  99%/19121 
    🟩 jobs
      🟩 Build              Pass: 100%/43  | Total:  7h 34m | Avg: 10m 34s | Max: 32m 11s | Hits:  99%/82233 
      🟩 TestCPU            Pass: 100%/3   | Total: 54m 40s | Avg: 18m 13s | Max: 37m 02s | Hits:  99%/5733  
      🟩 TestGPU            Pass: 100%/4   | Total: 51m 33s | Avg: 12m 53s | Max: 15m 07s | Hits:  99%/7655  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 20m 53s | Avg: 10m 26s | Max: 15m 07s | Hits:  99%/3828  
      🟩 90;90a             Pass: 100%/2   | Total: 31m 24s | Avg: 15m 42s | Max: 24m 29s | Hits:  99%/3820  
      🟩 100;120            Pass: 100%/2   | Total: 32m 04s | Avg: 16m 02s | Max: 25m 02s | Hits:  99%/3820  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total:  3h 52m | Avg: 11m 03s | Max: 32m 11s | Hits:  99%/40160 
      🟩 20                 Pass: 100%/27  | Total:  5h 07m | Avg: 11m 22s | Max: 37m 02s | Hits:  99%/51633 
    
  • 🟩 cudax: Pass: 100%/28 | Total: 5h 17m | Avg: 11m 20s | Max: 21m 47s | Hits: 85%/15906

    🟩 cpu
      🟩 amd64              Pass: 100%/24  | Total:  4h 37m | Avg: 11m 34s | Max: 21m 47s | Hits:  85%/13462 
      🟩 arm64              Pass: 100%/4   | Total: 39m 49s | Avg:  9m 57s | Max: 11m 11s | Hits:  82%/2444  
    🟩 ctk
      🟩 12.0               Pass: 100%/3   | Total: 31m 04s | Avg: 10m 21s | Max: 11m 21s | Hits:  85%/1531  
      🟩 12.9               Pass: 100%/25  | Total:  4h 46m | Avg: 11m 27s | Max: 21m 47s | Hits:  85%/14375 
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/3   | Total: 31m 04s | Avg: 10m 21s | Max: 11m 21s | Hits:  85%/1531  
      🟩 nvcc12.9           Pass: 100%/25  | Total:  4h 46m | Avg: 11m 27s | Max: 21m 47s | Hits:  85%/14375 
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/28  | Total:  5h 17m | Avg: 11m 20s | Max: 21m 47s | Hits:  85%/15906 
    🟩 cxx
      🟩 Clang14            Pass: 100%/2   | Total: 18m 49s | Avg:  9m 24s | Max:  9m 33s | Hits:  82%/1224  
      🟩 Clang15            Pass: 100%/1   | Total: 10m 25s | Avg: 10m 25s | Max: 10m 25s | Hits:  82%/611   
      🟩 Clang16            Pass: 100%/1   | Total: 10m 59s | Avg: 10m 59s | Max: 10m 59s | Hits:  82%/611   
      🟩 Clang17            Pass: 100%/1   | Total: 10m 43s | Avg: 10m 43s | Max: 10m 43s | Hits:  82%/611   
      🟩 Clang18            Pass: 100%/1   | Total: 10m 42s | Avg: 10m 42s | Max: 10m 42s | Hits:  82%/611   
      🟩 Clang19            Pass: 100%/4   | Total: 38m 06s | Avg:  9m 31s | Max: 10m 34s | Hits:  87%/2444  
      🟩 GCC10              Pass: 100%/2   | Total: 22m 24s | Avg: 11m 12s | Max: 11m 57s | Hits:  82%/1224  
      🟩 GCC11              Pass: 100%/1   | Total: 11m 04s | Avg: 11m 04s | Max: 11m 04s | Hits:  82%/611   
      🟩 GCC12              Pass: 100%/1   | Total: 12m 00s | Avg: 12m 00s | Max: 12m 00s | Hits:  82%/611   
      🟩 GCC13              Pass: 100%/8   | Total:  1h 23m | Avg: 10m 29s | Max: 12m 26s | Hits:  86%/4888  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 11m 21s | Avg: 11m 21s | Max: 11m 21s | Hits:  95%/309   
      🟩 MSVC14.43          Pass: 100%/3   | Total: 35m 23s | Avg: 11m 47s | Max: 12m 29s | Hits:  96%/933   
      🟩 NVHPC25.5          Pass: 100%/2   | Total: 41m 43s | Avg: 20m 51s | Max: 21m 47s | Hits:  80%/1218  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/10  | Total:  1h 39m | Avg:  9m 58s | Max: 10m 59s | Hits:  84%/6112  
      🟩 GCC                Pass: 100%/12  | Total:  2h 09m | Avg: 10m 47s | Max: 12m 26s | Hits:  85%/7334  
      🟩 MSVC               Pass: 100%/4   | Total: 46m 44s | Avg: 11m 41s | Max: 12m 29s | Hits:  95%/1242  
      🟩 NVHPC              Pass: 100%/2   | Total: 41m 43s | Avg: 20m 51s | Max: 21m 47s | Hits:  80%/1218  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 18m 17s | Avg:  9m 08s | Max:  9m 11s | Hits:  91%/1222  
      🟩 rtx2080            Pass: 100%/26  | Total:  4h 59m | Avg: 11m 30s | Max: 21m 47s | Hits:  85%/14684 
    🟩 jobs
      🟩 Build              Pass: 100%/25  | Total:  4h 48m | Avg: 11m 32s | Max: 21m 47s | Hits:  83%/14073 
      🟩 Test               Pass: 100%/3   | Total: 29m 14s | Avg:  9m 44s | Max: 10m 59s | Hits:  99%/1833  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 18m 17s | Avg:  9m 08s | Max:  9m 11s | Hits:  91%/1222  
      🟩 90;90a             Pass: 100%/2   | Total: 21m 27s | Avg: 10m 43s | Max: 10m 47s | Hits:  87%/922   
      🟩 100;120            Pass: 100%/2   | Total: 22m 21s | Avg: 11m 10s | Max: 12m 07s | Hits:  86%/922   
    🟩 std
      🟩 17                 Pass: 100%/3   | Total: 39m 02s | Avg: 13m 00s | Max: 19m 56s | Hits:  81%/1831  
      🟩 20                 Pass: 100%/25  | Total:  4h 38m | Avg: 11m 08s | Max: 21m 47s | Hits:  85%/14075 
    
  • 🟩 packaging: Pass: 100%/4 | Total: 14m 55s | Avg: 3m 43s | Max: 4m 02s

    🟩 cpu
      🟩 amd64              Pass: 100%/4   | Total: 14m 55s | Avg:  3m 43s | Max:  4m 02s
    🟩 ctk
      🟩 12.0               Pass: 100%/2   | Total:  7m 27s | Avg:  3m 43s | Max:  3m 51s
      🟩 12.9               Pass: 100%/2   | Total:  7m 28s | Avg:  3m 44s | Max:  4m 02s
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/2   | Total:  7m 27s | Avg:  3m 43s | Max:  3m 51s
      🟩 nvcc12.9           Pass: 100%/2   | Total:  7m 28s | Avg:  3m 44s | Max:  4m 02s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 14m 55s | Avg:  3m 43s | Max:  4m 02s
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  3m 51s | Avg:  3m 51s | Max:  3m 51s
      🟩 Clang19            Pass: 100%/1   | Total:  4m 02s | Avg:  4m 02s | Max:  4m 02s
      🟩 GCC12              Pass: 100%/1   | Total:  3m 36s | Avg:  3m 36s | Max:  3m 36s
      🟩 GCC13              Pass: 100%/1   | Total:  3m 26s | Avg:  3m 26s | Max:  3m 26s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/2   | Total:  7m 53s | Avg:  3m 56s | Max:  4m 02s
      🟩 GCC                Pass: 100%/2   | Total:  7m 02s | Avg:  3m 31s | Max:  3m 36s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 14m 55s | Avg:  3m 43s | Max:  4m 02s
    🟩 jobs
      🟩 Test               Pass: 100%/4   | Total: 14m 55s | Avg:  3m 43s | Max:  4m 02s
    
  • 🟩 stdpar: Pass: 100%/4 | Total: 16m 14s | Avg: 4m 03s | Max: 4m 24s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  8m 40s | Avg:  4m 20s | Max:  4m 24s
      🟩 arm64              Pass: 100%/2   | Total:  7m 34s | Avg:  3m 47s | Max:  3m 53s
    🟩 ctk
      🟩 12.9               Pass: 100%/4   | Total: 16m 14s | Avg:  4m 03s | Max:  4m 24s
    🟩 cudacxx
      🟩 nvcc12.9           Pass: 100%/4   | Total: 16m 14s | Avg:  4m 03s | Max:  4m 24s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 16m 14s | Avg:  4m 03s | Max:  4m 24s
    🟩 cxx
      🟩 NVHPC25.5          Pass: 100%/4   | Total: 16m 14s | Avg:  4m 03s | Max:  4m 24s
    🟩 cxx_family
      🟩 NVHPC              Pass: 100%/4   | Total: 16m 14s | Avg:  4m 03s | Max:  4m 24s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 16m 14s | Avg:  4m 03s | Max:  4m 24s
    🟩 jobs
      🟩 Build              Pass: 100%/4   | Total: 16m 14s | Avg:  4m 03s | Max:  4m 24s
    🟩 std
      🟩 17                 Pass: 100%/2   | Total:  7m 57s | Avg:  3m 58s | Max:  4m 16s
      🟩 20                 Pass: 100%/2   | Total:  8m 17s | Avg:  4m 08s | Max:  4m 24s
    
  • 🟩 cccl_c_parallel: Pass: 100%/3 | Total: 28m 11s | Avg: 9m 23s | Max: 15m 03s | Hits: 98%/507

    🟩 cpu
      🟩 amd64              Pass: 100%/3   | Total: 28m 11s | Avg:  9m 23s | Max: 15m 03s | Hits:  98%/507   
    🟩 ctk
      🟩 12.9               Pass: 100%/3   | Total: 28m 11s | Avg:  9m 23s | Max: 15m 03s | Hits:  98%/507   
    🟩 cudacxx
      🟩 nvcc12.9           Pass: 100%/3   | Total: 28m 11s | Avg:  9m 23s | Max: 15m 03s | Hits:  98%/507   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/3   | Total: 28m 11s | Avg:  9m 23s | Max: 15m 03s | Hits:  98%/507   
    🟩 cxx
      🟩 GCC13              Pass: 100%/3   | Total: 28m 11s | Avg:  9m 23s | Max: 15m 03s | Hits:  98%/507   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/3   | Total: 28m 11s | Avg:  9m 23s | Max: 15m 03s | Hits:  98%/507   
    🟩 gpu
      🟩 h100               Pass: 100%/1   | Total: 15m 03s | Avg: 15m 03s | Max: 15m 03s | Hits:  98%/169   
      🟩 rtx2080            Pass: 100%/2   | Total: 13m 08s | Avg:  6m 34s | Max: 10m 25s | Hits:  97%/338   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 43s | Avg:  2m 43s | Max:  2m 43s | Hits:  96%/169   
      🟩 Test               Pass: 100%/2   | Total: 25m 28s | Avg: 12m 44s | Max: 15m 03s | Hits:  98%/338   
    

👃 Inspect Changes

Modifications in project?

      Project
+/-   CCCL Infrastructure
      CCCL Packaging
      libcu++
      CUB
      Thrust
+/-   CUDA Experimental
      stdpar
      python
      CCCL C Parallel Library
      Catch2Helper

Modifications in project or dependencies?

      Project
+/-   CCCL Infrastructure
+/-   CCCL Packaging
+/-   libcu++
+/-   CUB
+/-   Thrust
+/-   CUDA Experimental
+/-   stdpar
+/-   python
+/-   CCCL C Parallel Library
+/-   Catch2Helper

🏃‍ Runner counts (total jobs: 205)

# Runner
128 linux-amd64-cpu16
23 windows-amd64-cpu16
14 linux-amd64-gpu-h100-latest-1
14 linux-amd64-gpu-rtxa6000-latest-1
12 linux-arm64-cpu16
11 linux-amd64-gpu-rtx2080-latest-1
3 linux-amd64-gpu-rtx4090-latest-1

@caugonnet caugonnet requested a review from a team as a code owner July 20, 2025 08:03
@caugonnet caugonnet requested a review from gonidelis July 20, 2025 08:03
@caugonnet
Contributor Author

Rebased on #5319 (low-level cuda_kernel API) and #5215 (CUfunction support in cuda_kernel).

@caugonnet
Contributor Author

/ok to test f3b57da

@github-actions
Contributor

🟨 CI finished in 1h 05m: Pass: 89%/205 | Total: 1d 13h | Avg: 10m 53s | Max: 40m 23s | Hits: 95%/340359
  • 🟥 python: Pass: 0%/18 | Total: 18m 25s | Avg: 1m 01s | Max: 9m 21s

    🟥 cpu
      🟥 amd64              Pass:   0%/18  | Total: 18m 25s | Avg:  1m 01s | Max:  9m 21s
    🟥 ctk
      🟥 12.9               Pass:   0%/18  | Total: 18m 25s | Avg:  1m 01s | Max:  9m 21s
    🟥 cudacxx
      🟥 nvcc12.9           Pass:   0%/18  | Total: 18m 25s | Avg:  1m 01s | Max:  9m 21s
    🟥 cudacxx_family
      🟥 nvcc               Pass:   0%/18  | Total: 18m 25s | Avg:  1m 01s | Max:  9m 21s
    🟥 cxx
      🟥 GCC13              Pass:   0%/18  | Total: 18m 25s | Avg:  1m 01s | Max:  9m 21s
    🟥 cxx_family
      🟥 GCC                Pass:   0%/18  | Total: 18m 25s | Avg:  1m 01s | Max:  9m 21s
    🟥 gpu
      🟥 h100               Pass:   0%/8  
      🟥 rtxa6000           Pass:   0%/10  | Total: 18m 25s | Avg:  1m 50s | Max:  9m 21s
    🟥 jobs
      🟥 Build cuda.cccl    Pass:   0%/2   | Total: 18m 25s | Avg:  9m 12s | Max:  9m 21s
      🟥 Test cuda.cccl.cooperative Pass:   0%/4  
      🟥 Test cuda.cccl.examples Pass:   0%/4  
      🟥 Test cuda.cccl.headers Pass:   0%/4  
      🟥 Test cuda.cccl.parallel Pass:   0%/4  
    🟥 py_version
      🟥 3.10               Pass:   0%/9   | Total:  9m 04s | Avg:  1m 00s | Max:  9m 04s
      🟥 3.13               Pass:   0%/9   | Total:  9m 21s | Avg:  1m 02s | Max:  9m 21s
    
  • 🟥 cccl_c_parallel: Pass: 0%/3 | Total: 2m 07s | Avg: 0m 42s | Max: 2m 07s

    🟥 cpu
      🟥 amd64              Pass:   0%/3   | Total:  2m 07s | Avg:  0m 42s | Max:  2m 07s
    🟥 ctk
      🟥 12.9               Pass:   0%/3   | Total:  2m 07s | Avg:  0m 42s | Max:  2m 07s
    🟥 cudacxx
      🟥 nvcc12.9           Pass:   0%/3   | Total:  2m 07s | Avg:  0m 42s | Max:  2m 07s
    🟥 cudacxx_family
      🟥 nvcc               Pass:   0%/3   | Total:  2m 07s | Avg:  0m 42s | Max:  2m 07s
    🟥 cxx
      🟥 GCC13              Pass:   0%/3   | Total:  2m 07s | Avg:  0m 42s | Max:  2m 07s
    🟥 cxx_family
      🟥 GCC                Pass:   0%/3   | Total:  2m 07s | Avg:  0m 42s | Max:  2m 07s
    🟥 gpu
      🟥 h100               Pass:   0%/1  
      🟥 rtx2080            Pass:   0%/2   | Total:  2m 07s | Avg:  1m 03s | Max:  2m 07s
    🟥 jobs
      🟥 Build              Pass:   0%/1   | Total:  2m 07s | Avg:  2m 07s | Max:  2m 07s
      🟥 Test               Pass:   0%/2  
    
  • 🟩 cub: Pass: 100%/50 | Total: 12h 16m | Avg: 14m 43s | Max: 40m 23s | Hits: 99%/61956

    🟩 cpu
      🟩 amd64              Pass: 100%/48  | Total: 12h 01m | Avg: 15m 02s | Max: 40m 23s | Hits:  99%/59430 
      🟩 arm64              Pass: 100%/2   | Total: 14m 32s | Avg:  7m 16s | Max:  8m 27s | Hits:  99%/2526  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 58m 55s | Avg: 11m 47s | Max: 30m 19s | Hits:  99%/6211  
      🟩 12.9               Pass: 100%/45  | Total: 11h 17m | Avg: 15m 03s | Max: 40m 23s | Hits:  99%/55745 
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 10m 46s | Avg:  5m 23s | Max:  5m 24s | Hits:  99%/2175  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 58m 55s | Avg: 11m 47s | Max: 30m 19s | Hits:  99%/6211  
      🟩 nvcc12.9           Pass: 100%/43  | Total: 11h 06m | Avg: 15m 30s | Max: 40m 23s | Hits:  99%/53570 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 46s | Avg:  5m 23s | Max:  5m 24s | Hits:  99%/2175  
      🟩 nvcc               Pass: 100%/48  | Total: 12h 05m | Avg: 15m 07s | Max: 40m 23s | Hits:  99%/59781 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 26m 14s | Avg:  6m 33s | Max:  6m 59s | Hits:  99%/5054  
      🟩 Clang15            Pass: 100%/2   | Total: 14m 11s | Avg:  7m 05s | Max:  7m 21s | Hits:  99%/2523  
      🟩 Clang16            Pass: 100%/2   | Total: 13m 53s | Avg:  6m 56s | Max:  7m 01s | Hits:  99%/2523  
      🟩 Clang17            Pass: 100%/2   | Total: 13m 42s | Avg:  6m 51s | Max:  6m 55s | Hits:  99%/2523  
      🟩 Clang18            Pass: 100%/2   | Total: 13m 17s | Avg:  6m 38s | Max:  6m 41s | Hits:  99%/2523  
      🟩 Clang19            Pass: 100%/7   | Total:  1h 34m | Avg: 13m 33s | Max: 32m 28s | Hits:  99%/8484  
      🟩 GCC7               Pass: 100%/2   | Total: 16m 18s | Avg:  8m 09s | Max:  8m 26s | Hits:  99%/2526  
      🟩 GCC8               Pass: 100%/1   | Total:  8m 22s | Avg:  8m 22s | Max:  8m 22s | Hits:  99%/1263  
      🟩 GCC9               Pass: 100%/2   | Total: 17m 05s | Avg:  8m 32s | Max:  8m 44s | Hits:  99%/2526  
      🟩 GCC10              Pass: 100%/2   | Total: 18m 28s | Avg:  9m 14s | Max:  9m 27s | Hits:  99%/2527  
      🟩 GCC11              Pass: 100%/2   | Total: 18m 04s | Avg:  9m 02s | Max:  9m 15s | Hits:  99%/2523  
      🟩 GCC12              Pass: 100%/2   | Total: 18m 03s | Avg:  9m 01s | Max:  9m 15s | Hits:  99%/2523  
      🟩 GCC13              Pass: 100%/12  | Total:  4h 21m | Avg: 21m 47s | Max: 40m 23s | Hits:  99%/15165 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 00m | Avg: 30m 26s | Max: 30m 33s | Hits:  99%/2316  
      🟩 MSVC14.43          Pass: 100%/4   | Total:  1h 56m | Avg: 29m 06s | Max: 31m 48s | Hits:  99%/4632  
      🟩 NVHPC25.5          Pass: 100%/2   | Total: 25m 18s | Avg: 12m 39s | Max: 12m 49s | Hits:  98%/2325  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  2h 56m | Avg:  9m 16s | Max: 32m 28s | Hits:  99%/23630 
      🟩 GCC                Pass: 100%/23  | Total:  5h 57m | Avg: 15m 33s | Max: 40m 23s | Hits:  99%/29053 
      🟩 MSVC               Pass: 100%/6   | Total:  2h 57m | Avg: 29m 32s | Max: 31m 48s | Hits:  99%/6948  
      🟩 NVHPC              Pass: 100%/2   | Total: 25m 18s | Avg: 12m 39s | Max: 12m 49s | Hits:  98%/2325  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total:  1h 11m | Avg: 23m 46s | Max: 34m 12s | Hits:  99%/3792  
      🟩 rtx2080            Pass: 100%/39  | Total:  7h 17m | Avg: 11m 12s | Max: 31m 48s | Hits:  99%/48058 
      🟩 rtxa6000           Pass: 100%/8   | Total:  3h 47m | Avg: 28m 28s | Max: 40m 23s | Hits:  99%/10106 
    🟩 jobs
      🟩 Build              Pass: 100%/42  | Total:  7h 40m | Avg: 10m 57s | Max: 31m 48s | Hits:  99%/51848 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 38m 59s | Avg: 38m 59s | Max: 38m 59s | Hits:  99%/1264  
      🟩 GraphCapture       Pass: 100%/1   | Total: 31m 11s | Avg: 31m 11s | Max: 31m 11s | Hits:  99%/1264  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 43m | Avg: 34m 25s | Max: 40m 23s | Hits:  99%/3790  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 43m | Avg: 34m 20s | Max: 36m 38s | Hits:  99%/3790  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total:  1h 11m | Avg: 23m 46s | Max: 34m 12s | Hits:  99%/3792  
      🟩 90;90a             Pass: 100%/2   | Total: 34m 23s | Avg: 17m 11s | Max: 26m 41s | Hits:  99%/2422  
      🟩 100;120            Pass: 100%/2   | Total: 34m 33s | Avg: 17m 16s | Max: 26m 14s | Hits:  99%/2422  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total:  3h 56m | Avg: 11m 14s | Max: 31m 48s | Hits:  99%/25915 
      🟩 20                 Pass: 100%/29  | Total:  8h 20m | Avg: 17m 15s | Max: 40m 23s | Hits:  99%/36041 
    
  • 🟩 thrust: Pass: 100%/50 | Total: 9h 24m | Avg: 11m 17s | Max: 35m 03s | Hits: 99%/95621

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 21m 15s | Avg: 10m 37s | Max: 13m 16s | Hits:  99%/3828  
    🟩 cpu
      🟩 amd64              Pass: 100%/48  | Total:  9h 12m | Avg: 11m 30s | Max: 35m 03s | Hits:  99%/91794 
      🟩 arm64              Pass: 100%/2   | Total: 12m 03s | Avg:  6m 01s | Max:  6m 53s | Hits:  99%/3827  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 51m 30s | Avg: 10m 18s | Max: 27m 23s | Hits:  99%/9560  
      🟩 12.9               Pass: 100%/45  | Total:  8h 32m | Avg: 11m 23s | Max: 35m 03s | Hits:  99%/86061 
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 10m 46s | Avg:  5m 23s | Max:  5m 32s | Hits: 100%/3826  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 51m 30s | Avg: 10m 18s | Max: 27m 23s | Hits:  99%/9560  
      🟩 nvcc12.9           Pass: 100%/43  | Total:  8h 22m | Avg: 11m 40s | Max: 35m 03s | Hits:  99%/82235 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 46s | Avg:  5m 23s | Max:  5m 32s | Hits: 100%/3826  
      🟩 nvcc               Pass: 100%/48  | Total:  9h 13m | Avg: 11m 31s | Max: 35m 03s | Hits:  99%/91795 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 23m 06s | Avg:  5m 46s | Max:  6m 39s | Hits: 100%/7652  
      🟩 Clang15            Pass: 100%/2   | Total: 11m 52s | Avg:  5m 56s | Max:  6m 09s | Hits: 100%/3826  
      🟩 Clang16            Pass: 100%/2   | Total: 12m 33s | Avg:  6m 16s | Max:  6m 24s | Hits: 100%/3826  
      🟩 Clang17            Pass: 100%/2   | Total: 11m 38s | Avg:  5m 49s | Max:  5m 51s | Hits: 100%/3826  
      🟩 Clang18            Pass: 100%/2   | Total: 11m 56s | Avg:  5m 58s | Max:  6m 05s | Hits: 100%/3826  
      🟩 Clang19            Pass: 100%/7   | Total: 45m 34s | Avg:  6m 30s | Max: 10m 03s | Hits: 100%/13391 
      🟩 GCC7               Pass: 100%/2   | Total: 13m 59s | Avg:  6m 59s | Max:  7m 27s | Hits:  99%/3828  
      🟩 GCC8               Pass: 100%/1   | Total:  7m 09s | Avg:  7m 09s | Max:  7m 09s | Hits:  99%/1914  
      🟩 GCC9               Pass: 100%/2   | Total: 14m 30s | Avg:  7m 15s | Max:  7m 28s | Hits:  99%/3828  
      🟩 GCC10              Pass: 100%/2   | Total: 14m 24s | Avg:  7m 12s | Max:  7m 19s | Hits:  99%/3828  
      🟩 GCC11              Pass: 100%/2   | Total: 15m 13s | Avg:  7m 36s | Max:  7m 50s | Hits:  99%/3828  
      🟩 GCC12              Pass: 100%/2   | Total: 15m 43s | Avg:  7m 51s | Max:  8m 05s | Hits:  99%/3828  
      🟩 GCC13              Pass: 100%/11  | Total:  1h 38m | Avg:  8m 55s | Max: 13m 27s | Hits:  99%/21054 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 54m 40s | Avg: 27m 20s | Max: 27m 23s | Hits:  99%/3812  
      🟩 MSVC14.43          Pass: 100%/5   | Total:  2h 25m | Avg: 29m 11s | Max: 32m 45s | Hits:  99%/9530  
      🟩 NVHPC25.5          Pass: 100%/2   | Total:  1h 07m | Avg: 33m 57s | Max: 35m 03s | Hits:  99%/3824  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  1h 56m | Avg:  6m 08s | Max: 10m 03s | Hits: 100%/36347 
      🟩 GCC                Pass: 100%/22  | Total:  2h 59m | Avg:  8m 08s | Max: 13m 27s | Hits:  99%/42108 
      🟩 MSVC               Pass: 100%/7   | Total:  3h 20m | Avg: 28m 39s | Max: 32m 45s | Hits:  99%/13342 
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 07m | Avg: 33m 57s | Max: 35m 03s | Hits:  99%/3824  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 18m 20s | Avg:  9m 10s | Max: 12m 20s | Hits:  99%/3828  
      🟩 rtx2080            Pass: 100%/38  | Total:  6h 48m | Avg: 10m 44s | Max: 35m 03s | Hits:  99%/72672 
      🟩 rtx4090            Pass: 100%/10  | Total:  2h 17m | Avg: 13m 46s | Max: 32m 45s | Hits:  99%/19121 
    🟩 jobs
      🟩 Build              Pass: 100%/43  | Total:  7h 45m | Avg: 10m 49s | Max: 35m 03s | Hits:  99%/82233 
      🟩 TestCPU            Pass: 100%/3   | Total: 49m 59s | Avg: 16m 39s | Max: 32m 45s | Hits:  99%/5733  
      🟩 TestGPU            Pass: 100%/4   | Total: 49m 06s | Avg: 12m 16s | Max: 13m 27s | Hits:  99%/7655  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 18m 20s | Avg:  9m 10s | Max: 12m 20s | Hits:  99%/3828  
      🟩 90;90a             Pass: 100%/2   | Total: 31m 39s | Avg: 15m 49s | Max: 25m 02s | Hits:  99%/3820  
      🟩 100;120            Pass: 100%/2   | Total: 34m 02s | Avg: 17m 01s | Max: 27m 28s | Hits:  99%/3820  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total:  3h 54m | Avg: 11m 10s | Max: 35m 03s | Hits:  99%/40160 
      🟩 20                 Pass: 100%/27  | Total:  5h 08m | Avg: 11m 25s | Max: 32m 52s | Hits:  99%/51633 
    
  • 🟩 libcudacxx: Pass: 100%/48 | Total: 9h 19m | Avg: 11m 39s | Max: 34m 31s | Hits: 93%/166828

    🟩 cpu
      🟩 amd64              Pass: 100%/46  | Total:  9h 02m | Avg: 11m 47s | Max: 34m 31s | Hits:  93%/159367
      🟩 arm64              Pass: 100%/2   | Total: 17m 18s | Avg:  8m 39s | Max: 12m 44s | Hits:  88%/7461  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 45m 26s | Avg:  9m 05s | Max: 28m 38s | Hits:  99%/18287 
      🟩 12.9               Pass: 100%/43  | Total:  8h 33m | Avg: 11m 57s | Max: 34m 31s | Hits:  92%/148541
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 57m 59s | Avg: 28m 59s | Max: 30m 44s | Hits:  29%/7425  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 45m 26s | Avg:  9m 05s | Max: 28m 38s | Hits:  99%/18287 
      🟩 nvcc12.9           Pass: 100%/41  | Total:  7h 35m | Avg: 11m 07s | Max: 34m 31s | Hits:  96%/141116
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 57m 59s | Avg: 28m 59s | Max: 30m 44s | Hits:  29%/7425  
      🟩 nvcc               Pass: 100%/46  | Total:  8h 21m | Avg: 10m 54s | Max: 34m 31s | Hits:  96%/159403
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 41m 54s | Avg: 10m 28s | Max: 28m 09s | Hits:  84%/14806 
      🟩 Clang15            Pass: 100%/2   | Total: 10m 08s | Avg:  5m 04s | Max:  5m 12s | Hits:  99%/7421  
      🟩 Clang16            Pass: 100%/2   | Total:  9m 58s | Avg:  4m 59s | Max:  5m 03s | Hits:  99%/7421  
      🟩 Clang17            Pass: 100%/2   | Total:  9m 43s | Avg:  4m 51s | Max:  4m 54s | Hits:  99%/7421  
      🟩 Clang18            Pass: 100%/2   | Total: 10m 15s | Avg:  5m 07s | Max:  5m 15s | Hits:  99%/7421  
      🟩 Clang19            Pass: 100%/6   | Total:  1h 33m | Avg: 15m 38s | Max: 30m 44s | Hits:  75%/22306 
      🟩 GCC7               Pass: 100%/2   | Total: 16m 12s | Avg:  8m 06s | Max: 12m 10s | Hits:  89%/7357  
      🟩 GCC8               Pass: 100%/1   | Total:  5m 04s | Avg:  5m 04s | Max:  5m 04s | Hits:  99%/3689  
      🟩 GCC9               Pass: 100%/2   | Total:  9m 31s | Avg:  4m 45s | Max:  5m 13s | Hits:  99%/7369  
      🟩 GCC10              Pass: 100%/2   | Total:  9m 49s | Avg:  4m 54s | Max:  5m 01s | Hits:  99%/7423  
      🟩 GCC11              Pass: 100%/2   | Total:  9m 36s | Avg:  4m 48s | Max:  4m 51s | Hits:  99%/7419  
      🟩 GCC12              Pass: 100%/2   | Total: 10m 33s | Avg:  5m 16s | Max:  5m 31s | Hits:  99%/7423  
      🟩 GCC13              Pass: 100%/11  | Total:  2h 02m | Avg: 11m 10s | Max: 23m 15s | Hits:  96%/30183 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 58m 25s | Avg: 29m 12s | Max: 29m 47s | Hits:  88%/7093  
      🟩 MSVC14.43          Pass: 100%/4   | Total:  2h 00m | Avg: 30m 00s | Max: 34m 31s | Hits:  99%/14669 
      🟩 NVHPC25.5          Pass: 100%/2   | Total: 21m 28s | Avg: 10m 44s | Max: 10m 54s | Hits:  98%/7407  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/18  | Total:  2h 55m | Avg:  9m 46s | Max: 30m 44s | Hits:  88%/66796 
      🟩 GCC                Pass: 100%/22  | Total:  3h 03m | Avg:  8m 20s | Max: 23m 15s | Hits:  97%/70863 
      🟩 MSVC               Pass: 100%/6   | Total:  2h 58m | Avg: 29m 44s | Max: 34m 31s | Hits:  95%/21762 
      🟩 NVHPC              Pass: 100%/2   | Total: 21m 28s | Avg: 10m 44s | Max: 10m 54s | Hits:  98%/7407  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 22m 30s | Avg: 11m 15s | Max: 17m 49s | Hits:  99%/7628  
      🟩 rtx2080            Pass: 100%/46  | Total:  8h 56m | Avg: 11m 40s | Max: 34m 31s | Hits:  93%/159200
    🟩 jobs
      🟩 Build              Pass: 100%/42  | Total:  7h 34m | Avg: 10m 48s | Max: 34m 31s | Hits:  93%/155511
      🟩 NVRTC              Pass: 100%/2   | Total: 41m 11s | Avg: 20m 35s | Max: 21m 27s | Hits:  90%/42    
      🟩 Test               Pass: 100%/3   | Total:  1h 02m | Avg: 20m 45s | Max: 23m 15s | Hits:  99%/11275 
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  1m 54s | Avg:  1m 54s | Max:  1m 54s
    🟩 sm
      🟩 75                 Pass: 100%/2   | Total: 41m 11s | Avg: 20m 35s | Max: 21m 27s | Hits:  90%/42    
      🟩 90                 Pass: 100%/2   | Total: 22m 30s | Avg: 11m 15s | Max: 17m 49s | Hits:  99%/7628  
      🟩 90;90a             Pass: 100%/2   | Total: 33m 08s | Avg: 16m 34s | Max: 27m 46s | Hits:  99%/7575  
      🟩 100;120            Pass: 100%/2   | Total: 32m 51s | Avg: 16m 25s | Max: 27m 32s | Hits:  99%/7575  
    🟩 std
      🟩 17                 Pass: 100%/22  | Total:  4h 16m | Avg: 11m 38s | Max: 30m 11s | Hits:  91%/77045 
      🟩 20                 Pass: 100%/25  | Total:  5h 01m | Avg: 12m 03s | Max: 34m 31s | Hits:  95%/89783 
    
  • 🟩 cudax: Pass: 100%/28 | Total: 5h 20m | Avg: 11m 26s | Max: 21m 35s | Hits: 84%/15954

    🟩 cpu
      🟩 amd64              Pass: 100%/24  | Total:  4h 39m | Avg: 11m 39s | Max: 21m 35s | Hits:  85%/13502 
      🟩 arm64              Pass: 100%/4   | Total: 40m 25s | Avg: 10m 06s | Max: 10m 55s | Hits:  82%/2452  
    🟩 ctk
      🟩 12.0               Pass: 100%/3   | Total: 30m 31s | Avg: 10m 10s | Max: 11m 00s | Hits:  85%/1535  
      🟩 12.9               Pass: 100%/25  | Total:  4h 49m | Avg: 11m 35s | Max: 21m 35s | Hits:  84%/14419 
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/3   | Total: 30m 31s | Avg: 10m 10s | Max: 11m 00s | Hits:  85%/1535  
      🟩 nvcc12.9           Pass: 100%/25  | Total:  4h 49m | Avg: 11m 35s | Max: 21m 35s | Hits:  84%/14419 
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/28  | Total:  5h 20m | Avg: 11m 26s | Max: 21m 35s | Hits:  84%/15954 
    🟩 cxx
      🟩 Clang14            Pass: 100%/2   | Total: 20m 15s | Avg: 10m 07s | Max: 10m 45s | Hits:  82%/1228  
      🟩 Clang15            Pass: 100%/1   | Total: 10m 30s | Avg: 10m 30s | Max: 10m 30s | Hits:  82%/613   
      🟩 Clang16            Pass: 100%/1   | Total: 10m 53s | Avg: 10m 53s | Max: 10m 53s | Hits:  82%/613   
      🟩 Clang17            Pass: 100%/1   | Total: 11m 21s | Avg: 11m 21s | Max: 11m 21s | Hits:  82%/613   
      🟩 Clang18            Pass: 100%/1   | Total: 10m 33s | Avg: 10m 33s | Max: 10m 33s | Hits:  82%/613   
      🟩 Clang19            Pass: 100%/4   | Total: 38m 29s | Avg:  9m 37s | Max: 10m 36s | Hits:  86%/2452  
      🟩 GCC10              Pass: 100%/2   | Total: 20m 55s | Avg: 10m 27s | Max: 10m 54s | Hits:  82%/1228  
      🟩 GCC11              Pass: 100%/1   | Total: 11m 14s | Avg: 11m 14s | Max: 11m 14s | Hits:  82%/613   
      🟩 GCC12              Pass: 100%/1   | Total: 12m 30s | Avg: 12m 30s | Max: 12m 30s | Hits:  82%/613   
      🟩 GCC13              Pass: 100%/8   | Total:  1h 27m | Avg: 10m 55s | Max: 15m 43s | Hits:  85%/4904  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 11m 00s | Avg: 11m 00s | Max: 11m 00s | Hits:  95%/309   
      🟩 MSVC14.43          Pass: 100%/3   | Total: 33m 54s | Avg: 11m 18s | Max: 12m 14s | Hits:  95%/933   
      🟩 NVHPC25.5          Pass: 100%/2   | Total: 41m 22s | Avg: 20m 41s | Max: 21m 35s | Hits:  79%/1222  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/10  | Total:  1h 42m | Avg: 10m 12s | Max: 11m 21s | Hits:  84%/6132  
      🟩 GCC                Pass: 100%/12  | Total:  2h 12m | Avg: 11m 00s | Max: 15m 43s | Hits:  84%/7358  
      🟩 MSVC               Pass: 100%/4   | Total: 44m 54s | Avg: 11m 13s | Max: 12m 14s | Hits:  95%/1242  
      🟩 NVHPC              Pass: 100%/2   | Total: 41m 22s | Avg: 20m 41s | Max: 21m 35s | Hits:  79%/1222  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 15m 58s | Avg:  7m 59s | Max:  8m 40s | Hits:  90%/1226  
      🟩 rtx2080            Pass: 100%/26  | Total:  5h 04m | Avg: 11m 42s | Max: 21m 35s | Hits:  84%/14728 
    🟩 jobs
      🟩 Build              Pass: 100%/25  | Total:  4h 48m | Avg: 11m 32s | Max: 21m 35s | Hits:  83%/14115 
      🟩 Test               Pass: 100%/3   | Total: 31m 41s | Avg: 10m 33s | Max: 15m 43s | Hits:  96%/1839  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 15m 58s | Avg:  7m 59s | Max:  8m 40s | Hits:  90%/1226  
      🟩 90;90a             Pass: 100%/2   | Total: 21m 13s | Avg: 10m 36s | Max: 11m 04s | Hits:  86%/924   
      🟩 100;120            Pass: 100%/2   | Total: 22m 02s | Avg: 11m 01s | Max: 11m 26s | Hits:  86%/924   
    🟩 std
      🟩 17                 Pass: 100%/3   | Total: 39m 21s | Avg: 13m 07s | Max: 19m 47s | Hits:  81%/1837  
      🟩 20                 Pass: 100%/25  | Total:  4h 41m | Avg: 11m 14s | Max: 21m 35s | Hits:  85%/14117 
    
  • 🟩 packaging: Pass: 100%/4 | Total: 16m 23s | Avg: 4m 05s | Max: 4m 44s

    🟩 cpu
      🟩 amd64              Pass: 100%/4   | Total: 16m 23s | Avg:  4m 05s | Max:  4m 44s
    🟩 ctk
      🟩 12.0               Pass: 100%/2   | Total:  7m 50s | Avg:  3m 55s | Max:  4m 02s
      🟩 12.9               Pass: 100%/2   | Total:  8m 33s | Avg:  4m 16s | Max:  4m 44s
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/2   | Total:  7m 50s | Avg:  3m 55s | Max:  4m 02s
      🟩 nvcc12.9           Pass: 100%/2   | Total:  8m 33s | Avg:  4m 16s | Max:  4m 44s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 16m 23s | Avg:  4m 05s | Max:  4m 44s
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  3m 48s | Avg:  3m 48s | Max:  3m 48s
      🟩 Clang19            Pass: 100%/1   | Total:  3m 49s | Avg:  3m 49s | Max:  3m 49s
      🟩 GCC12              Pass: 100%/1   | Total:  4m 02s | Avg:  4m 02s | Max:  4m 02s
      🟩 GCC13              Pass: 100%/1   | Total:  4m 44s | Avg:  4m 44s | Max:  4m 44s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/2   | Total:  7m 37s | Avg:  3m 48s | Max:  3m 49s
      🟩 GCC                Pass: 100%/2   | Total:  8m 46s | Avg:  4m 23s | Max:  4m 44s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 16m 23s | Avg:  4m 05s | Max:  4m 44s
    🟩 jobs
      🟩 Test               Pass: 100%/4   | Total: 16m 23s | Avg:  4m 05s | Max:  4m 44s
    
  • 🟩 stdpar: Pass: 100%/4 | Total: 15m 06s | Avg: 3m 46s | Max: 4m 01s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  7m 19s | Avg:  3m 39s | Max:  3m 43s
      🟩 arm64              Pass: 100%/2   | Total:  7m 47s | Avg:  3m 53s | Max:  4m 01s
    🟩 ctk
      🟩 12.9               Pass: 100%/4   | Total: 15m 06s | Avg:  3m 46s | Max:  4m 01s
    🟩 cudacxx
      🟩 nvcc12.9           Pass: 100%/4   | Total: 15m 06s | Avg:  3m 46s | Max:  4m 01s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 15m 06s | Avg:  3m 46s | Max:  4m 01s
    🟩 cxx
      🟩 NVHPC25.5          Pass: 100%/4   | Total: 15m 06s | Avg:  3m 46s | Max:  4m 01s
    🟩 cxx_family
      🟩 NVHPC              Pass: 100%/4   | Total: 15m 06s | Avg:  3m 46s | Max:  4m 01s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 15m 06s | Avg:  3m 46s | Max:  4m 01s
    🟩 jobs
      🟩 Build              Pass: 100%/4   | Total: 15m 06s | Avg:  3m 46s | Max:  4m 01s
    🟩 std
      🟩 17                 Pass: 100%/2   | Total:  7m 44s | Avg:  3m 52s | Max:  4m 01s
      🟩 20                 Pass: 100%/2   | Total:  7m 22s | Avg:  3m 41s | Max:  3m 46s
    

👃 Inspect Changes

Modifications in project?

Project
+/- CCCL Infrastructure
CCCL Packaging
libcu++
CUB
Thrust
+/- CUDA Experimental
stdpar
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
+/- CCCL Infrastructure
+/- CCCL Packaging
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- stdpar
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 205)

# Runner
128 linux-amd64-cpu16
23 windows-amd64-cpu16
14 linux-amd64-gpu-h100-latest-1
14 linux-amd64-gpu-rtxa6000-latest-1
12 linux-arm64-cpu16
11 linux-amd64-gpu-rtx2080-latest-1
3 linux-amd64-gpu-rtx4090-latest-1

… pass it to a shared library, so we convert it to a CUfunction prior to calling a function in the shared library (so we do it in the header)
@caugonnet
Contributor Author

/ok to test 2a75766

@caugonnet caugonnet marked this pull request as ready for review November 25, 2025 16:11
@cccl-authenticator-app cccl-authenticator-app bot moved this from In Progress to In Review in CCCL Nov 25, 2025
@caugonnet
Contributor Author

Note: we need to make sure our Python bindings can still be built when numba-core is not available (it became an optional dependency).
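One common way to tolerate a missing optional dependency is to probe for it once and fail only when a feature that needs it is actually used. The sketch below is illustrative, not the code in this PR: `has_module`, `HAS_NUMBA`, and `require_numba` are hypothetical names, and it assumes the optional package is importable as `numba`.

```python
import importlib.util


def has_module(name: str) -> bool:
    """Return True if a top-level module can be found, without importing it."""
    return importlib.util.find_spec(name) is not None


# Assumption: the optional dependency is importable under the name "numba".
HAS_NUMBA = has_module("numba")


def require_numba():
    """Import and return numba, or raise a clear error if it is not installed."""
    if not HAS_NUMBA:
        raise ImportError(
            "numba is not installed; features that depend on it are unavailable"
        )
    import numba  # deferred so the rest of the package loads without numba

    return numba
```

With this pattern the package itself imports cleanly whether or not numba is present, and only the code paths that call `require_numba()` depend on it.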

@caugonnet
Contributor Author

/ok to test dd6cc26

@github-actions
Contributor

github-actions bot commented Feb 3, 2026

😬 CI Workflow Results

🟥 Finished in 1h 58m: Pass: 48%/108 | Total: 14h 22m | Max: 1h 47m | Hits: 76%/23753

See results here.
