Reorganize PTX tests to match generator #2930

Merged: 1 commit merged into NVIDIA:main on Nov 22, 2024

Conversation

@bernhardmgruber bernhardmgruber commented Nov 21, 2024

This PR splits the cuda::ptx tests into generated and manually written parts. It only moves content around and renames files. No other changes.

@bernhardmgruber bernhardmgruber added the libcu++ For all items related to libcu++ label Nov 21, 2024

int main(int, char**)
{
// FIXME(bgruber): why no call to test_barrier_cluster?
Contributor Author

I noticed we don't call any of the test functions. Is this intended?

Contributor

Yeah, those are all compile-time tests, because for many instructions it's a ton of work to create a proper test.

Contributor

@ahendriksen ahendriksen Nov 22, 2024

Yes, this is intended.
What they test is: (1) does the C++ code generate valid PTX, (2) does it properly guard against SM version, and (3) is the PTX ISA check restrictive enough.
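
To illustrate the idea of a test that is compiled but never called, here is a minimal host-only C++ sketch. It is not the actual cuda::ptx test code: `fake_instr`, `FAKE_PTX_ISA`, and `collect_overloads` are hypothetical names invented for this example, and the preprocessor check only stands in for the real PTX ISA and SM guards. The point is that taking the address of each overload forces the compiler to resolve and emit it, so a wrapper that fails to compile is caught at build time without running anything.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical stand-ins for wrappers around a PTX instruction. The real
// wrappers would emit inline PTX; these are empty so the sketch builds on
// any host compiler.
inline void fake_instr() {}
inline void fake_instr(int) {}

// Hypothetical macro standing in for the PTX ISA version check.
#define FAKE_PTX_ISA 780

// Stores the address of each overload without ever calling it, and returns
// how many addresses were written. Selecting an overload with static_cast
// and taking its address is enough to make the compiler instantiate it.
inline std::size_t collect_overloads(void** fn_ptr)
{
  assert(fn_ptr != nullptr);
  void** const first = fn_ptr;
#if FAKE_PTX_ISA >= 780
  *fn_ptr++ = reinterpret_cast<void*>(static_cast<void (*)()>(fake_instr));
  *fn_ptr++ = reinterpret_cast<void*>(static_cast<void (*)(int)>(fake_instr));
#endif
  return static_cast<std::size_t>(fn_ptr - first);
}
```

In this sketch, a caller would pass an array with at least two slots. Whether `main` ever invokes `collect_overloads` does not matter for the compile-time check, which is why an empty `main` is enough for such tests.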

@bernhardmgruber bernhardmgruber marked this pull request as ready for review November 21, 2024 18:26
@bernhardmgruber bernhardmgruber requested review from a team as code owners November 21, 2024 18:26
Contributor

🟩 CI finished in 3h 32m: Pass: 100%/396 | Total: 2d 00h | Avg: 7m 24s | Max: 39m 52s | Hits: 95%/22038
  • 🟩 libcudacxx: Pass: 100%/118 | Total: 18h 48m | Avg: 9m 33s | Max: 34m 13s | Hits: 91%/9504

    🟩 cpu
      🟩 amd64              Pass: 100%/110 | Total: 18h 18m | Avg:  9m 59s | Max: 34m 13s | Hits:  91%/9504  
      🟩 arm64              Pass: 100%/8   | Total: 29m 32s | Avg:  3m 41s | Max:  4m 44s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  2h 17m | Avg:  9m 11s | Max: 21m 35s | Hits:  98%/2182  
      🟩 11.8               Pass: 100%/3   | Total: 37m 05s | Avg: 12m 21s | Max: 20m 48s
      🟩 12.5               Pass: 100%/4   | Total:  1h 24m | Avg: 21m 08s | Max: 34m 13s
      🟩 12.6               Pass: 100%/96  | Total: 14h 28m | Avg:  9m 02s | Max: 31m 17s | Hits:  88%/7322  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/12  | Total:  2h 31m | Avg: 12m 35s | Max: 22m 01s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  2h 17m | Avg:  9m 11s | Max: 21m 35s | Hits:  98%/2182  
      🟩 nvcc11.8           Pass: 100%/3   | Total: 37m 05s | Avg: 12m 21s | Max: 20m 48s
      🟩 nvcc12.5           Pass: 100%/4   | Total:  1h 24m | Avg: 21m 08s | Max: 34m 13s
      🟩 nvcc12.6           Pass: 100%/84  | Total: 11h 57m | Avg:  8m 32s | Max: 31m 17s | Hits:  88%/7322  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/12  | Total:  2h 31m | Avg: 12m 35s | Max: 22m 01s
      🟩 nvcc               Pass: 100%/106 | Total: 16h 17m | Avg:  9m 13s | Max: 34m 13s | Hits:  91%/9504  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 39m 07s | Avg:  6m 31s | Max: 16m 53s
      🟩 Clang10            Pass: 100%/3   | Total: 25m 22s | Avg:  8m 27s | Max: 14m 14s
      🟩 Clang11            Pass: 100%/4   | Total: 33m 20s | Avg:  8m 20s | Max: 20m 08s
      🟩 Clang12            Pass: 100%/4   | Total: 41m 47s | Avg: 10m 26s | Max: 18m 31s
      🟩 Clang13            Pass: 100%/4   | Total: 19m 25s | Avg:  4m 51s | Max:  6m 02s
      🟩 Clang14            Pass: 100%/4   | Total: 31m 10s | Avg:  7m 47s | Max: 18m 20s
      🟩 Clang15            Pass: 100%/4   | Total: 28m 22s | Avg:  7m 05s | Max: 15m 06s
      🟩 Clang16            Pass: 100%/4   | Total: 33m 59s | Avg:  8m 29s | Max: 20m 40s
      🟩 Clang17            Pass: 100%/4   | Total: 19m 08s | Avg:  4m 47s | Max:  5m 42s
      🟩 Clang18            Pass: 100%/18  | Total:  3h 25m | Avg: 11m 23s | Max: 22m 01s
      🟩 GCC6               Pass: 100%/2   | Total: 21m 58s | Avg: 10m 59s | Max: 18m 48s
      🟩 GCC7               Pass: 100%/6   | Total: 49m 41s | Avg:  8m 16s | Max: 21m 35s
      🟩 GCC8               Pass: 100%/6   | Total: 48m 45s | Avg:  8m 07s | Max: 18m 25s
      🟩 GCC9               Pass: 100%/6   | Total:  1h 00m | Avg: 10m 06s | Max: 19m 58s
      🟩 GCC10              Pass: 100%/4   | Total: 27m 59s | Avg:  6m 59s | Max: 16m 16s
      🟩 GCC11              Pass: 100%/7   | Total: 54m 02s | Avg:  7m 43s | Max: 20m 48s
      🟩 GCC12              Pass: 100%/4   | Total: 25m 09s | Avg:  6m 17s | Max: 13m 42s
      🟩 GCC13              Pass: 100%/17  | Total:  3h 00m | Avg: 10m 35s | Max: 31m 17s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 26m 02s | Avg:  8m 40s | Max: 14m 48s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 18m 12s | Avg: 18m 12s | Max: 18m 12s | Hits:  98%/2182  
      🟩 MSVC14.29          Pass: 100%/2   | Total: 39m 15s | Avg: 19m 37s | Max: 23m 14s | Hits:  83%/4727  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 15m 07s | Avg: 15m 07s | Max: 15m 07s | Hits:  98%/2595  
      🟩 NVHPC24.7          Pass: 100%/4   | Total:  1h 24m | Avg: 21m 08s | Max: 34m 13s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/55  | Total:  7h 56m | Avg:  8m 40s | Max: 22m 01s
      🟩 GCC                Pass: 100%/52  | Total:  7h 48m | Avg:  9m 00s | Max: 31m 17s
      🟩 Intel              Pass: 100%/3   | Total: 26m 02s | Avg:  8m 40s | Max: 14m 48s
      🟩 MSVC               Pass: 100%/4   | Total:  1h 12m | Avg: 18m 08s | Max: 23m 14s | Hits:  91%/9504  
      🟩 NVHPC              Pass: 100%/4   | Total:  1h 24m | Avg: 21m 08s | Max: 34m 13s
    🟩 gpu
      🟩 v100               Pass: 100%/118 | Total: 18h 48m | Avg:  9m 33s | Max: 34m 13s | Hits:  91%/9504  
    🟩 jobs
      🟩 Build              Pass: 100%/110 | Total: 16h 09m | Avg:  8m 48s | Max: 34m 13s | Hits:  91%/9504  
      🟩 NVRTC              Pass: 100%/4   | Total:  1h 43m | Avg: 25m 49s | Max: 31m 17s
      🟩 Test               Pass: 100%/3   | Total: 53m 41s | Avg: 17m 53s | Max: 21m 55s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 02s | Avg:  2m 02s | Max:  2m 02s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 37m 05s | Avg: 12m 21s | Max: 20m 48s
      🟩 90                 Pass: 100%/4   | Total: 41m 36s | Avg: 10m 24s | Max: 11m 58s
      🟩 90a                Pass: 100%/8   | Total: 58m 04s | Avg:  7m 15s | Max: 12m 23s
    🟩 std
      🟩 11                 Pass: 100%/32  | Total:  3h 30m | Avg:  6m 34s | Max: 21m 35s
      🟩 14                 Pass: 100%/32  | Total:  4h 38m | Avg:  8m 41s | Max: 34m 13s | Hits:  83%/4467  
      🟩 17                 Pass: 100%/30  | Total:  5h 42m | Avg: 11m 24s | Max: 31m 17s | Hits:  96%/2442  
      🟩 20                 Pass: 100%/23  | Total:  4h 55m | Avg: 12m 49s | Max: 34m 05s | Hits:  98%/2595  
    
  • 🟩 thrust: Pass: 100%/111 | Total: 12h 30m | Avg: 6m 45s | Max: 26m 23s | Hits: 99%/9260

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 23m 13s | Avg: 11m 36s | Max: 16m 49s
    🟩 cpu
      🟩 amd64              Pass: 100%/103 | Total: 11h 52m | Avg:  6m 55s | Max: 26m 23s | Hits:  99%/9260  
      🟩 arm64              Pass: 100%/8   | Total: 37m 23s | Avg:  4m 40s | Max:  5m 42s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 19m | Avg:  5m 19s | Max: 19m 15s | Hits:  99%/1852  
      🟩 11.8               Pass: 100%/3   | Total: 16m 26s | Avg:  5m 28s | Max:  5m 49s
      🟩 12.5               Pass: 100%/4   | Total:  1h 00m | Avg: 15m 09s | Max: 15m 59s
      🟩 12.6               Pass: 100%/89  | Total:  9h 53m | Avg:  6m 39s | Max: 26m 23s | Hits:  99%/7408  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total: 20m 32s | Avg:  5m 08s | Max:  5m 23s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 19m | Avg:  5m 19s | Max: 19m 15s | Hits:  99%/1852  
      🟩 nvcc11.8           Pass: 100%/3   | Total: 16m 26s | Avg:  5m 28s | Max:  5m 49s
      🟩 nvcc12.5           Pass: 100%/4   | Total:  1h 00m | Avg: 15m 09s | Max: 15m 59s
      🟩 nvcc12.6           Pass: 100%/85  | Total:  9h 32m | Avg:  6m 44s | Max: 26m 23s | Hits:  99%/7408  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total: 20m 32s | Avg:  5m 08s | Max:  5m 23s
      🟩 nvcc               Pass: 100%/107 | Total: 12h 09m | Avg:  6m 49s | Max: 26m 23s | Hits:  99%/9260  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 32m 52s | Avg:  5m 28s | Max:  6m 34s
      🟩 Clang10            Pass: 100%/3   | Total: 19m 15s | Avg:  6m 25s | Max:  7m 05s
      🟩 Clang11            Pass: 100%/4   | Total: 20m 40s | Avg:  5m 10s | Max:  5m 30s
      🟩 Clang12            Pass: 100%/4   | Total: 20m 28s | Avg:  5m 07s | Max:  5m 22s
      🟩 Clang13            Pass: 100%/4   | Total: 20m 57s | Avg:  5m 14s | Max:  5m 39s
      🟩 Clang14            Pass: 100%/4   | Total: 20m 12s | Avg:  5m 03s | Max:  5m 20s
      🟩 Clang15            Pass: 100%/4   | Total: 21m 40s | Avg:  5m 25s | Max:  6m 11s
      🟩 Clang16            Pass: 100%/4   | Total: 21m 27s | Avg:  5m 21s | Max:  5m 39s
      🟩 Clang17            Pass: 100%/4   | Total: 21m 30s | Avg:  5m 22s | Max:  5m 48s
      🟩 Clang18            Pass: 100%/11  | Total:  1h 03m | Avg:  5m 48s | Max: 13m 19s
      🟩 GCC6               Pass: 100%/2   | Total:  8m 06s | Avg:  4m 03s | Max:  4m 19s
      🟩 GCC7               Pass: 100%/6   | Total: 27m 02s | Avg:  4m 30s | Max:  5m 15s
      🟩 GCC8               Pass: 100%/6   | Total: 28m 40s | Avg:  4m 46s | Max:  5m 34s
      🟩 GCC9               Pass: 100%/6   | Total: 29m 11s | Avg:  4m 51s | Max:  5m 41s
      🟩 GCC10              Pass: 100%/4   | Total: 21m 00s | Avg:  5m 15s | Max:  5m 39s
      🟩 GCC11              Pass: 100%/7   | Total: 39m 35s | Avg:  5m 39s | Max:  6m 22s
      🟩 GCC12              Pass: 100%/4   | Total: 22m 33s | Avg:  5m 38s | Max:  5m 54s
      🟩 GCC13              Pass: 100%/16  | Total:  2h 16m | Avg:  8m 33s | Max: 26m 23s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 19m 51s | Avg:  6m 37s | Max:  6m 55s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 19m 15s | Avg: 19m 15s | Max: 19m 15s | Hits:  99%/1852  
      🟩 MSVC14.29          Pass: 100%/2   | Total: 32m 56s | Avg: 16m 28s | Max: 17m 23s | Hits:  99%/3704  
      🟩 MSVC14.39          Pass: 100%/2   | Total: 41m 30s | Avg: 20m 45s | Max: 23m 42s | Hits:  99%/3704  
      🟩 NVHPC24.7          Pass: 100%/4   | Total:  1h 00m | Avg: 15m 09s | Max: 15m 59s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/48  | Total:  4h 22m | Avg:  5m 28s | Max: 13m 19s
      🟩 GCC                Pass: 100%/51  | Total:  5h 13m | Avg:  6m 08s | Max: 26m 23s
      🟩 Intel              Pass: 100%/3   | Total: 19m 51s | Avg:  6m 37s | Max:  6m 55s
      🟩 MSVC               Pass: 100%/5   | Total:  1h 33m | Avg: 18m 44s | Max: 23m 42s | Hits:  99%/9260  
      🟩 NVHPC              Pass: 100%/4   | Total:  1h 00m | Avg: 15m 09s | Max: 15m 59s
    🟩 gpu
      🟩 v100               Pass: 100%/111 | Total: 12h 30m | Avg:  6m 45s | Max: 26m 23s | Hits:  99%/9260  
    🟩 jobs
      🟩 Build              Pass: 100%/103 | Total: 10h 26m | Avg:  6m 04s | Max: 19m 15s | Hits:  99%/7408  
      🟩 TestCPU            Pass: 100%/4   | Total: 46m 14s | Avg: 11m 33s | Max: 23m 42s | Hits:  99%/1852  
      🟩 TestGPU            Pass: 100%/4   | Total:  1h 17m | Avg: 19m 27s | Max: 26m 23s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 16m 26s | Avg:  5m 28s | Max:  5m 49s
      🟩 90a                Pass: 100%/4   | Total: 19m 10s | Avg:  4m 47s | Max:  5m 14s
    🟩 std
      🟩 11                 Pass: 100%/30  | Total:  2h 50m | Avg:  5m 41s | Max: 21m 17s
      🟩 14                 Pass: 100%/29  | Total:  3h 07m | Avg:  6m 27s | Max: 19m 15s | Hits:  99%/3704  
      🟩 17                 Pass: 100%/27  | Total:  2h 49m | Avg:  6m 17s | Max: 17m 23s | Hits:  99%/1852  
      🟩 20                 Pass: 100%/23  | Total:  3h 19m | Avg:  8m 39s | Max: 26m 23s | Hits:  99%/3704  
    
  • 🟩 cub: Pass: 100%/110 | Total: 12h 50m | Avg: 7m 00s | Max: 39m 52s | Hits: 99%/3028

    🟩 cpu
      🟩 amd64              Pass: 100%/102 | Total: 12h 12m | Avg:  7m 11s | Max: 39m 52s | Hits:  99%/3028  
      🟩 arm64              Pass: 100%/8   | Total: 37m 18s | Avg:  4m 39s | Max:  5m 27s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 16m | Avg:  5m 06s | Max: 15m 14s | Hits:  99%/757   
      🟩 11.8               Pass: 100%/3   | Total: 16m 16s | Avg:  5m 25s | Max:  5m 43s
      🟩 12.5               Pass: 100%/4   | Total: 35m 17s | Avg:  8m 49s | Max:  9m 09s
      🟩 12.6               Pass: 100%/88  | Total: 10h 41m | Avg:  7m 17s | Max: 39m 52s | Hits:  99%/2271  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total: 16m 57s | Avg:  4m 14s | Max:  4m 38s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 16m | Avg:  5m 06s | Max: 15m 14s | Hits:  99%/757   
      🟩 nvcc11.8           Pass: 100%/3   | Total: 16m 16s | Avg:  5m 25s | Max:  5m 43s
      🟩 nvcc12.5           Pass: 100%/4   | Total: 35m 17s | Avg:  8m 49s | Max:  9m 09s
      🟩 nvcc12.6           Pass: 100%/84  | Total: 10h 24m | Avg:  7m 26s | Max: 39m 52s | Hits:  99%/2271  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total: 16m 57s | Avg:  4m 14s | Max:  4m 38s
      🟩 nvcc               Pass: 100%/106 | Total: 12h 33m | Avg:  7m 06s | Max: 39m 52s | Hits:  99%/3028  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 31m 45s | Avg:  5m 17s | Max:  6m 05s
      🟩 Clang10            Pass: 100%/3   | Total: 18m 17s | Avg:  6m 05s | Max:  6m 32s
      🟩 Clang11            Pass: 100%/4   | Total: 22m 27s | Avg:  5m 36s | Max:  5m 50s
      🟩 Clang12            Pass: 100%/4   | Total: 21m 45s | Avg:  5m 26s | Max:  5m 39s
      🟩 Clang13            Pass: 100%/4   | Total: 20m 41s | Avg:  5m 10s | Max:  5m 28s
      🟩 Clang14            Pass: 100%/4   | Total: 20m 11s | Avg:  5m 02s | Max:  5m 07s
      🟩 Clang15            Pass: 100%/4   | Total: 21m 26s | Avg:  5m 21s | Max:  5m 43s
      🟩 Clang16            Pass: 100%/4   | Total: 22m 07s | Avg:  5m 31s | Max:  5m 46s
      🟩 Clang17            Pass: 100%/4   | Total: 22m 21s | Avg:  5m 35s | Max:  5m 50s
      🟩 Clang18            Pass: 100%/11  | Total:  1h 24m | Avg:  7m 42s | Max: 24m 37s
      🟩 GCC6               Pass: 100%/2   | Total:  8m 32s | Avg:  4m 16s | Max:  4m 34s
      🟩 GCC7               Pass: 100%/6   | Total: 28m 16s | Avg:  4m 42s | Max:  5m 18s
      🟩 GCC8               Pass: 100%/6   | Total: 28m 42s | Avg:  4m 47s | Max:  5m 16s
      🟩 GCC9               Pass: 100%/6   | Total: 28m 41s | Avg:  4m 46s | Max:  5m 23s
      🟩 GCC10              Pass: 100%/4   | Total: 21m 35s | Avg:  5m 23s | Max:  5m 33s
      🟩 GCC11              Pass: 100%/7   | Total: 38m 05s | Avg:  5m 26s | Max:  6m 00s
      🟩 GCC12              Pass: 100%/4   | Total: 22m 21s | Avg:  5m 35s | Max:  5m 54s
      🟩 GCC13              Pass: 100%/16  | Total:  3h 21m | Avg: 12m 33s | Max: 39m 52s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 19m 19s | Avg:  6m 26s | Max:  6m 48s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 15m 14s | Avg: 15m 14s | Max: 15m 14s | Hits:  99%/757   
      🟩 MSVC14.29          Pass: 100%/2   | Total: 23m 55s | Avg: 11m 57s | Max: 12m 37s | Hits:  99%/1514  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 13m 20s | Avg: 13m 20s | Max: 13m 20s | Hits:  99%/757   
      🟩 NVHPC24.7          Pass: 100%/4   | Total: 35m 17s | Avg:  8m 49s | Max:  9m 09s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/48  | Total:  4h 45m | Avg:  5m 57s | Max: 24m 37s
      🟩 GCC                Pass: 100%/51  | Total:  6h 17m | Avg:  7m 23s | Max: 39m 52s
      🟩 Intel              Pass: 100%/3   | Total: 19m 19s | Avg:  6m 26s | Max:  6m 48s
      🟩 MSVC               Pass: 100%/4   | Total: 52m 29s | Avg: 13m 07s | Max: 15m 14s | Hits:  99%/3028  
      🟩 NVHPC              Pass: 100%/4   | Total: 35m 17s | Avg:  8m 49s | Max:  9m 09s
    🟩 gpu
      🟩 v100               Pass: 100%/110 | Total: 12h 50m | Avg:  7m 00s | Max: 39m 52s | Hits:  99%/3028  
    🟩 jobs
      🟩 Build              Pass: 100%/102 | Total:  9h 31m | Avg:  5m 36s | Max: 15m 14s | Hits:  99%/3028  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 34m 01s | Avg: 34m 01s | Max: 34m 01s
      🟩 GraphCapture       Pass: 100%/1   | Total: 21m 28s | Avg: 21m 28s | Max: 21m 28s
      🟩 HostLaunch         Pass: 100%/3   | Total: 55m 13s | Avg: 18m 24s | Max: 19m 41s
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 28m | Avg: 29m 21s | Max: 39m 52s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 16m 16s | Avg:  5m 25s | Max:  5m 43s
      🟩 90a                Pass: 100%/4   | Total: 16m 15s | Avg:  4m 03s | Max:  4m 18s
    🟩 std
      🟩 11                 Pass: 100%/30  | Total:  3h 20m | Avg:  6m 41s | Max: 39m 52s
      🟩 14                 Pass: 100%/29  | Total:  2h 48m | Avg:  5m 47s | Max: 15m 14s | Hits:  99%/1514  
      🟩 17                 Pass: 100%/27  | Total:  2h 34m | Avg:  5m 42s | Max: 12m 37s | Hits:  99%/757   
      🟩 20                 Pass: 100%/24  | Total:  4h 07m | Avg: 10m 17s | Max: 34m 01s | Hits:  99%/757   
    
  • 🟩 cudax: Pass: 100%/54 | Total: 4h 17m | Avg: 4m 46s | Max: 22m 30s | Hits: 90%/246

    🟩 cpu
      🟩 amd64              Pass: 100%/50  | Total:  4h 07m | Avg:  4m 57s | Max: 22m 30s | Hits:  90%/246   
      🟩 arm64              Pass: 100%/4   | Total: 10m 18s | Avg:  2m 34s | Max:  2m 37s
    🟩 ctk
      🟩 12.0               Pass: 100%/19  | Total:  1h 30m | Avg:  4m 46s | Max: 16m 53s | Hits:  90%/123   
      🟩 12.5               Pass: 100%/2   | Total:  9m 49s | Avg:  4m 54s | Max:  5m 12s
      🟩 12.6               Pass: 100%/33  | Total:  2h 37m | Avg:  4m 46s | Max: 22m 30s | Hits:  90%/123   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/19  | Total:  1h 30m | Avg:  4m 46s | Max: 16m 53s | Hits:  90%/123   
      🟩 nvcc12.5           Pass: 100%/2   | Total:  9m 49s | Avg:  4m 54s | Max:  5m 12s
      🟩 nvcc12.6           Pass: 100%/33  | Total:  2h 37m | Avg:  4m 46s | Max: 22m 30s | Hits:  90%/123   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/54  | Total:  4h 17m | Avg:  4m 46s | Max: 22m 30s | Hits:  90%/246   
    🟩 cxx
      🟩 Clang9             Pass: 100%/2   | Total:  6m 25s | Avg:  3m 12s | Max:  3m 25s
      🟩 Clang10            Pass: 100%/2   | Total:  6m 56s | Avg:  3m 28s | Max:  3m 56s
      🟩 Clang11            Pass: 100%/4   | Total: 11m 48s | Avg:  2m 57s | Max:  3m 14s
      🟩 Clang12            Pass: 100%/4   | Total: 11m 56s | Avg:  2m 59s | Max:  3m 18s
      🟩 Clang13            Pass: 100%/4   | Total: 11m 44s | Avg:  2m 56s | Max:  3m 14s
      🟩 Clang14            Pass: 100%/4   | Total: 25m 45s | Avg:  6m 26s | Max: 16m 53s
      🟩 Clang15            Pass: 100%/2   | Total:  6m 16s | Avg:  3m 08s | Max:  3m 15s
      🟩 Clang16            Pass: 100%/4   | Total: 11m 35s | Avg:  2m 53s | Max:  3m 16s
      🟩 Clang17            Pass: 100%/2   | Total:  6m 43s | Avg:  3m 21s | Max:  3m 22s
      🟩 Clang18            Pass: 100%/2   | Total: 18m 51s | Avg:  9m 25s | Max: 15m 37s
      🟩 GCC9               Pass: 100%/2   | Total:  6m 02s | Avg:  3m 01s | Max:  3m 07s
      🟩 GCC10              Pass: 100%/4   | Total: 11m 36s | Avg:  2m 54s | Max:  2m 59s
      🟩 GCC11              Pass: 100%/4   | Total: 11m 50s | Avg:  2m 57s | Max:  3m 10s
      🟩 GCC12              Pass: 100%/7   | Total:  1h 11m | Avg: 10m 13s | Max: 22m 30s
      🟩 GCC13              Pass: 100%/3   | Total:  8m 01s | Avg:  2m 40s | Max:  2m 52s
      🟩 MSVC14.36          Pass: 100%/1   | Total: 11m 04s | Avg: 11m 04s | Max: 11m 04s | Hits:  90%/123   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 10m 03s | Avg: 10m 03s | Max: 10m 03s | Hits:  90%/123   
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  9m 49s | Avg:  4m 54s | Max:  5m 12s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/30  | Total:  1h 57m | Avg:  3m 55s | Max: 16m 53s
      🟩 GCC                Pass: 100%/20  | Total:  1h 49m | Avg:  5m 27s | Max: 22m 30s
      🟩 MSVC               Pass: 100%/2   | Total: 21m 07s | Avg: 10m 33s | Max: 11m 04s | Hits:  90%/246   
      🟩 NVHPC              Pass: 100%/2   | Total:  9m 49s | Avg:  4m 54s | Max:  5m 12s
    🟩 gpu
      🟩 v100               Pass: 100%/54  | Total:  4h 17m | Avg:  4m 46s | Max: 22m 30s | Hits:  90%/246   
    🟩 jobs
      🟩 Build              Pass: 100%/49  | Total:  2h 46m | Avg:  3m 23s | Max: 11m 04s | Hits:  90%/246   
      🟩 Test               Pass: 100%/5   | Total:  1h 31m | Avg: 18m 20s | Max: 22m 30s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 48s | Avg:  2m 48s | Max:  2m 48s
      🟩 90a                Pass: 100%/1   | Total:  2m 52s | Avg:  2m 52s | Max:  2m 52s
    🟩 std
      🟩 17                 Pass: 100%/29  | Total:  2h 00m | Avg:  4m 09s | Max: 19m 51s
      🟩 20                 Pass: 100%/25  | Total:  2h 17m | Avg:  5m 30s | Max: 22m 30s | Hits:  90%/246   
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 10m 42s | Avg: 5m 21s | Max: 8m 35s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 10m 42s | Avg:  5m 21s | Max:  8m 35s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total: 10m 42s | Avg:  5m 21s | Max:  8m 35s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total: 10m 42s | Avg:  5m 21s | Max:  8m 35s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 10m 42s | Avg:  5m 21s | Max:  8m 35s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 10m 42s | Avg:  5m 21s | Max:  8m 35s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 10m 42s | Avg:  5m 21s | Max:  8m 35s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total: 10m 42s | Avg:  5m 21s | Max:  8m 35s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 07s | Avg:  2m 07s | Max:  2m 07s
      🟩 Test               Pass: 100%/1   | Total:  8m 35s | Avg:  8m 35s | Max:  8m 35s
    
  • 🟩 python: Pass: 100%/1 | Total: 14m 54s | Avg: 14m 54s | Max: 14m 54s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 14m 54s | Avg: 14m 54s | Max: 14m 54s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 14m 54s | Avg: 14m 54s | Max: 14m 54s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 14m 54s | Avg: 14m 54s | Max: 14m 54s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 14m 54s | Avg: 14m 54s | Max: 14m 54s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 14m 54s | Avg: 14m 54s | Max: 14m 54s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 14m 54s | Avg: 14m 54s | Max: 14m 54s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 14m 54s | Avg: 14m 54s | Max: 14m 54s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 14m 54s | Avg: 14m 54s | Max: 14m 54s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 396)

# Runner
327 linux-amd64-cpu16
28 linux-arm64-cpu16
26 linux-amd64-gpu-v100-latest-1
15 windows-amd64-cpu16

@bernhardmgruber bernhardmgruber merged commit cee542b into NVIDIA:main Nov 22, 2024
414 checks passed
@bernhardmgruber bernhardmgruber deleted the ptx_tests branch November 22, 2024 10:28
davebayer pushed a commit to davebayer/cccl that referenced this pull request Nov 22, 2024
trxcllnt pushed a commit to trxcllnt/cccl that referenced this pull request Nov 23, 2024
davebayer pushed a commit to davebayer/cccl that referenced this pull request Dec 2, 2024
Labels
libcu++ For all items related to libcu++
3 participants