Reorganize PTX tests to match generator #2930
int main(int, char**)
{
  // FIXME(bgruber): why no call to test_barrier_cluster?
I noticed we don't call any of the test functions. Is this intended?
Yeah, those are all compile-time tests, because for many instructions it's a ton of work to create a proper test.
Yes, this is intended.
What they test is: (1) does the C++ code generate valid PTX, (2) does it properly guard against SM version, and (3) is the PTX ISA check restrictive enough.
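As a rough illustration of that pattern, here is a minimal host-only sketch. It assumes the test works by taking the address of each wrapper inside a function that is never called. `DEMO_PTX_ISA`, `DEMO_SM_ARCH`, and the `demo` namespace are hypothetical stand-ins and not the real `cuda::ptx` API or its guard macros.

```cpp
#include <cstddef>

// Hypothetical stand-ins for the toolchain's PTX ISA version and the target
// SM architecture. These names are illustrative only.
#define DEMO_PTX_ISA 800
#define DEMO_SM_ARCH 900

namespace demo
{
using fn_t = void (*)();

#if DEMO_PTX_ISA >= 780
// Stand-ins for wrappers that would contain inline PTX in a real build.
inline void barrier_cluster_arrive() {}
inline void barrier_cluster_wait() {}
#endif
} // namespace demo

// Storing the address of each wrapper requires the wrapper to be compiled,
// so a malformed one breaks the build even though the real test never calls
// this function from main. Returns how many wrappers were referenced.
std::size_t test_barrier_cluster(demo::fn_t* fn_ptr)
{
  std::size_t n = 0;
#if DEMO_PTX_ISA >= 780 // (3) ISA version guard (assumed threshold)
#  if DEMO_SM_ARCH >= 900 // (2) SM version guard (assumed threshold)
  fn_ptr[n++] = static_cast<demo::fn_t>(demo::barrier_cluster_arrive); // (1) must compile
  fn_ptr[n++] = static_cast<demo::fn_t>(demo::barrier_cluster_wait);
#  endif
#endif
  return n;
}
```

With both guards satisfied, the two wrappers are referenced. Lowering either stand-in macro below its threshold compiles the references away, which is the behavior the guards are meant to provide.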
67b4031 to 6c6844c
🟩 CI finished in 3h 32m: Pass: 100%/396 | Total: 2d 00h | Avg: 7m 24s | Max: 39m 52s | Hits: 95%/22038
| | Project |
|---|---|
| | CCCL Infrastructure |
| +/- | libcu++ |
| | CUB |
| | Thrust |
| | CUDA Experimental |
| | python |
| | CCCL C Parallel Library |
| | Catch2Helper |
Modifications in project or dependencies?
| | Project |
|---|---|
| | CCCL Infrastructure |
| +/- | libcu++ |
| +/- | CUB |
| +/- | Thrust |
| +/- | CUDA Experimental |
| +/- | python |
| +/- | CCCL C Parallel Library |
| +/- | Catch2Helper |
🏃 Runner counts (total jobs: 396)
# | Runner |
---|---|
327 | linux-amd64-cpu16 |
28 | linux-arm64-cpu16 |
26 | linux-amd64-gpu-v100-latest-1 |
15 | windows-amd64-cpu16 |
This PR splits the `cuda::ptx` tests into generated and manually written parts. It only moves content around and renames files. No other changes.