Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
compiler: Augment code generation capabilities for CUDA/HIP/SYCL support #1828
compiler: Augment code generation capabilities for CUDA/HIP/SYCL support #1828
Changes from 57 commits
88ba836
12f1b97
b11e06d
72c2977
f558777
7480305
1310826
ca7207c
dcfb96a
4cfc9c2
9ba7c08
0a599e5
860d562
8656da0
66f34b8
b821ac8
f3844f1
df06e7b
cee2c06
0b71b4f
2329077
84b8f45
2ed2cb4
0bd2aeb
e032f48
289cc9c
d7ae26c
d4bddad
4bd954f
0fd651a
13d45a0
1e45ff3
e375ba9
a60dc75
48af866
8290ed8
fa94460
b5b60aa
c78f022
e238022
954d982
f7b1f0b
860f135
2e9d8f5
1c5cecd
248d1d8
a584ebd
ad8aabd
308eb3b
c6578e5
69b146d
1f28853
853350b
108814f
979202e
d2ece83
633c4a5
846bff0
55c6352
220fe2a
File filter
Filter by extension
Conversations
Jump to
There are no files selected for viewing
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
IS this really needed to have an option and its negated?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Probably not. So, I made it this way because then in theory you could have both modes... we would need some extra machinery, but one could have eager blocking for some loops and lazy blocking for other, but yeah, I admit we don't really have use cases ATM, so if you prefer I can drop one
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nitpicking: In the docstring, I would rename blocking as loop blocking
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
that's a lot of nested try/catch isn't there an simpler way? WIth like a recursion or something like
make_tile(I) for I in x
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm sure it's possible, but here the MAX depth is fixed (3), so I think explicit is OK