Skip to content

Conversation

@krishnaraj36
Copy link
Contributor

Fixed the opencl codegen for few operators -
1. Atomic add for float - opencl doesn't have support float atomic add,
Enabled work-around for this operation with atomic_cmpexch()
2. fmodf - Opencl only support fmod for all floating point
3. nearbyint - Opencl doesn't have this function and henced replaced
with roud function.

srkreddy1238 and others added 8 commits August 14, 2024 11:06
Partition pass should shoose off loading ops based on target support
this config enables choosing target version on python api aswell as
tvmc.
Fixed the opencl codegen for few operators -
1. Atomic add for float - opencl doesn't have support float atomic add,
Enabled work-around for this operation with atomic_cmpexch()
2. fmodf - Opencl only support fmod for all floating point
3. nearbyint - Opencl doesn't have this function and henced replaced
with roud function.
This reverts commit 4ff98a8.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants