Skip to content

Conversation

@nyalloc
Copy link
Contributor

@nyalloc nyalloc commented Jun 23, 2020

This PR introduces an implementation of the pi_sampler API for the PI CUDA backend.

@nyalloc nyalloc requested a review from a team as a code owner June 23, 2020 12:52
@nyalloc nyalloc requested a review from smaslov-intel June 23, 2020 12:52
@bader bader added the cuda CUDA back-end label Jun 23, 2020
Bjoern Knafla and others added 3 commits June 23, 2020 16:06
Remove unnecessary memory fence after a CUDA memory barrier
(__syncthreads).

The emitted `bar.sync 0` PTX instruction ensures that all memory
accesses of threads involved in the barrier `0` have been performed and
that no new memory accesses happen before the barrier completes.

The removed memory fence reduced performance without adding any
functionality to the barrier memory behavior.

Signed-off-by: Bjoern Knafla <[email protected]>
Co-authored-be: Victor Lomuller <[email protected]>
Remove old forward declared SPIR-V builtin from the relational category.
Update the binding file using the header generator.
Add relevant binding test.
Remove XFAIL from related SYCL tests.

Signed-off-by: Victor Lomuller <[email protected]>
@nyalloc nyalloc changed the title [SYCL] Implemented CUDA pi_sampler [WIP] [SYCL] Implemented CUDA pi_sampler Jun 23, 2020
@nyalloc nyalloc changed the title [WIP] [SYCL] Implemented CUDA pi_sampler [SYCL][WIP] Implemented CUDA pi_sampler Jun 23, 2020
@nyalloc nyalloc changed the title [SYCL][WIP] Implemented CUDA pi_sampler [SYCL][CUDA][WIP] Implemented CUDA pi_sampler Jun 23, 2020
@nyalloc nyalloc changed the title [SYCL][CUDA][WIP] Implemented CUDA pi_sampler [SYCL][CUDA] Implemented CUDA pi_sampler Jun 24, 2020
@bader
Copy link
Contributor

bader commented Jun 25, 2020

@smaslov-intel, @intel/llvm-reviewers-cuda, ping.

@nyalloc nyalloc requested review from a team and bader as code owners June 26, 2020 10:42
@nyalloc nyalloc changed the title [SYCL][CUDA] Implemented CUDA pi_sampler [SYCL][CUDA][WIP] Implemented CUDA pi_sampler Jun 26, 2020
@nyalloc nyalloc closed this Jun 26, 2020
@nyalloc nyalloc deleted the stuart/pi_sampler branch June 26, 2020 11:07
bb-sycl pushed a commit that referenced this pull request Apr 18, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

cuda CUDA back-end

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants