Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[SYCL][CUDA] Event synchronization only done for latest events #1995

Merged
merged 2 commits into from
Jun 30, 2020

Conversation

steffenlarsen
Copy link
Contributor

CUDA streams operate in-order, so when waiting for a list of events from the same stream we introduce unnecessary overhead. These changes makes the PI CUDA backend only wait for the latest event for each stream in a given list of events.

@steffenlarsen steffenlarsen requested a review from a team as a code owner June 26, 2020 19:20
CUDA streams operate in-order, so when waiting for a list of events we
introduce unnecessary overhead. These changes makes the PI CUDA backend
only wait for the latest event for each stream in a given list of
events.

Signed-off-by: Steffen Larsen <[email protected]>
@steffenlarsen steffenlarsen force-pushed the steffen/wait-for-latest-event branch from 9a41bb7 to f0f916c Compare June 26, 2020 19:23
@Ruyk
Copy link
Contributor

Ruyk commented Jun 26, 2020

@bader

@Ruyk Ruyk added the cuda CUDA back-end label Jun 26, 2020
Signed-off-by: Steffen Larsen <[email protected]>
@bader bader requested a review from smaslov-intel June 29, 2020 20:03
@bader
Copy link
Contributor

bader commented Jun 30, 2020

@steffenlarsen, could you take a look at failing test, please?

@steffenlarsen
Copy link
Contributor Author

@steffenlarsen, could you take a look at failing test, please?

I don't think the failure is related to this PR. The changes introduced here are exclusively for PI CUDA, but the failure is using PI OpenCL (on Windows).

@bader
Copy link
Contributor

bader commented Jun 30, 2020

@steffenlarsen, could you take a look at failing test, please?

I don't think the failure is related to this PR. The changes introduced here are exclusively for PI CUDA, but the failure is using PI OpenCL (on Windows).

Sorry for the noise, my bad.
For some reason I missed that they are unrelated.

@bader bader merged commit d7ee359 into intel:sycl Jun 30, 2020
jsji pushed a commit that referenced this pull request May 15, 2023
This target extension type is created here: https://github.com/intel/vc-intrinsics/blob/master/GenXIntrinsics/lib/GenXIntrinsics/GenXSPIRVWriterAdaptor.cpp#L245

As with other target extension types, reverse translation is not yet supported.

Signed-off-by: Sarnie, Nick <[email protected]>
Co-authored-by: Victor Mustya <[email protected]>

Original commit:
KhronosGroup/SPIRV-LLVM-Translator@60746d5
jsji pushed a commit that referenced this pull request May 15, 2023
This target extension type is created here: https://github.com/intel/vc-intrinsics/blob/master/GenXIntrinsics/lib/GenXIntrinsics/GenXSPIRVWriterAdaptor.cpp#L245

As with other target extension types, reverse translation is not yet supported.

Signed-off-by: Sarnie, Nick <[email protected]>
Co-authored-by: Victor Mustya <[email protected]>

Original commit:
KhronosGroup/SPIRV-LLVM-Translator@60746d5
jsji pushed a commit that referenced this pull request May 16, 2023
This target extension type is created here: https://github.com/intel/vc-intrinsics/blob/master/GenXIntrinsics/lib/GenXIntrinsics/GenXSPIRVWriterAdaptor.cpp#L245

As with other target extension types, reverse translation is not yet supported.

Signed-off-by: Sarnie, Nick <[email protected]>
Co-authored-by: Victor Mustya <[email protected]>

Original commit:
KhronosGroup/SPIRV-LLVM-Translator@60746d5
jsji pushed a commit that referenced this pull request May 16, 2023
This target extension type is created here: https://github.com/intel/vc-intrinsics/blob/master/GenXIntrinsics/lib/GenXIntrinsics/GenXSPIRVWriterAdaptor.cpp#L245

As with other target extension types, reverse translation is not yet supported.

Signed-off-by: Sarnie, Nick <[email protected]>
Co-authored-by: Victor Mustya <[email protected]>

Original commit:
KhronosGroup/SPIRV-LLVM-Translator@60746d5
jsji pushed a commit to sys-ce-bb/llvm that referenced this pull request May 16, 2023
This target extension type is created here: https://github.com/intel/vc-intrinsics/blob/master/GenXIntrinsics/lib/GenXIntrinsics/GenXSPIRVWriterAdaptor.cpp#L245

As with other target extension types, reverse translation is not yet supported.

Signed-off-by: Sarnie, Nick <[email protected]>
Co-authored-by: Victor Mustya <[email protected]>

Original commit:
KhronosGroup/SPIRV-LLVM-Translator@60746d5
@jsji jsji mentioned this pull request May 16, 2023
jsji pushed a commit that referenced this pull request May 16, 2023
This target extension type is created here: https://github.com/intel/vc-intrinsics/blob/master/GenXIntrinsics/lib/GenXIntrinsics/GenXSPIRVWriterAdaptor.cpp#L245

As with other target extension types, reverse translation is not yet supported.

Signed-off-by: Sarnie, Nick <[email protected]>
Co-authored-by: Victor Mustya <[email protected]>

Original commit:
KhronosGroup/SPIRV-LLVM-Translator@60746d5
This was referenced May 16, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
cuda CUDA back-end
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants