
[SYCL][CUDA][HIP] Deprecate context interop for CUDA and HIP #10975

Merged: 8 commits merged into intel:sycl on Oct 23, 2023

Conversation

@hdelan (Contributor) commented on Aug 25, 2023

The sycl::context does not map clearly to a native context for the CUDA and HIP backends. This is especially true now that we are adding support for multi-device contexts (#10737). It would be good to start this deprecation process now. PRs to oneMKL and oneDNN will follow.

@hdelan hdelan requested a review from a team as a code owner August 25, 2023 16:43
@hdelan hdelan requested a review from cperkinsintel August 25, 2023 16:43
@hdelan hdelan force-pushed the deprecate-cuda-hip-context-interop branch from 2dde128 to 36fbb6e on August 30, 2023
@hdelan (Contributor, Author) commented on Sep 4, 2023

Some further description:

The get_native method for a sycl::context currently maps to a single native CUcontext. The proposed CUDA backend specification for SYCL (KhronosGroup/SYCL-Docs#420) specifies that a sycl::context should instead map to a vector of native contexts. However, that PR is still unmerged, so this mapping cannot be integrated into the core API. For this reason we have two separate versions of get_native<backend::ext_oneapi_cuda, sycl::context>: one is the standard core SYCL spec version, and the other, which can be found at https://github.com/intel/llvm/blob/sycl/sycl/include/sycl/ext/oneapi/experimental/backend/cuda.hpp#L38, can be used if the macro SYCL_EXT_ONEAPI_BACKEND_CUDA_EXPERIMENTAL is defined (see the sketch below).
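For illustration, a hedged sketch of how the two variants look from a caller's perspective, based on the description above; the return types and the experimental header path are taken from this comment, and the surrounding function is an example only, not code from this PR:

```cpp
// Sketch only: return types follow the descriptions in this comment and are
// not guaranteed by the core SYCL specification.
#include <vector>
#include <cuda.h>
#include <sycl/sycl.hpp>

#ifdef SYCL_EXT_ONEAPI_BACKEND_CUDA_EXPERIMENTAL
// Experimental interop header mentioned above; the macro is typically
// defined by the user (e.g. -DSYCL_EXT_ONEAPI_BACKEND_CUDA_EXPERIMENTAL).
#include <sycl/ext/oneapi/experimental/backend/cuda.hpp>
#endif

void context_interop(const sycl::context &ctx) {
#ifdef SYCL_EXT_ONEAPI_BACKEND_CUDA_EXPERIMENTAL
  // Experimental variant: one CUcontext per device in the sycl::context.
  std::vector<CUcontext> native_ctxs =
      sycl::get_native<sycl::backend::ext_oneapi_cuda>(ctx);
  (void)native_ctxs;
#else
  // Core SYCL spec variant: a single CUcontext, which after #10737 would be
  // only the native context of the first device in the sycl::context.
  CUcontext native_ctx = sycl::get_native<sycl::backend::ext_oneapi_cuda>(ctx);
  (void)native_ctx;
#endif
}
```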

This dual implementation has been harmless up until now, since a context has been constrained to a single device. However, #10737 introduces the behaviour that if a platform contains two devices (i.e. two CUDA devices), the default context will contain them both. Users of the experimental interface, which returns a vector of CUcontexts, are therefore guaranteed the correct behaviour, whereas users of the standard interface, which returns a single CUcontext, get only the native context of the first device in the sycl::context. This could lead to tricky bugs in user code, as well as cases where only the first context in a sycl::context is usable, making the interop interface unintuitive and cumbersome.

For this reason I think it is better to deprecate the get_native interface for sycl::context for the CUDA and HIP backends, and instead encourage users to get the native context by calling cuDevicePrimaryCtxRetain / hipDevicePrimaryCtxRetain with a native device, as sketched below.
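A minimal sketch of the recommended replacement for the CUDA backend, assuming the CUDA Driver API and that get_native on a sycl::device returns a CUdevice; the function name primary_context_for is illustrative and error handling is omitted:

```cpp
#include <cuda.h>
#include <sycl/sycl.hpp>

CUcontext primary_context_for(const sycl::device &dev) {
  // Native CUDA device handle for this SYCL device.
  CUdevice cu_dev = sycl::get_native<sycl::backend::ext_oneapi_cuda>(dev);

  // Retain the primary context of that device instead of asking the
  // sycl::context for a native context.
  CUcontext cu_ctx;
  cuDevicePrimaryCtxRetain(&cu_ctx, cu_dev);

  // The caller must eventually call cuDevicePrimaryCtxRelease(cu_dev).
  return cu_ctx;
}
```

The HIP backend would presumably follow the same pattern, using hipDevicePrimaryCtxRetain with a native hipDevice_t.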

@npmiller (Contributor) left a comment:

LGTM

@hdelan (Contributor, Author) commented on Oct 2, 2023

Ping @intel/llvm-gatekeepers, can we merge this please?

@aelovikov-intel (Contributor) commented:

> Ping @intel/llvm-gatekeepers, can we merge this please?

check-sycl has failed.

@hdelan (Contributor, Author) commented on Oct 2, 2023

> Ping @intel/llvm-gatekeepers, can we merge this please?

> check-sycl has failed.

It seems that interop-cuda.cpp is being run for the AMD triple, even though REQUIRES: cuda is used. I am not sure what is happening here, but I will restart the CI.

@hdelan hdelan closed this Oct 2, 2023
@hdelan hdelan reopened this Oct 2, 2023
@hdelan (Contributor, Author) commented on Oct 2, 2023

@aelovikov-intel the test should be fixed now, but it is still surprising that this is running for the AMD triple.

@aelovikov-intel (Contributor) commented on Oct 2, 2023

> @aelovikov-intel the test should be fixed now, but it is still surprising that this is running for the AMD triple.

I think the semantics of LIT features might differ between sycl/test and sycl/test-e2e. #10635 could also be related somehow.

I'm not a driver expert, but might it be that we specify some options to be used when targeting AMD but then don't actually target it? IMO, that would be consistent with the "argument unused during compilation" warning we saw in that test run.

@hdelan hdelan closed this Oct 16, 2023
@hdelan hdelan reopened this Oct 16, 2023
@hdelan hdelan closed this Oct 23, 2023
@hdelan hdelan reopened this Oct 23, 2023
@againull merged commit e213fe2 into intel:sycl on Oct 23, 2023