[SYCL] Add support for SYCL Nvidia target #5738

Merged: 3 commits into ggerganov:master on Mar 11, 2024

Conversation

AidanBeltonS (Collaborator)

This PR adds CMake support and instructions for a SYCL Nvidia target. This is done with the LLAMA_SYCL_BACKEND CMake option, which defaults to INTEL but can be set to NVIDIA (see the build sketch below). This approach allows us to expand to AMD targets in the future.

This PR depends on #5591, as that resolves the failing Nvidia test-backend-ops tests.
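For concreteness, here is a minimal sketch of how such a build might be configured. Only the LLAMA_SYCL_BACKEND option comes from this PR description; the surrounding flags (LLAMA_SYCL, the icx/icpx compiler choices, the oneAPI environment script path) are assumptions based on the existing SYCL build instructions and may differ from the final merged CMake.

```sh
# Assumed oneAPI environment setup; the path may differ on your system.
source /opt/intel/oneapi/setvars.sh

# Configure for the Nvidia target via the option described in this PR.
# LLAMA_SYCL and the compiler choices are assumptions, not taken from this PR.
cmake -B build \
      -DCMAKE_C_COMPILER=icx -DCMAKE_CXX_COMPILER=icpx \
      -DLLAMA_SYCL=ON \
      -DLLAMA_SYCL_BACKEND=NVIDIA

cmake --build build --config Release
```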

@AidanBeltonS (Collaborator, Author)

@NeoZhangJianyu, @abhilash1910, @Alcpz, feel free to review

@0cc4m (Collaborator) commented Feb 26, 2024

Just curious, does this kind of build time selection mean that it's not possible to run an Nvidia and an Intel GPU at the same time using SYCL?

@AidanBeltonS (Collaborator, Author) commented Feb 27, 2024

> Just curious, does this kind of build time selection mean that it's not possible to run an Nvidia and an Intel GPU at the same time using SYCL?

It is possible to target both Nvidia and Intel GPUs with the same SYCL code: if you pass -fsycl-targets=nvptx64-nvidia-cuda,spir64, device code is generated for both targets and you can execute on either GPU at runtime. The issue in the specific case of llama is that oneMKL must be built for a specific backend (here the cuBLAS backend), so in this case it makes sense to build for either device, but not both at the same time.
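As an illustration of the multi-target compile mentioned above, a hedged sketch; only the -fsycl-targets value comes from the comment, while the icpx driver invocation, the presence of the Codeplay Nvidia plugin, and the file names are assumptions.

```sh
# Emit device code for both Nvidia PTX and SPIR-V in one binary; the same
# executable can then run on either an Nvidia or an Intel GPU at runtime.
# main.cpp is a placeholder source file; assumes the oneAPI DPC++ compiler
# with Nvidia support installed.
icpx -fsycl -fsycl-targets=nvptx64-nvidia-cuda,spir64 main.cpp -o main
```

At runtime the device can then be chosen through the usual SYCL device selection mechanisms (for example, the ONEAPI_DEVICE_SELECTOR environment variable in DPC++).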

@NeoZhangJianyu (Collaborator)

@AidanBeltonS
What's the key benefit of this PR to users: performance, or ease of use?
The only benefit I see is the one you described, "one binary file supports multiple GPUs (Intel, NV, ...)".

But this PR doesn't actually provide that key feature to users.

In other words, how do we persuade users to use the SYCL backend for NV GPUs instead of the cuBLAS backend?

@AidanBeltonS (Collaborator, Author)

> @AidanBeltonS What's the key benefit of this PR to users: performance, or ease of use? The only benefit I see is the one you described, "one binary file supports multiple GPUs (Intel, NV, ...)".
>
> But this PR doesn't actually provide that key feature to users.
>
> In other words, how do we persuade users to use the SYCL backend for NV GPUs instead of the cuBLAS backend?

oneAPI does support a single binary file for multiple GPUs by default; oneMKL is the reason we cannot do that in this case. In the long term I think we should raise this issue and fix it, rather than not supporting Nvidia GPUs.
There are a few other reasons why we should want to support Nvidia GPUs:

  1. SYCL is a portable, performant standard, and we should use it as such.
  2. Supporting different vendors also makes the implementation more generic, increasing the likelihood that other SYCL implementations such as AdaptiveCpp will work with the code easily. This aids the development of an ecosystem rather than an exclusive backend.
  3. Testing and accessibility: the more devices we support, the easier it is to test and run CI, and the more people can use it, i.e. it gains adoption.
  4. Debugging: by using the same hardware across backends, we can isolate differences to the implementation and take the hardware out of the equation.

@abhilash1910 (Collaborator) left a comment


LGTM! Let's wait for CI.
cc @ggerganov

@NeoZhangJianyu NeoZhangJianyu merged commit 3814a07 into ggerganov:master Mar 11, 2024
60 checks passed
NeoZhangJianyu pushed a commit to NeoZhangJianyu/llama.cpp that referenced this pull request Mar 12, 2024
* Add support for nvidia target in CMake

* Update sycl read-me for Nvidia target

* Fix errors
jordankanter pushed a commit to jordankanter/llama.cpp that referenced this pull request Mar 13, 2024
* Add support for nvidia target in CMake

* Update sycl read-me for Nvidia target

* Fix errors
hodlen pushed a commit to hodlen/llama.cpp that referenced this pull request Apr 1, 2024
* Add support for nvidia target in CMake

* Update sycl read-me for Nvidia target

* Fix errors