Enable CUDA support. #123

pearu · 2020-03-15T18:51:30Z

Checklist

Used a fork of the feedstock to propose changes
Bumped the build number (if the version is unchanged)
Re-rendered with the latest conda-smithy (Use the phrase @conda-forge-admin, please rerender in a comment in this PR for automated rerendering)
Ensured the license file is being packaged.

conda-forge-linter · 2020-03-15T18:51:38Z

Hi! This is the friendly automated conda-forge-linting service.

I wanted to let you know that I linted all conda-recipes in your PR (recipe) and found some lint.

Here's what I've got...

For recipe:

Failed to even lint the recipe, probably because of a conda-smithy bug 😢. This likely indicates a problem in your meta.yaml, though. To get a traceback to help figure out what's going on, install conda-smithy and run conda smithy recipe-lint . from the recipe directory.

conda-forge-linter · 2020-03-15T18:51:56Z

Hi! This is the friendly automated conda-forge-linting service.

I just wanted to let you know that I linted all conda-recipes in your PR (recipe) and found it was in an excellent condition.

pearu · 2020-03-15T18:52:22Z

@conda-forge-admin, please rerender

github-actions · 2020-03-15T18:53:47Z

Hi! This is the friendly automated conda-forge-webservice.
I tried to rerender for you, but it looks like I wasn't able to push to the pearu/enable-cuda
branch of Quansight/arrow-cpp-feedstock. Did you check the "Allow edits from maintainers" box?

…onda-forge-pinning 2020.03.14

pearu · 2020-03-15T20:51:00Z

There are CI failures:

2020-03-15T20:31:40.8273787Z Traceback (most recent call last):
2020-03-15T20:31:40.8277197Z   File "/opt/conda/lib/python3.7/site-packages/conda_build/environ.py", line 764, in get_install_actions
<snip>
2020-03-15T20:31:40.8292864Z     raise ResolvePackageNotFound(bad_deps)
2020-03-15T20:31:40.8293255Z conda.exceptions.ResolvePackageNotFound: 
2020-03-15T20:31:40.8293952Z   - nvcc_linux-64=None

What is the proper way to fix it?

pearu · 2020-03-17T12:54:11Z

@isuruf do you have ideas how to fix the nvcc_linux-64=None issue reported above?

isuruf · 2020-03-17T15:25:04Z

That's there to enable a build without cuda. Do you want a build without cuda? Or do you want only builds with cuda? If the former, use - {{ compiler('cuda') }} # [linux64 and cuda_compiler_version != 'None']. If the latter, use skip: True # [linux64 and cuda_compiler_version=='None']

pearu · 2020-03-17T16:13:32Z

Thanks @isuruf for the hints! The answer to your question actually depends on whether a single feedstock can support both cuda and non-cuda versions of the packages for the same platform (linux64 here) and how. The aim is that arrow-cpp would be installable to environments without cuda (the current master behavior) and to environments with cuda (the extension from this PR, the package needs to include libarrow_cuda library).

Btw, in this particular case of arrow-cpp package, any cuda version of the package would likely work in environments with no cuda installed, libarrow_cuda library would just not be unused.

jakirkham · 2020-03-17T18:49:26Z

Would take a look at ucx-split as a simple example of supporting CUDA and non-CUDA builds in the same feedstock.

pearu · 2020-03-17T18:59:32Z

Thanks @jakirkham for the link! The current PR is using the same approach as ucx-split.

pearu · 2020-03-17T19:16:55Z

Please review.
@xhochy

kkraus14

Looks good to me, thanks @pearu!

jakirkham · 2020-03-17T22:24:24Z

Will we need similar changes for pyarrow?

isuruf · 2020-03-17T22:31:08Z

Looks like this only links to libcuda.so. Then you can use 9.2 to build against and have one variant right?

jakirkham · 2020-03-18T02:14:23Z

Yeah sorry Isuru. I may not be able to answer your question as I don't know enough about Arrow's code base. Hopefully someone more knowledgeable there can provide us some idea of what CUDA features they are using 🙂

xhochy · 2020-03-18T16:47:22Z

Will we need similar changes for pyarrow?

Yes, there is a pyarrow._cuda module which gets built only when CUDA is activated.

isuruf · 2020-03-18T16:54:30Z

@xhochy, this PR should not have been merged. As it is right now, some users who don't even have a GPU will get a proprietary binary blob (cuda-toolkit).

xhochy · 2020-03-18T17:03:50Z

@xhochy, this PR should not have been merged. As it is right now, some users who don't even have a GPU will get a proprietary binary blob (cuda-toolkit).

Oh, then I didn't understand the how selection mechanism for cuda vs non-cuda works.

xhochy · 2020-03-18T17:08:39Z

@isuruf Is there something easy to fix or should we revert and mark as broken?

isuruf · 2020-03-18T17:11:38Z

It's better to mark as broken for now. There are some issues that needs to be discussed before fixing.

This reverts commit 27d886a.

xhochy · 2020-03-18T17:18:42Z

@pearu I revert the PR on master, please reopen and then we can discuss the issues @isuruf is talking about.

@isuruf Please request changes in future, so I don't merge this accidentially again ;). From the discussion I thought everything was resolved.

jakirkham · 2020-03-18T17:43:38Z

Sorry I'm confused @isuruf, what is the issue here?

isuruf · 2020-03-18T17:55:42Z

Question to @xhochy. If arrow is built with CUDA enabled, do downstream packages need to enable CUDA too? (I know that downstream package can enable CUDA, but do they need to?)
If not, then we can have one single package with CUDA enabled and people who don't want CUDA will only have a small amount of code added.

@jakirkham, there's no features added here, so the conda solver is free to choose a CUDA build which brings in cudatoolkit (unless ignore_run_exports is used) which is a huge proprietary binary blob.

xhochy · 2020-03-18T17:57:33Z

Question to @xhochy. If arrow is built with CUDA enabled, do downstream packages need to enable CUDA too? (I know that downstream package can enable CUDA, but do they need to?)
If not, then we can have one single package with CUDA enabled and people who don't want CUDA will only have a small amount of code added.

No, downstream packages without cuda will just work fine. The CUDA support/usage is contained in libarrow_cuda${SHLIB_EXT}. All other shared objects don't deal link/operate with CUDA.

jakirkham · 2020-03-18T18:17:16Z

@jakirkham, there's no features added here, so the conda solver is free to choose a CUDA build which brings in cudatoolkit (unless ignore_run_exports is used) which is a huge proprietary binary blob.

Got it. Yeah this seems like a feature we would want for other reasons. Missed that wasn't included here.

For context @pearu @xhochy, we handle this in the ucx case by adding this ucx-proc output. This allows user selection of CUDA support or not.

Another way to solve this is to add some dlopen logic around loading the CUDA libraries you have built here with fallback handling if libcuda is not available. This is how openmpi solves this problem (as Isuru alluded to earlier).

jakirkham · 2020-03-18T18:34:01Z

Sorry I may have misunderstood your comment before, @xhochy. Are you already using dlopen? If so, maybe just adding cudatoolkit to ignore_run_exports and run_constrained would solve this issue.

xhochy · 2020-03-19T08:23:25Z

No, we're not using dlopen. I looked again a bit through the Arrow source code and the following happens:

We build libarrow_cuda which links against "cuda", all other libarrow_* libraries are exactly the same, no changed functionality, no link to "cuda"
We build the plasma store and libplasma differently based on whether we have CUDA support. Thus we need to have different Arrow builds for with-cuda and without-cuda.

xhochy · 2020-03-19T08:27:28Z

Marking the builds as broken: conda-forge/admin-requests#22

jakirkham · 2020-03-19T08:56:53Z

Ok then borrowing ucx-proc should work here (arrow-proc?).

pearu · 2020-03-19T17:42:17Z

I am not sure how the splitting of arrow-cpp should work.

ucx-split seems to produce two conda packages:

ucx - main package
ucx-proc with cpu and gpu build labels - I am not sure what it does ?

I can think of the following solution:

arrow-cpp - contains everything that the current master provides.
arrow-cpp-cuda - contains arrow-cpp plus libarrow_cuda library (and cuda specific libplasma).
Can this be achieved within one feedstock?

Btw, I have created #125 but it might be nonsense.

isuruf · 2020-03-19T17:54:00Z

Here's an option. Have 2 variants.

With CUDA but with __cuda >=9.2 as a run requirement. This package will be installed only if conda>=4.8 and the host has CUDA driver built
Without CUDA

jakirkham · 2020-03-19T18:25:50Z

Though users may want to use the CPU-only package on machines that have a GPU.

isuruf · 2020-03-19T18:32:40Z

@jakirkham, then they can use the build string to get the package that doesn't use the GPU.

jakirkham · 2020-03-19T18:41:39Z

Build strings of non-mutex package tend to have other things like build numbers, which can make this a bit hard to use in practice.

Enable CUDA support.

eeebf96

pearu requested review from cpcloud, jreback, kou, kszucs, leifwalsh, pitrou, robertnishihara, siddharthteotia, wesm and xhochy as code owners March 15, 2020 18:51

pearu added 3 commits March 15, 2020 20:58

Move ARROW_CUDA=ON to EXTRA_CMAKE_ARGS

dc43ab2

MNT: Re-rendered with conda-build 3.18.11, conda-smithy 3.6.12, and c…

fc6746e

…onda-forge-pinning 2020.03.14

MNT: Re-rendered with conda-build 3.18.11, conda-smithy 3.6.12, and c…

e23655f

…onda-forge-pinning 2020.03.14

pearu added 2 commits March 17, 2020 18:36

Enable CUDA conditionally

731173d

Fix tests

59b07e8

kkraus14 approved these changes Mar 17, 2020

View reviewed changes

pearu mentioned this pull request Mar 18, 2020

[OmniSciDB] Update feedstock-omniscidb-{cpu, cuda} Quansight/omnisci#117

Closed

xhochy approved these changes Mar 18, 2020

View reviewed changes

xhochy merged commit 27d886a into conda-forge:master Mar 18, 2020

pearu deleted the pearu/enable-cuda branch March 18, 2020 16:54

xhochy added a commit that referenced this pull request Mar 18, 2020

Revert "Enable CUDA support. (#123)"

a740823

This reverts commit 27d886a.

xhochy mentioned this pull request Mar 19, 2020

Mark arrow-0.16.0, linux, build 2 as broken conda-forge/admin-requests#22

Merged

pearu restored the pearu/enable-cuda branch March 19, 2020 13:21

isuruf mentioned this pull request Mar 20, 2020

Enable CUDA support [READY] #125

Merged

4 tasks

h-vetinari mentioned this pull request Jun 27, 2020

RFC: Rename arrow-cpp to libarrow #158

Closed

asfimport mentioned this pull request Jun 22, 2020

[C++/Python] Enable CUDA Support in conda recipes apache/arrow#24354

Closed

Uh oh!

Enable CUDA support. #123

Enable CUDA support. #123

Uh oh!

Conversation

pearu commented Mar 15, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

conda-forge-linter commented Mar 15, 2020

Uh oh!

conda-forge-linter commented Mar 15, 2020

Uh oh!

pearu commented Mar 15, 2020

Uh oh!

github-actions bot commented Mar 15, 2020

Uh oh!

pearu commented Mar 15, 2020

Uh oh!

pearu commented Mar 17, 2020

Uh oh!

isuruf commented Mar 17, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pearu commented Mar 17, 2020

Uh oh!

jakirkham commented Mar 17, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pearu commented Mar 17, 2020

Uh oh!

pearu commented Mar 17, 2020

Uh oh!

kkraus14 left a comment

Choose a reason for hiding this comment

Uh oh!

jakirkham commented Mar 17, 2020

Uh oh!

isuruf commented Mar 17, 2020

Uh oh!

jakirkham commented Mar 18, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

xhochy commented Mar 18, 2020

Uh oh!

isuruf commented Mar 18, 2020

Uh oh!

xhochy commented Mar 18, 2020

Uh oh!

xhochy commented Mar 18, 2020

Uh oh!

isuruf commented Mar 18, 2020

Uh oh!

xhochy commented Mar 18, 2020

Uh oh!

jakirkham commented Mar 18, 2020

Uh oh!

isuruf commented Mar 18, 2020

Uh oh!

xhochy commented Mar 18, 2020

Uh oh!

jakirkham commented Mar 18, 2020

Uh oh!

jakirkham commented Mar 18, 2020

Uh oh!

xhochy commented Mar 19, 2020

Uh oh!

xhochy commented Mar 19, 2020

Uh oh!

jakirkham commented Mar 19, 2020

Uh oh!

pearu commented Mar 19, 2020

Uh oh!

isuruf commented Mar 19, 2020

Uh oh!

jakirkham commented Mar 19, 2020

Uh oh!

isuruf commented Mar 19, 2020

Uh oh!

jakirkham commented Mar 19, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

pearu commented Mar 15, 2020 •

edited

Loading

isuruf commented Mar 17, 2020 •

edited

Loading

jakirkham commented Mar 17, 2020 •

edited

Loading

jakirkham commented Mar 18, 2020 •

edited

Loading

jakirkham commented Mar 19, 2020 •

edited

Loading