
Use cuda.bindings and cuda.core for Linker#133

Merged
gmarkall merged 91 commits into NVIDIA:main from brandon-b-miller:cuda-core-linker
Jun 27, 2025

Conversation

@brandon-b-miller
Contributor

WIP
xref #129

@leofang
Member

leofang commented Feb 22, 2025

Thanks, @brandon-b-miller. Remember our goal is to drop all of the Linker subclasses inside Numba in favor of cuda.core.Linker. The current PR is not what we want. Also note that to help phase out pynvjitlink, we already have rapidsai/pynvjitlink#111, which is essentially what this PR does today.

@brandon-b-miller brandon-b-miller changed the title Use cuda.bindings and cuda.core for nvjitlink Use cuda.bindings and cuda.core for Linker Feb 24, 2025
@copy-pr-bot

copy-pr-bot bot commented Feb 24, 2025

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@gmarkall gmarkall added the "2 - In Progress" (currently a work in progress) label Mar 7, 2025
@brandon-b-miller
Contributor Author

/ok to test

@brandon-b-miller
Contributor Author

@gmarkall @leofang numba-cuda contains nvjitlink tests; should we maintain support for these as part of this PR, or drop them in favor of testing upstream in cuda-python?

@gmarkall
Contributor

@gmarkall @leofang numba-cuda contains nvjitlink tests; should we maintain support for these as part of this PR, or drop them in favor of testing upstream in cuda-python?

I think we need to maintain the tests that exercise Numba-CUDA's interaction with the linker, like the TestLinkerUsage class and the test_*_with_linkable_code tests (tests with names like that). I don't think we need to keep the tests that purely test the PyNvJitLinker API, like the ones that test passing different flags etc. to it.

@gmarkall
Contributor

Also, I think we can probably delete the PyNvJitLinker class in this PR as well - is there any reason to keep it around?

I'm comfortable with:

  • Using cuda.core.Linker when the user asks for pynvjitlink or the NVIDIA bindings, and
  • Using the ctypes linker otherwise

which is what this PR seems to offer. (correct me if I've read it wrong 🙂)

@brandon-b-miller
Contributor Author

Also, I think we can probably delete the PyNvJitLinker class in this PR as well - is there any reason to keep it around?

I'm comfortable with:

  • Using cuda.core.Linker when the user asks for pynvjitlink or the NVIDIA bindings, and
  • Using the ctypes linker otherwise

which is what this PR seems to offer. (correct me if I've read it wrong 🙂)

Correct, this is the outcome I am aiming for.

@brandon-b-miller
Contributor Author

@gmarkall on second thought, we might need to leave the MVCLinker in, in some capacity, as long as we're supporting CUDA 11. I don't think cuda-python supports the functionality that cubinlinker enables.

@gmarkall
Contributor

@brandon-b-miller Sorry, yes - I had that in mind but didn't write it down.

@brandon-b-miller
Contributor Author

@brandon-b-miller Sorry, yes - I had that in mind but didn't write it down.

Ok, just to have it written down somewhere, after this PR we will:

For CUDA 11, maintain the current way of configuring which bindings to use:

  • Default ctypes bindings, optional cuda-python bindings with NUMBA_CUDA_USE_NVIDIA_BINDING=1, optional MVCLinker with NUMBA_CUDA_ENABLE_MINOR_VERSION_COMPATIBILITY=1.

For CUDA 12, we will have:

  • Default ctypes bindings, optional cuda-python bindings with NUMBA_CUDA_USE_NVIDIA_BINDING=1
  • Use of pynvjitlink through cuda-python if NUMBA_CUDA_ENABLE_PYNVJITLINK=1

This will leave us with 3 linkers:

  • The ctypes linker, which is used by default regardless of CUDA version
  • The MVC linker, which is used in a CUDA 11 environment when MVC is required, regardless of which binding is being used
  • The new linker, which is used in a CUDA 12 environment when either the cuda-python bindings or pynvjitlink is enabled
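The selection rules above can be sketched as a small dispatch function. This is a hypothetical illustration only: `select_linker` and the returned class names are stand-ins, not the real numba-cuda API, and the boolean arguments model the config flags discussed in this thread.

```python
def select_linker(cuda_major, use_nvidia_binding, enable_mvc, enable_pynvjitlink):
    """Illustrative sketch of the three-linker plan from this thread.

    All names here are stand-ins for illustration, not real numba-cuda classes.
    """
    if cuda_major == 11:
        if enable_mvc:
            # CUDA 11 with minor version compatibility needs cubinlinker,
            # so the MVC linker stays, regardless of which binding is in use
            return "MVCLinker"
        return "CtypesLinker"
    # CUDA 12: the new cuda.core-based linker covers both the NVIDIA
    # bindings and the former pynvjitlink use case
    if use_nvidia_binding or enable_pynvjitlink:
        return "NewLinker"
    return "CtypesLinker"
```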

@gmarkall
Contributor

Thanks for the summary! To look a little further ahead, we want to end up with only one linker, which is the new linker. This would be achieved by deprecating / removing the other linkers as soon as appropriate:

  • MVCLinker can be removed as soon as CUDA 11 support is dropped.
  • The ctypes linker can be deprecated and removed once we can have a hard dependency on cuda.core and have tested the new linker in use for a while to shake out any issues.

Are you in alignment with the above plan @brandon-b-miller ?

@brandon-b-miller
Contributor Author

Thanks for the summary! To look a little further ahead, we want to end up with only one linker, which is the new linker. This would be achieved by deprecating / removing the other linkers as soon as appropriate:

  • MVCLinker can be removed as soon as CUDA 11 support is dropped.
  • The ctypes linker can be deprecated and removed once we can have a hard dependency on cuda.core and have tested the new linker in use for a while to shake out any issues.

Are you in alignment with the above plan @brandon-b-miller ?

Yup this sounds good to me.

@brandon-b-miller
Contributor Author

/ok to test 85f8710

@brandon-b-miller
Contributor Author

/ok to test 547dab5

@brandon-b-miller
Contributor Author

/ok to test 3bd469d

@brandon-b-miller
Contributor Author

/ok to test ba5c20a

@brandon-b-miller
Contributor Author

/ok to test 134f6ee

@gmarkall
Contributor

I thought the code changes looked good, but I'm now hitting an error locally with this PR when the NVIDIA binding is disabled. For example:

$ NUMBA_CUDA_USE_NVIDIA_BINDING=0 python -m numba.runtests numba.cuda.tests -v
Traceback (most recent call last):
  File "<frozen runpy>", line 198, in _run_module_as_main
  File "<frozen runpy>", line 88, in _run_code
  File "/home/gmarkall/numbadev/numba/numba/runtests.py", line 9, in <module>
    sys.exit(0 if _main(sys.argv) else 1)
                  ^^^^^^^^^^^^^^^
  File "/home/gmarkall/numbadev/numba/numba/testing/_runtests.py", line 25, in _main
    return run_tests(argv, defaultTest='numba.tests',
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/gmarkall/numbadev/numba/numba/testing/__init__.py", line 54, in run_tests
    prog = NumbaTestProgram(argv=argv,
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/gmarkall/numbadev/numba/numba/testing/main.py", line 204, in __init__
    super(NumbaTestProgram, self).__init__(*args, **kwargs)
  File "/home/gmarkall/miniforge3/envs/numbadev/lib/python3.11/unittest/main.py", line 101, in __init__
    self.parseArgs(argv)
  File "/home/gmarkall/numbadev/numba/numba/testing/main.py", line 293, in parseArgs
    super(NumbaTestProgram, self).parseArgs(argv)
  File "/home/gmarkall/miniforge3/envs/numbadev/lib/python3.11/unittest/main.py", line 150, in parseArgs
    self.createTests()
  File "/home/gmarkall/miniforge3/envs/numbadev/lib/python3.11/unittest/main.py", line 161, in createTests
    self.test = self.testLoader.loadTestsFromNames(self.testNames,
                ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/gmarkall/miniforge3/envs/numbadev/lib/python3.11/unittest/loader.py", line 232, in loadTestsFromNames
    suites = [self.loadTestsFromName(name, module) for name in names]
             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/gmarkall/miniforge3/envs/numbadev/lib/python3.11/unittest/loader.py", line 232, in <listcomp>
    suites = [self.loadTestsFromName(name, module) for name in names]
              ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/gmarkall/miniforge3/envs/numbadev/lib/python3.11/unittest/loader.py", line 162, in loadTestsFromName
    module = __import__(module_name)
             ^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/gmarkall/numbadev/numba-cuda/numba_cuda/numba/cuda/__init__.py", line 57, in <module>
    raise RuntimeError("nvJitLink requires the NVIDIA CUDA bindings. ")
RuntimeError: nvJitLink requires the NVIDIA CUDA bindings. 

Just looking into why this is and why it doesn't seem to occur on the ctypes binding test in CI.

            "in place of pynvjitlink."
        )
    else:
        raise RuntimeError("nvJitLink requires the NVIDIA CUDA bindings. ")
@gmarkall
Contributor

gmarkall commented Jun 27, 2025

Because config.CUDA_ENABLE_PYNVJITLINK is enabled automatically if it's found in the environment, disabling the NVIDIA bindings if pynvjitlink is installed now leads to this exception being hit.

@gmarkall
Contributor

gmarkall left a comment

I think we can solve the issue of it not being possible to disable the NVIDIA bindings if pynvjitlink is installed by keeping track of whether it was enabled automatically, and just ignoring it if it was, like:

diff --git a/numba_cuda/numba/cuda/__init__.py b/numba_cuda/numba/cuda/__init__.py
index 430b3b7..e944fe0 100644
--- a/numba_cuda/numba/cuda/__init__.py
+++ b/numba_cuda/numba/cuda/__init__.py
@@ -9,6 +9,9 @@ import warnings
 # 1. Config setting "CUDA_ENABLE_PYNVJITLINK" (highest priority)
 # 2. Environment variable "NUMBA_CUDA_ENABLE_PYNVJITLINK"
 # 3. Auto-detection of pynvjitlink module (lowest priority)
+
+pynvjitlink_auto_enabled = False
+
 if getattr(config, "CUDA_ENABLE_PYNVJITLINK", None) is None:
     if (
         _pynvjitlink_enabled_in_env := _readenv(
@@ -17,9 +20,10 @@ if getattr(config, "CUDA_ENABLE_PYNVJITLINK", None) is None:
     ) is not None:
         config.CUDA_ENABLE_PYNVJITLINK = _pynvjitlink_enabled_in_env
     else:
-        config.CUDA_ENABLE_PYNVJITLINK = (
+        pynvjitlink_auto_enabled = (
             importlib.util.find_spec("pynvjitlink") is not None
         )
+        config.CUDA_ENABLE_PYNVJITLINK = pynvjitlink_auto_enabled
 
 # Upstream numba sets CUDA_USE_NVIDIA_BINDING to 0 by default, so it always
 # exists. Override, but not if explicitly set to 0 in the envioronment.
@@ -53,6 +57,11 @@ if config.CUDA_ENABLE_PYNVJITLINK:
             "NVIDIA bindings are enabled. cuda.core will be used "
             "in place of pynvjitlink."
         )
+    elif pynvjitlink_auto_enabled:
+        # Ignore the fact that pynvjitlink is enabled, because that was an
+        # automatic decision based on discovering pynvjitlink was present; the
+        # user didn't ask for it
+        pass
     else:
         raise RuntimeError("nvJitLink requires the NVIDIA CUDA bindings. ")
 

Does this seem like a workable solution? (It allows me to disable the NVIDIA binding for testing locally)
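The precedence logic in the diff above (explicit config setting first, then the environment variable, then auto-detection of the installed module) can be modeled as a standalone function. This is a sketch for illustration: `resolve_pynvjitlink` is a hypothetical name, and the real logic lives in numba_cuda/numba/cuda/__init__.py.

```python
def resolve_pynvjitlink(config_value, env_value, module_present):
    """Return (enabled, auto_enabled) following the precedence in the diff:

    1. Config setting CUDA_ENABLE_PYNVJITLINK (highest priority)
    2. Environment variable NUMBA_CUDA_ENABLE_PYNVJITLINK
    3. Auto-detection of the pynvjitlink module (lowest priority)

    auto_enabled is True only when the decision came from auto-detection,
    so callers can silently ignore the setting (rather than raise) when
    the NVIDIA bindings are disabled and the user never asked for it.
    """
    if config_value is not None:
        return bool(config_value), False
    if env_value is not None:
        return bool(env_value), False
    # Auto-detected: record that the user didn't explicitly opt in
    return module_present, module_present
```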

@leofang
Member

leofang commented Jun 27, 2025

I need to catch up with the progress here, but are we still keeping both cuda.core.Linker and pynvjitlink?

@brandon-b-miller
Contributor Author

I need to catch up with the progress here, but are we still keeping both cuda.core.Linker and pynvjitlink?

No, pynvjitlink won't be required. We're just puzzling over what to do for existing users who try to enable it, or have it in their environment, after this PR :)

@brandon-b-miller
Contributor Author

/ok to test 99c87f3

@gmarkall gmarkall merged commit 489045f into NVIDIA:main Jun 27, 2025
39 checks passed
gmarkall added a commit to gmarkall/numba-cuda that referenced this pull request Jul 2, 2025
- Updates for recent API changes (NVIDIA#313)
- Fix lineinfo generation when compile_internal used (NVIDIA#271) (NVIDIA#287)
- Build docs with NVIDIA Sphinx theme (NVIDIA#312)
- Don't skip debug tests when LTO enabled by default (NVIDIA#311)
- Use `cuda.bindings` and `cuda.core` for `Linker` (NVIDIA#133)
- Enable LTO by default when pynvjitlink is available (NVIDIA#310)
@gmarkall gmarkall mentioned this pull request Jul 2, 2025
gmarkall added a commit that referenced this pull request Jul 2, 2025
- Updates for recent API changes (#313)
- Fix lineinfo generation when compile_internal used (#271) (#287)
- Build docs with NVIDIA Sphinx theme (#312)
- Don't skip debug tests when LTO enabled by default (#311)
- Use `cuda.bindings` and `cuda.core` for `Linker` (#133)
- Enable LTO by default when pynvjitlink is available (#310)

Labels

4 - Waiting on author (Waiting for author to respond to review)


7 participants