Use `pathfinder` for dynamic libraries #308

brandon-b-miller · 2025-06-24T15:45:38Z

Towards #302

This PR switches to using pathfinder to locate nvvm and nvrtc within numba-cuda.

copy-pr-bot · 2025-06-24T15:45:41Z

Auto-sync is disabled for ready for review pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

brandon-b-miller · 2025-06-24T15:48:53Z

numba_cuda/numba/cuda/cudadrv/libs.py



 def _get_source_variable(lib, static=False):
+    # remove? only used in test()


Should we remove this function and test() below? cuda_paths also supports saying what the source of the loaded library is, such as if it's a wheel vs conda lib etc. Path finder treats this as an implementation detail.

We could always change it to just report the path without the source but its possible this function has proliferated a bit and is used for debugging out in the wild.

Path finder treats this as an implementation detail.

A hint could be added to the LoadedDL type if that's useful. Do you think it will be?

LoadedDL reliably includes the absolute path, even if the library was loaded outside of path finder. The path is more conclusive and usually it's very obvious if it's from a wheel, conda, or full CTK installation.

Thanks, that's helpful context. I think the best solution then would be one that maintains the current function, but decodes what the library source is on the numba side. I can take a whack at this and see how it looks.

If it's possible to keep the test() functionality I'd prefer that - it has been quite important for debugging in the past. For a lot of misconfigurations on peoples' systems, Numba-CUDA has not been the root cause of an issue but it is often the first place where a problem manifests, so it's been invaluable for showing people what the issue is and demonstrating that it's not just that "Numba-CUDA is broken". It's OK if the format of the output changes, but whatever information we can provide here from the CUDA pathfinder that provides parity with existing functionality will be helpful.

Also note that Numba will attempt to call it when producing sysinfo, so if it is removed entirely then it'll break numba sysinfo's (running python -m numba -s or numba -s) ability to show info about CUDA: https://github.com/numba/numba/blob/0c633b896526b039171c0f800df70320e1c61a53/numba/misc/numba_sysinfo.py#L381

but whatever information we can provide here from the CUDA pathfinder that provides parity with existing functionality will be helpful.

Ok, tx, I'll work on adding that in.

brandon-b-miller · 2025-06-24T15:49:30Z

numba_cuda/numba/cuda/cudadrv/libs.py

    configuration.
    """
-
+    # CUDA headers


These aren't a library but are still needed by numba, is this something pathfinder will support?

Yes, that's the plan.

+1 for this feature in pathfinder

leofang · 2025-06-26T15:03:18Z

@rwgk could you help review? (I can't add you as a reviewer for some reason)

rwgk

I don't have a lot of numba-cuda specific context, but what I see in this PR looks good to me.

rwgk · 2025-06-27T14:35:13Z

numba_cuda/numba/cuda/cudadrv/libs.py

    configuration.
    """
-
+    # CUDA headers


Yes, that's the plan.

rwgk · 2025-06-27T14:42:33Z

numba_cuda/numba/cuda/cudadrv/libs.py



 def _get_source_variable(lib, static=False):
+    # remove? only used in test()


Path finder treats this as an implementation detail.

A hint could be added to the LoadedDL type if that's useful. Do you think it will be?

LoadedDL reliably includes the absolute path, even if the library was loaded outside of path finder. The path is more conclusive and usually it's very obvious if it's from a wheel, conda, or full CTK installation.

leofang · 2025-06-27T14:50:48Z

numba_cuda/numba/cuda/cudadrv/libs.py

 from numba.cuda.cudadrv.driver import locate_driver_and_loader, load_driver
 from numba.cuda.cudadrv.error import CudaSupportError
 from numba.core import config
+from cuda.bindings import path_finder


@rwgk wouldn't this break once we make pathfinder a standalone module? (NVIDIA/cuda-python#723)

I was thinking not, because my intention is to keep the backward compatibility code for a cuda_bindings release or two.

I just saw @kkraus14 asked about adding a deprecation warning in that code (on NVIDIA/cuda-python#723): I'd don't think there is a rush to get rid of the few lines of backward compatibility code, therefore I'd lean towards making it easy for the world, by staging: one cuda_bindings release without deprecation warning, the next one or two cuda_bindings releases with warning, then remove the backward compatibility code.

(This is very similar to how features are deprecated in CPython.)

I'm good with the approach of phasing in the deprecation.

CLAassistant · 2025-08-20T02:00:02Z

Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you sign our Contributor License Agreement before we can accept your contribution.
_{You have signed the CLA already but the status is still pending? Let us recheck it.}

copy-pr-bot · 2025-09-19T04:18:26Z

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

brandon-b-miller · 2025-10-15T19:01:46Z

/ok to test

brandon-b-miller · 2025-10-16T21:02:47Z

/ok to test

brandon-b-miller · 2025-10-16T22:03:41Z

/ok to test

brandon-b-miller · 2025-10-16T22:06:45Z

/ok to test

brandon-b-miller · 2025-10-17T16:44:11Z

/ok to test

greptile-apps

_{5 files reviewed, 1 comment}

_{Edit Code Review Agent Settings | Greptile}

ci/test_thirdparty_awkward.sh

brandon-b-miller · 2026-01-20T17:14:11Z

/ok to test

greptile-apps

_{5 files reviewed, 3 comments}

_{Edit Code Review Agent Settings | Greptile}

numba_cuda/numba/cuda/cuda_paths.py

numba_cuda/numba/cuda/tests/nocuda/test_library_lookup.py

ci/test_thirdparty_awkward.sh

greptile-apps

_{5 files reviewed, 2 comments}

_{Edit Code Review Agent Settings | Greptile}

numba_cuda/numba/cuda/cuda_paths.py

numba_cuda/numba/cuda/tests/nocuda/test_library_lookup.py

brandon-b-miller · 2026-01-20T18:18:30Z

/ok to test

brandon-b-miller · 2026-01-20T18:56:31Z

/ok to test

greptile-apps

_{5 files reviewed, 1 comment}

_{Edit Code Review Agent Settings | Greptile}

ci/test_thirdparty_awkward.sh

brandon-b-miller · 2026-01-20T19:10:15Z

/ok to test

numba_cuda/numba/cuda/cuda_paths.py

pyproject.toml

brandon-b-miller · 2026-01-20T21:39:42Z

/ok to test

greptile-apps

_{5 files reviewed, 1 comment}

_{Edit Code Review Agent Settings | Greptile}

pyproject.toml

Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>

brandon-b-miller · 2026-01-20T22:42:30Z

/ok to test

numba_cuda/numba/cuda/cuda_paths.py

brandon-b-miller · 2026-01-21T02:27:50Z

/ok to test

rwgk

Looks good to me!

- Add Python 3.14 to the wheel publishing matrix (NVIDIA#750) - feat: swap out internal device array usage with `StridedMemoryView` (NVIDIA#703) - Fix max block size computation in `forall` (NVIDIA#744) - Fix prologue debug line info pointing to decorator instead of def line (NVIDIA#746) - Fix kernel return type in DISubroutineType debug metadata (NVIDIA#745) - Fix missing line info in Jupyter notebooks (NVIDIA#742) - Fix: Pass correct flags to linker when debugging in the presence of LTOIR code (NVIDIA#698) - chore(deps): add cuda-pathfinder to pixi deps (NVIDIA#741) - fix: enable flake8-bugbear lints and fix found problems (NVIDIA#708) - fix: Fix race condition in CUDA Simulator (NVIDIA#690) - ci: run tests in parallel (NVIDIA#740) - feat: users can pass `shared_memory_carveout` to @cuda.jit (NVIDIA#642) - Fix compatibility with NumPy 2.4: np.trapz and np.in1d removed (NVIDIA#739) - Pass the -numba-debug flag to libnvvm (NVIDIA#681) - ci: remove rapids containers from conda ci (NVIDIA#737) - Use `pathfinder` for dynamic libraries (NVIDIA#308) - CI: Add CUDA 13.1 testing support (NVIDIA#705) - Adding `pixi run test` and `pixi run test-par` support (NVIDIA#724) - Disable per-PR nvmath tests + follow same test practice (NVIDIA#723) - chore(deps): regenerate pixi lockfile (NVIDIA#722) - Fix DISubprogram line number to point to function definition line (NVIDIA#695) - revert: chore(dev): build pixi using rattler (NVIDIA#713) (NVIDIA#719) - [feat] Initial version of the Numba CUDA GDB pretty-printer (NVIDIA#692) - chore(dev): build pixi using rattler (NVIDIA#713) - build(deps): bump the actions-monthly group across 1 directory with 8 updates (NVIDIA#704)

- Add Python 3.14 to the wheel publishing matrix (#750) - feat: swap out internal device array usage with `StridedMemoryView` (#703) - Fix max block size computation in `forall` (#744) - Fix prologue debug line info pointing to decorator instead of def line (#746) - Fix kernel return type in DISubroutineType debug metadata (#745) - Fix missing line info in Jupyter notebooks (#742) - Fix: Pass correct flags to linker when debugging in the presence of LTOIR code (#698) - chore(deps): add cuda-pathfinder to pixi deps (#741) - fix: enable flake8-bugbear lints and fix found problems (#708) - fix: Fix race condition in CUDA Simulator (#690) - ci: run tests in parallel (#740) - feat: users can pass `shared_memory_carveout` to @cuda.jit (#642) - Fix compatibility with NumPy 2.4: np.trapz and np.in1d removed (#739) - Pass the -numba-debug flag to libnvvm (#681) - ci: remove rapids containers from conda ci (#737) - Use `pathfinder` for dynamic libraries (#308) - CI: Add CUDA 13.1 testing support (#705) - Adding `pixi run test` and `pixi run test-par` support (#724) - Disable per-PR nvmath tests + follow same test practice (#723) - chore(deps): regenerate pixi lockfile (#722) - Fix DISubprogram line number to point to function definition line (#695) - revert: chore(dev): build pixi using rattler (#713) (#719) - [feat] Initial version of the Numba CUDA GDB pretty-printer (#692) - chore(dev): build pixi using rattler (#713) - build(deps): bump the actions-monthly group across 1 directory with 8 updates (#704)

brandon-b-miller added 3 commits June 17, 2025 13:35

use pathfinder naively for dynamic libs

7f56f01

other needed cuda components

c2f7611

a comment

977c8c9

brandon-b-miller commented Jun 24, 2025

View reviewed changes

gmarkall added the 3 - Ready for Review Ready for review by team label Jun 26, 2025

rwgk approved these changes Jun 27, 2025

View reviewed changes

leofang reviewed Jun 27, 2025

View reviewed changes

leofang mentioned this pull request Jul 1, 2025

Move pathfinder to cuda-python top level NVIDIA/cuda-python#723

Merged

ZzEeKkAa mentioned this pull request Jul 2, 2025

Fix nvrtc resolution when CUDA_HOME env is set #314

Merged

leofang marked this pull request as draft September 19, 2025 04:18

rwgk mentioned this pull request Oct 2, 2025

[FEA]: Make pathfinder usable as a full replacement for cuda_paths.py in numba-cuda NVIDIA/cuda-python#1036

Open

brandon-b-miller added 2 commits October 15, 2025 11:56

merge/resolve/pass

2765eae

deps

738702c

brandon-b-miller changed the title ~~Deprecate cuda_paths in favor of using cuda.bindings.path_finder to find CUDA components~~ Use pathfinder for dynamic libraries Oct 15, 2025

brandon-b-miller marked this pull request as ready for review October 15, 2025 19:07

updates

2bfa543

simpler

a0756f7

clean

03cddf2

small fixes

0ab6939

brandon-b-miller added the 0 - Blocked Cannot progress due to external reasons label Oct 24, 2025