Vendor in BaseNativeLowering and BaseLower for CUDA-specific customizations #329

Merged
gmarkall merged 5 commits into NVIDIA:main from VijayKandiah:vk/cuda_native_lowering on Jul 29, 2025

Conversation

@VijayKandiah
Contributor

This PR vendors in BaseNativeLowering, to be inherited by CUDANativeLowering, and BaseLower and Lower, to be inherited by CUDALower. This is a refactoring change that allows for future CUDA-specific customizations. No new unit tests are needed.
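A minimal sketch of the inheritance pattern this refactor enables is below; the class bodies and hooks are illustrative assumptions, not numba-cuda's actual implementation:

```python
# Illustrative sketch only: toy stand-ins for the vendored classes, not
# numba-cuda's real module layout or APIs.

class BaseLower:
    """Vendored copy of the upstream lowering base class."""

    def pre_lower(self):
        # Shared hook that upstream Numba runs before lowering a function.
        pass


class CUDALower(BaseLower):
    """CUDA target lowering, now free to diverge from upstream Numba."""

    def pre_lower(self):
        super().pre_lower()
        # Future CUDA-specific customizations slot in here.
```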

@copy-pr-bot

copy-pr-bot bot commented Jul 17, 2025

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@VijayKandiah
Contributor Author

/ok to test 4cc0d01

gmarkall added the '2 - In Progress' (Currently a work in progress) label on Jul 18, 2025
@gmarkall
Contributor

/ok to test 4cc0d01

VijayKandiah force-pushed the vk/cuda_native_lowering branch from 4cc0d01 to bb7163b on July 24, 2025 at 16:42
@VijayKandiah
Contributor Author

/ok to test bb7163b

@gmarkall
Contributor

I think this is probably making Numba-CUDA incompatible with Numba 0.60, which is why the CUDA 11 / Python 3.9 tests are failing (Numba 0.61 is not available for Python 3.9).

gmarkall added the '4 - Waiting on author' (Waiting for author to respond to review) label and removed the '2 - In Progress' (Currently a work in progress) label on Jul 25, 2025
@copy-pr-bot

copy-pr-bot bot commented Jul 25, 2025

Auto-sync is disabled for ready for review pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

@VijayKandiah
Contributor Author

/ok to test b412a4e

@VijayKandiah
Contributor Author

This PR also raises the minimum Numba version required to 0.60.0 in pyproject.toml:
`dependencies = ["numba>=0.60.0"]`

gmarkall added the '4 - Waiting on reviewer' (Waiting for reviewer to respond to author) label and removed the '4 - Waiting on author' (Waiting for author to respond to review) label on Jul 28, 2025
Comment on lines 36 to 37:

```python
numba_version = get_versions()["version"].split(".")
numba_minor_version = int(numba_version[1])
```
Contributor

For future reference, we can get the version info with numba.version_info.major, numba.version_info.minor, etc.
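A minimal sketch of that suggestion, using only the numba.version_info fields named above:

```python
import numba

# version_info exposes the parsed Numba version, avoiding manual string
# splitting of get_versions()["version"].
if (numba.version_info.major, numba.version_info.minor) >= (0, 61):
    pass  # code paths that need Numba 0.61+
else:
    pass  # fallbacks that keep Numba 0.60 supported
```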

@VijayKandiah
Contributor Author

/ok to test c11c73d

VijayKandiah force-pushed the vk/cuda_native_lowering branch from c11c73d to 9ed1d65 on July 28, 2025 at 22:07
@VijayKandiah
Contributor Author

/ok to test 9ed1d65

gmarkall merged commit 2a46811 into NVIDIA:main on Jul 29, 2025
39 checks passed
gmarkall added a commit to atmnp/numba-cuda that referenced this pull request Jul 29, 2025
…nager

There were conflicts in:

- `numba_cuda/numba/cuda/compiler.py`
- `numba_cuda/numba/cuda/core/typed_passes.py`

There was some overlap with the changes in NVIDIA#329, which I tried to
resolve.

Now a couple of debug tests are failing.
VijayKandiah self-assigned this on Jul 29, 2025
gmarkall added a commit to gmarkall/numba-cuda that referenced this pull request Jul 31, 2025
- [NFC] FileCheck tests check all overloads (NVIDIA#354)
- [REVIEW][NFC] Vendor in serialize to allow for future CUDA-specific refactoring and changes (NVIDIA#349)
- Vendor in usecases used in testing (NVIDIA#359)
- Add thirdparty tests of numba extensions (NVIDIA#348)
- Support running tests in parallel (NVIDIA#350)
- Add more debuginfo tests (NVIDIA#358)
- [REVIEW][NFC] Vendor in the Cache, CacheImpl used by CUDACache and CUDACacheImpl to allow for future CUDA-specific refactoring and changes (NVIDIA#334)
- [NFC] Vendor in Dispatcher as CUDADispatcher to allow for future CUDA-specific customization (NVIDIA#338)
- Vendor in BaseNativeLowering and BaseLower for CUDA-specific customizations (NVIDIA#329)
- [REVIEW] Vendor in the CompilerBase used by CUDACompiler to allow for future CUDA-specific refactoring and changes (NVIDIA#322)
- Vendor in Codegen and CodeLibrary for CUDA-specific customization (NVIDIA#327)
- Disable tests that deadlock due to NVIDIA#317 (NVIDIA#356)
- FIX: Add type check for shape elements in DeviceNDArrayBase constructor (NVIDIA#352)
- Merge pull request NVIDIA#265 from lakshayg/fp16-support
- Add performance warning
- Fix tests
- Create and register low++ bindings for float16
- Create typing/target registries for float16
- Replace Numbast generated lower_casts
- Replace Numbast generated operators
- Alias __half to numba.core.types.float16
- Generate fp16 bindings using numbast
- Remove existing fp16 logic
- [REVIEW][NFC] Vendor in the utils and cgutils to allow for future CUDA-specific refactoring and changes (NVIDIA#340)
- [RFC,TESTING] Add filecheck test infrastructure (NVIDIA#342)
- Migrate test infra to pytest (NVIDIA#347)
- Add .vscode to gitignore (NVIDIA#344)
- [NFC] Add dev dependencies to project config (NVIDIA#341)
- Allow Inspection of Link-Time Optimized PTX (NVIDIA#326)
- [NFC] Vendor in DIBuilder used by CUDADIBuilder (NVIDIA#332)
- Add guidance on setting up pre-commit (NVIDIA#339)
- [Refactor][NFC] Vendor in MinimalCallConv (NVIDIA#333)
- [Refactor][NFC] Vendor in BaseCallConv (NVIDIA#324)
- [REVIEW] Vendor in CompileResult as CUDACompileResult to allow for future CUDA-specific customizations (NVIDIA#325)
gmarkall mentioned this pull request on Jul 31, 2025
gmarkall added a commit that referenced this pull request Jul 31, 2025