[REVIEW] Vendor in the CompilerBase used by CUDACompiler to allow for future CUDA-specific refactoring and changes by atmnp · Pull Request #322 · NVIDIA/numba-cuda

atmnp · 2025-07-16T19:09:01Z

This PR moves the CompilerBase class and its relevant helpers into the Numba CUDA repo. It is an incremental NFC (no functional change) refactor, so no extra unit tests are added.
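To illustrate what an NFC vendoring step implies, here is a minimal toy sketch. All class names, method bodies, and the `make_compiler` helper are hypothetical stand-ins for illustration only; they are not the actual Numba or numba-cuda code. The point is only that a subclass should behave identically whether it derives from the upstream base or from the vendored copy.

```python
# Toy illustration only: simplified stand-ins, NOT the real Numba or
# numba-cuda implementations.


class UpstreamCompilerBase:
    """Stands in for the base class as it lives in the upstream package."""

    def define_pipelines(self):
        raise NotImplementedError

    def compile(self, source):
        # Run each pipeline stage in order over the input.
        result = source
        for stage in self.define_pipelines():
            result = stage(result)
        return result


class VendoredCompilerBase:
    """Stands in for the copy moved into the local repo (same behavior)."""

    def define_pipelines(self):
        raise NotImplementedError

    def compile(self, source):
        result = source
        for stage in self.define_pipelines():
            result = stage(result)
        return result


def make_compiler(base):
    """Build a toy subclass on top of the given base (hypothetical helper)."""

    class ToyCUDACompiler(base):
        def define_pipelines(self):
            return [str.strip, str.upper]

    return ToyCUDACompiler()


before = make_compiler(UpstreamCompilerBase).compile("  kernel  ")
after = make_compiler(VendoredCompilerBase).compile("  kernel  ")

# NFC: swapping the base class must not change observable behavior.
assert before == after
```

Once the base class lives in the local repo, later changes can specialize it without waiting on or affecting the upstream package, which is the motivation stated in the PR title.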

copy-pr-bot · 2025-07-16T19:09:05Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

gmarkall · 2025-07-18T13:59:41Z

/ok to test 4227eb6

gmarkall · 2025-07-23T13:41:09Z

/ok to test 4dd1d0a

atmnp · 2025-07-24T15:04:00Z

/ok to test cc46b9a

gmarkall · 2025-07-28T12:25:17Z

/ok to test

copy-pr-bot · 2025-07-28T12:25:20Z

/ok to test

@gmarkall, there was an error processing your request: E1

See the following link for more information: https://docs.gha-runners.nvidia.com/cpr/e/1/

gmarkall · 2025-07-28T13:03:09Z

/ok to test be885f6

- [NFC] FileCheck tests check all overloads (NVIDIA#354)
- [REVIEW][NFC] Vendor in serialize to allow for future CUDA-specific refactoring and changes (NVIDIA#349)
- Vendor in usecases used in testing (NVIDIA#359)
- Add thirdparty tests of numba extensions (NVIDIA#348)
- Support running tests in parallel (NVIDIA#350)
- Add more debuginfo tests (NVIDIA#358)
- [REVIEW][NFC] Vendor in the Cache, CacheImpl used by CUDACache and CUDACacheImpl to allow for future CUDA-specific refactoring and changes (NVIDIA#334)
- [NFC] Vendor in Dispatcher as CUDADispatcher to allow for future CUDA-specific customization (NVIDIA#338)
- Vendor in BaseNativeLowering and BaseLower for CUDA-specific customizations (NVIDIA#329)
- [REVIEW] Vendor in the CompilerBase used by CUDACompiler to allow for future CUDA-specific refactoring and changes (NVIDIA#322)
- Vendor in Codegen and CodeLibrary for CUDA-specific customization (NVIDIA#327)
- Disable tests that deadlock due to NVIDIA#317 (NVIDIA#356)
- FIX: Add type check for shape elements in DeviceNDArrayBase constructor (NVIDIA#352)
- Merge pull request NVIDIA#265 from lakshayg/fp16-support
- Add performance warning
- Fix tests
- Create and register low++ bindings for float16
- Create typing/target registries for float16
- Replace Numbast generated lower_casts
- Replace Numbast generated operators
- Alias __half to numba.core.types.float16
- Generate fp16 bindings using numbast
- Remove existing fp16 logic
- [REVIEW][NFC] Vendor in the utils and cgutils to allow for future CUDA-specific refactoring and changes (NVIDIA#340)
- [RFC,TESTING] Add filecheck test infrastructure (NVIDIA#342)
- Migrate test infra to pytest (NVIDIA#347)
- Add .vscode to gitignore (NVIDIA#344)
- [NFC] Add dev dependencies to project config (NVIDIA#341)
- Allow Inspection of Link-Time Optimized PTX (NVIDIA#326)
- [NFC] Vendor in DIBuilder used by CUDADIBuilder (NVIDIA#332)
- Add guidance on setting up pre-commit (NVIDIA#339)
- [Refactor][NFC] Vendor in MinimalCallConv (NVIDIA#333)
- [Refactor][NFC] Vendor in BaseCallConv (NVIDIA#324)
- [REVIEW] Vendor in CompileResult as CUDACompileResult to allow for future CUDA-specific customizations (NVIDIA#325)


atmnp changed the title ~~[WIP] Vendor in the CompilerBase used by CUDACompiler to allow for future CUDA-specific refactoring and changes~~ [REVIEW] Vendor in the CompilerBase used by CUDACompiler to allow for future CUDA-specific refactoring and changes Jul 16, 2025

gmarkall added the "3 - Ready for Review" and "2 - In Progress" labels and removed the "3 - Ready for Review" label on Jul 18, 2025

atmnp force-pushed the atmn/vendor-in-compiler branch from 4227eb6 to 4dd1d0a on July 22, 2025 21:46

[Refactor][NFC] Vendor in CompilerBase

cc46b9a

atmnp force-pushed the atmn/vendor-in-compiler branch from 4dd1d0a to cc46b9a on July 24, 2025 15:03

gmarkall added the "3 - Ready for Review" label and removed the "2 - In Progress" label on Jul 25, 2025

Merge branch 'main' into atmn/vendor-in-compiler

be885f6

gmarkall added the "4 - Waiting on CI" label and removed the "3 - Ready for Review" label on Jul 28, 2025

gmarkall approved these changes Jul 28, 2025


gmarkall added the "5 - Ready to merge" label and removed the "4 - Waiting on CI" label on Jul 28, 2025

gmarkall merged commit 1f0ab9c into NVIDIA:main Jul 28, 2025
39 checks passed

VijayKandiah assigned atmnp Jul 29, 2025

gmarkall mentioned this pull request Jul 31, 2025

Bump version to 0.18.0 #365

Merged

gmarkall merged 2 commits into NVIDIA:main from atmnp:atmn/vendor-in-compiler

2 participants
