[FEA] Support JITting and Linking LTOIR from cuda source inputs #45

isVoid · 2024-08-23T01:28:34Z

Is your feature request related to a problem? Please describe.
After Pynvjitlink is introduced into Numba-CUDA, I would like to be able to jit cuda source files into LTOIR and link it with nvjitlink. This basically requires the linker lto flag to be respected by the jit and linker machinery.

Describe the solution you'd like
Add lto flag to nvrtc.compile, and add -dlto flag to compile option when set. In the Linker class, use add_ltoir in add_cu when self.lto is set.

The text was updated successfully, but these errors were encountered:

isVoid · 2024-08-23T03:24:49Z

As a follow up to this PR - it would be helpful to support -ptx flag for nvjitlink to emit the PTX under LTOIR linkage, given NUMBA_DUMP_ASSEMBLY is set. This makes sure that Numba-CUDA shows the LTO-ed PTX and is more useful in debugging optimization related issues.

isVoid added the feature request New feature or request label Aug 23, 2024

gmarkall added this to the v0.0.18 milestone Oct 21, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FEA] Support JITting and Linking LTOIR from cuda source inputs #45

[FEA] Support JITting and Linking LTOIR from cuda source inputs #45

isVoid commented Aug 23, 2024

isVoid commented Aug 23, 2024 •

edited

Loading

[FEA] Support JITting and Linking LTOIR from cuda source inputs #45

[FEA] Support JITting and Linking LTOIR from cuda source inputs #45

Comments

isVoid commented Aug 23, 2024

isVoid commented Aug 23, 2024 • edited Loading

isVoid commented Aug 23, 2024 •

edited

Loading