JIT compile interleaved_scan_kernel for CUDA 13
#1405
Merged
rapids-bot merged 105 commits into rapidsai:main from divyegala:jit-lto-ivf-flat-interleaved on Feb 14, 2026
Commits
a024f61 jit lto interleaved scan (divyegala)
45da4aa fix dependencies.yaml (divyegala)
a7c8621 generate files at build time, use tags to avoid compilation of types (divyegala)
eb2d74b passing tests (divyegala)
d2318e8 update gitignore (divyegala)
5e6afcd separate out distance function from main kernel (divyegala)
6eee4da fix deps (divyegala)
1de8f28 add filters as jit device functions, rework caching logic (divyegala)
84c6020 lto post lambda, cleanup files, generate cmake in build dir (divyegala)
22680c8 don't read hardcoded kernels, use generator properly (divyegala)
37f1163 random cmake changes carried over from 25.10 (divyegala)
0ae5383 cmake format (divyegala)
fe56aec remove dep on kernel list (divyegala)
40c8fd6 attempt to solve overlinking problem (divyegala)
e87a8c7 reorder if-else in compiler check (divyegala)
179d733 Merge branch 'branch-25.12' into jit-lto-ivf-flat-interleaved (divyegala)
32a67bd use cudart apis (divyegala)
c27612e merge (divyegala)
a4b48b1 attempt to link cudart (divyegala)
d5d692e revert cudart link, try all arch build of jit lto fatbin sources (divyegala)
1c6dd94 cmake format (divyegala)
30f5ab6 missing shared mem setting (divyegala)
9674969 separate cuda 12 and 13 compilation (divyegala)
24fc47d merge upstream (divyegala)
db9a487 remove bench (divyegala)
aa9294f c include directory (divyegala)
2eb77fe style check (divyegala)
6c685fa merge upstream (divyegala)
3e35b99 guard cuda calls and use shared_ptr (divyegala)
d0ff62c add AlgorithmPlanner to main target (divyegala)
eb87577 merge upstream (divyegala)
445a6c4 remove nvjitlink as cuda 12 dep (divyegala)
92a27d4 address review (divyegala)
8549172 merge upstream (divyegala)
67579f4 add include guard (divyegala)
7ad8774 add and remove couple of comments (divyegala)
816a480 merge upstream (divyegala)
ab35ef3 delete readme (divyegala)
cdd4c85 increase warmup time (divyegala)
87334b2 merge upstream (divyegala)
c1eff9f use new copyright (divyegala)
ece09b8 new copyright (divyegala)
4dacc6e remove one more straggling comment (divyegala)
1fd95cd use raft expects (divyegala)
64cde0d Merge branch 'main' into jit-lto-ivf-flat-interleaved (divyegala)
5ac127b merge upstream (divyegala)
78002c6 address review (divyegala)
9ad6a0b pre-commit (divyegala)
bf4c4ad address review (divyegala)
18b2af9 Generate kernel files in CMake instead of Python (KyleFromNVIDIA)
ece5cad Merge remote-tracking branch 'refs/remotes/github/divyegala/jit-lto-i… (KyleFromNVIDIA)
8ce70c2 Style (KyleFromNVIDIA)
fdc4239 Style (KyleFromNVIDIA)
be3cf0d Style (KyleFromNVIDIA)
7e644c3 Lint (KyleFromNVIDIA)
235938a Style, lint (KyleFromNVIDIA)
e3b749d Fix nvjitlink_checker (KyleFromNVIDIA)
f42ae3f Style (KyleFromNVIDIA)
b606df9 Merge branch 'main' into jit-lto-ivf-flat-interleaved (KyleFromNVIDIA)
5ce7aab Refactor JIT LTO kernel compilation (KyleFromNVIDIA)
eaad347 Style (KyleFromNVIDIA)
eb3b468 pic (KyleFromNVIDIA)
912279c style (KyleFromNVIDIA)
19f1af3 Verbose build (KyleFromNVIDIA)
087b943 static (KyleFromNVIDIA)
c16e109 style (KyleFromNVIDIA)
323b79f TARGET_OBJECTS (KyleFromNVIDIA)
9f13e73 Disable sccache (KyleFromNVIDIA)
eaf9d39 Recache (KyleFromNVIDIA)
ce40c51 Revert CI debugging (KyleFromNVIDIA)
0d0abb9 Install and link object library (KyleFromNVIDIA)
84bfa92 Style (KyleFromNVIDIA)
21241eb Alias (KyleFromNVIDIA)
7c0ac13 Make cuvs_jit_lto_kernels a static library (KyleFromNVIDIA)
880dbf2 Style (KyleFromNVIDIA)
d04d7c1 rapids_cuda_init_architectures() for C tests (KyleFromNVIDIA)
19581f9 Be more specific about where we search for libclang (KyleFromNVIDIA)
a61f019 More libclang updates (KyleFromNVIDIA)
2eeb913 Revert "Fix libclang download for Rust, CUDA initialization for C tests" (KyleFromNVIDIA)
55ec26c Merge branch 'main' into jit-lto-ivf-flat-interleaved (KyleFromNVIDIA)
10228c5 Merge branch 'main' into jit-lto-ivf-flat-interleaved (KyleFromNVIDIA)
031ce21 Merge branch 'main' into jit-lto-ivf-flat-interleaved (KyleFromNVIDIA)
088c21e Copyright (KyleFromNVIDIA)
8ca1062 Apply suggestions from code review (divyegala)
d5ab5bf merge upstream (divyegala)
b8c0d42 address some review comments (divyegala)
17d34ae remove too many underscores (divyegala)
282b376 Simplify (KyleFromNVIDIA)
609a4d6 Merge branch 'main' into jit-lto-ivf-flat-interleaved (KyleFromNVIDIA)
3115d07 address reviews (divyegala)
bb524ae Merge remote-tracking branch 'origin/main' into jit-lto-ivf-flat-inte… (divyegala)
30a8a9f Merge branch 'jit-lto-ivf-flat-interleaved' of github.com:divyegala/c… (divyegala)
72ddb36 Merge branch 'main' into jit-lto-ivf-flat-interleaved (divyegala)
4bd2102 add to docs and log about jit (divyegala)
fb722f0 Merge branch 'jit-lto-ivf-flat-interleaved' of github.com:divyegala/c… (divyegala)
3523b96 Merge remote-tracking branch 'origin/main' into jit-lto-ivf-flat-inte… (divyegala)
ba758a2 address review (divyegala)
42b78ae rename inner_product to inner_prod (divyegala)
2e3a471 Merge remote-tracking branch 'origin/main' into jit-lto-ivf-flat-inte… (divyegala)
f6377fa include header and form better log (divyegala)
533b770 address review and move (divyegala)
af23585 Merge remote-tracking branch 'origin/main' into jit-lto-ivf-flat-inte… (divyegala)
78c59d9 one more fix (divyegala)
7f8802b correct path (divyegala)
27acbb6 merge upstream (divyegala)