perf: speed up StringViewArray gc 1.4 ~5.x faster #7873
Merged
37 commits
e0728ed Perf: implement fast gc for string view (zhuqi-lucas)
02f2870 Merge remote-tracking branch 'upstream/main' into fast_gc (zhuqi-lucas)
5815519 polish (zhuqi-lucas)
f5488cc format (zhuqi-lucas)
4c3c7ee polish code (zhuqi-lucas)
1601ab6 Address comments (zhuqi-lucas)
a809c98 remove unused code (zhuqi-lucas)
2bb5b93 Merge remote-tracking branch 'upstream/main' into fast_gc (zhuqi-lucas)
5b5a05c don't use value_unchecked which is duplicating check len, etc (zhuqi-lucas)
3cb9431 polish code (zhuqi-lucas)
de6a199 fix comments (zhuqi-lucas)
55fb826 Merge remote-tracking branch 'upstream/main' into fast_gc (zhuqi-lucas)
d4b7243 benchmark: Add StringViewArray gc benchmark with not null cases (zhuqi-lucas)
a774986 fmt and comments (zhuqi-lucas)
75eb0d0 Merge branch 'add_not_null_gc_benchmark' into fast_gc (zhuqi-lucas)
fbd47d6 Use unchecked (zhuqi-lucas)
b214ab7 fmt (zhuqi-lucas)
adc2b1f address comments (zhuqi-lucas)
5310dd1 Merge branch 'add_not_null_gc_benchmark' into fast_gc (zhuqi-lucas)
1a01926 optimize null check (zhuqi-lucas)
39cf32c fix clippy (zhuqi-lucas)
4786066 address comments and polish code (zhuqi-lucas)
3350638 address comments (zhuqi-lucas)
7a08ce0 Merge remote-tracking branch 'upstream/main' into fast_gc (zhuqi-lucas)
32f27cb add rich tests and add fast path for inline data and fix empty buffer (zhuqi-lucas)
adb8605 fmt (zhuqi-lucas)
219aabf Address comments (zhuqi-lucas)
c16d236 Don't need null caculation (zhuqi-lucas)
6e6387d Merge remote-tracking branch 'upstream/main' into fast_gc (zhuqi-lucas)
135cbeb fast path for no data buffer (zhuqi-lucas)
625c421 Merge remote-tracking branch 'apache/main' into fast_gc (alamb)
2097747 Update arrow-array/src/array/byte_view_array.rs (zhuqi-lucas)
c1a1065 Update arrow-array/src/array/byte_view_array.rs (zhuqi-lucas)
797b63e address comments (zhuqi-lucas)
5e30749 fmt (zhuqi-lucas)
0110ef9 Merge remote-tracking branch 'upstream/main' into fast_gc (zhuqi-lucas)
214f844 Merge remote-tracking branch 'apache/main' into fast_gc (alamb)
There are some good tricks here that maybe we can apply to coalesce, in arrow-rs/arrow-select/src/coalesce/byte_view.rs (line 114 in 38a7a1a).
The only difference is the ability to extend existing views/buffers rather than allocate entirely new buffers 🤔
Good point @alamb, I added a sub-task and will start the investigation after this PR, thanks!
Improve the performance for coalesce with StringView
In epic: #7802
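
For readers following the thread, here is a rough, dependency-free sketch of what compacting a view array involves. It is not the code from this PR: `make_view`, `read_view`, and `gc_views` are hypothetical helpers written for illustration, nulls are ignored, and only the 16-byte view layout (length, then either up to 12 inline bytes or a 4-byte prefix, buffer index, and offset) is taken from the Arrow format. The sketch pre-sizes one new buffer, leaves inline views untouched, and returns early when no view references a data buffer, which loosely mirrors the fast paths named in the commit messages above.

```rust
/// Values of at most this many bytes live inline in the 16-byte view.
const MAX_INLINE_LEN: u32 = 12;

/// Hypothetical helper: encodes `data` as a view. Long values are appended
/// to `buffer`, which the view references as buffer number `buffer_idx`.
fn make_view(data: &[u8], buffer: &mut Vec<u8>, buffer_idx: u32) -> u128 {
    let len = data.len() as u32;
    if len <= MAX_INLINE_LEN {
        // Inline layout: 4-byte length followed by the bytes themselves.
        let mut bytes = [0u8; 16];
        bytes[0..4].copy_from_slice(&len.to_le_bytes());
        bytes[4..4 + data.len()].copy_from_slice(data);
        return u128::from_le_bytes(bytes);
    }
    // Out-of-line layout: length, 4-byte prefix, buffer index, offset.
    let offset = buffer.len() as u32;
    buffer.extend_from_slice(data);
    let prefix = u32::from_le_bytes([data[0], data[1], data[2], data[3]]);
    (len as u128)
        | ((prefix as u128) << 32)
        | ((buffer_idx as u128) << 64)
        | ((offset as u128) << 96)
}

/// Hypothetical helper: decodes the bytes a view refers to.
fn read_view(view: u128, buffers: &[Vec<u8>]) -> Vec<u8> {
    let len = view as u32 as usize; // the low 32 bits hold the length
    if len <= MAX_INLINE_LEN as usize {
        return view.to_le_bytes()[4..4 + len].to_vec();
    }
    let idx = (view >> 64) as u32 as usize;
    let off = (view >> 96) as u32 as usize;
    buffers[idx][off..off + len].to_vec()
}

/// Sketch of a gc pass: copies only the bytes still referenced by `views`
/// into one freshly allocated buffer and rewrites the views to point at it.
fn gc_views(views: &[u128], buffers: &[Vec<u8>]) -> (Vec<u128>, Vec<Vec<u8>>) {
    // First pass: how many bytes do the out-of-line views reference in total?
    let total: usize = views
        .iter()
        .map(|&v| v as u32)
        .filter(|&len| len > MAX_INLINE_LEN)
        .map(|len| len as usize)
        .sum();
    // Fast path: everything is inline, so no data buffer is needed at all.
    if total == 0 {
        return (views.to_vec(), Vec::new());
    }
    let mut compact = Vec::with_capacity(total);
    let mut new_views = Vec::with_capacity(views.len());
    for &v in views {
        let len = v as u32;
        if len <= MAX_INLINE_LEN {
            new_views.push(v); // inline views are copied unchanged
            continue;
        }
        let idx = (v >> 64) as u32 as usize;
        let off = (v >> 96) as u32 as usize;
        let new_off = compact.len() as u32;
        compact.extend_from_slice(&buffers[idx][off..off + len as usize]);
        // Keep length and prefix; point at buffer 0 and the new offset.
        new_views.push((v & (u64::MAX as u128)) | ((new_off as u128) << 96));
    }
    (new_views, vec![compact])
}
```

A coalesce-style variant, as raised in the review comment, would presumably append into an already existing buffer and view list instead of allocating `compact` and `new_views` from scratch; that part is not shown here.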