-
Notifications
You must be signed in to change notification settings - Fork 4k
ARROW-10163: [Rust] [DataFusion] Add DictionaryArray coercion support #8463
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ARROW-10163: [Rust] [DataFusion] Add DictionaryArray coercion support #8463
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
this is just a refactor, it is not meant to change the semantics
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I updated the parameter names here to make this macro more consistent with its documentation
rust/datafusion/tests/sql.rs
Outdated
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This test demonstrates DictionaryArrays being used in DataFusion
8630722 to
434bd01
Compare
jorgecarleitao
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I went through this and LGTM. Thanks a lot, @alamb . Really sweet and easy to follow.
The comments on the tests helped a lot!
nevi-me
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM, there's a comment from Jorge @alamb, let us know if you'd like to address it, otherwise we can merge this
434bd01 to
cd319b0
Compare
Thanks @nevi-me -- I responded to @jorgecarleitao and I have rebased this PR against master. I think it is good to go, but I will wait for @jorgecarleitao 's response |
This PR adds: 1. Basic `DictionaryArray` coercion (not cast) support in DataFusion 2. A test in `sql.rs` demonstrating basic operations using DataFusion on `DictionaryArray` arrays Note that the performance operating on `DictionaryArrays` is likely to leave a lot to be desired -- specifically almost any operation will cause the `DictionaryArray` to get unpacked to a normal array reducing most/all of any performance gains. I plan to add additional performance improvements over time -- but I felt getting queries to run was the first important step. Closes #8463 from alamb/alamb/ARROW-10159-dictionary-array-coercion-take-3 Authored-by: alamb <[email protected]> Signed-off-by: Neville Dipale <[email protected]>
This PR adds:
DictionaryArraycoercion (not cast) support in DataFusionsql.rsdemonstrating basic operations using DataFusion onDictionaryArrayarraysNote that the performance operating on
DictionaryArraysis likely to leave a lot to be desired -- specifically almost any operation will cause theDictionaryArrayto get unpacked to a normal array reducing most/all of any performance gains.I plan to add additional performance improvements over time -- but I felt getting queries to run was the first important step.