ORC: Use ConstantReader for identity partition columns #1191
    }

    @Override
    public C read(ColumnVector ignored, int ignoredRow) {
Is ColumnVector still materialized? Is it possible to avoid reading that entirely?
Yes, I guess we can do that by not asking ORC to project these columns. Let me give it a try.
That's going to be a bigger time savings, and it's what we do for Parquet: we just drop the column from the projection we pass down to the format.
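The idea in this thread can be sketched roughly as follows. This is only an illustration under stated assumptions, not the code in this PR: `ProjectionSketch`, `ValueReader`, `fileProjection`, and the column names are hypothetical and are not Iceberg or ORC APIs. It shows a constant reader that never touches its vector, and a projection that leaves out columns whose values are already known from partition metadata.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch: none of these types are the Iceberg or ORC classes.
final class ProjectionSketch {

  // Reads one value for a row from a column batch of type V.
  interface ValueReader<V, T> {
    T read(V vector, int row);
  }

  // Returns the same value for every row and never touches the vector,
  // so the vector may be null when the column was not projected.
  static final class ConstantReader<V, T> implements ValueReader<V, T> {
    private final T constant;

    ConstantReader(T constant) {
      this.constant = constant;
    }

    @Override
    public T read(V ignored, int ignoredRow) {
      return constant;
    }
  }

  // Columns to request from the file: the expected columns minus those
  // whose values are already known as constants.
  static List<String> fileProjection(List<String> expected, Map<String, ?> constants) {
    List<String> projected = new ArrayList<>();
    for (String column : expected) {
      if (!constants.containsKey(column)) {
        projected.add(column);
      }
    }
    return projected;
  }

  public static void main(String[] args) {
    // Assumed example: "event_date" is an identity partition column.
    Map<String, Object> constants = new LinkedHashMap<>();
    constants.put("event_date", "2020-07-10");

    List<String> expected = Arrays.asList("id", "event_date", "payload");
    System.out.println(fileProjection(expected, constants));

    ValueReader<Object, Object> reader = new ConstantReader<>(constants.get("event_date"));
    System.out.println(reader.read(null, 0));
  }
}
```

In this sketch the file is only asked for `id` and `payload`, while `event_date` is served from the constant for every row, so no vector for it needs to be materialized.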
+1, just a minor issue.
I think this code will be changed a bit in #1021 to handle not just constant columns but also metadata columns, since we would want to avoid materializing a
Fixes #1056
cc: @rdsr, since you worked on this initially.