Skip to content

Conversation

@shardulm94
Copy link
Contributor

@shardulm94 shardulm94 commented Jul 10, 2020

Fixes #1056

cc: @rdsr Since you had worked on this initially

}

@Override
public C read(ColumnVector ignored, int ignoredRow) {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is ColumnVector still materialized? Is it possible to avoid reading that entirely?

Copy link
Contributor Author

@shardulm94 shardulm94 Jul 10, 2020

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, I guess we can do that by not asking ORC to project these columns. Let me give it a try.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

That's going to be a bigger time savings and that's what we do for Parquet. We just drop it from the projection we pass down to the format.

@rdblue
Copy link
Contributor

rdblue commented Jul 10, 2020

+1, just a minor issue.

@shardulm94
Copy link
Contributor Author

shardulm94 commented Jul 10, 2020

I think this code will be changed a bit in #1021 to handle not just constant columns but also metadata columns, since we would want to avoid materializing a ColumnVector for metadata columns too.

@rdblue rdblue merged commit a2796f0 into apache:master Jul 11, 2020
@shardulm94 shardulm94 deleted the 1056 branch July 15, 2020 03:51
aokolnychyi pushed a commit to aokolnychyi/iceberg that referenced this pull request Aug 18, 2020
cmathiesen pushed a commit to ExpediaGroup/iceberg that referenced this pull request Aug 19, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Use ConstantReaders for identity partition columns in OrcValueReader

2 participants