Skip to content

Conversation

@ajantha-bhat
Copy link
Member

Some filters fails with exception mentioned in #3756.
Hence use the spark resolved expressions in rewrite_data_files procedure

Fixes #3756

@ajantha-bhat
Copy link
Member Author

cc: @RussellSpitzer , @jackye1995

@jackye1995 jackye1995 added this to the Iceberg 0.13.0 Release milestone Dec 16, 2021
@jackye1995
Copy link
Contributor

Thanks. I think this is short enough to fix, could you directly do it across all versions affected?

@ajantha-bhat
Copy link
Member Author

Thanks. I think this is short enough to fix, could you directly do it across all versions affected?

@jackye1995 : I will do that once PR is approved. Else I might have to rework on multiple files if I get comments.

@ajantha-bhat
Copy link
Member Author

@jackye1995 , @RussellSpitzer @huaxingao : I have addressed the comments. Please have a look at it again

@ajantha-bhat ajantha-bhat force-pushed the expression branch 2 times, most recently from c8e2656 to b19f0ba Compare December 20, 2021 15:29
@ajantha-bhat
Copy link
Member Author

@RussellSpitzer , @jackye1995 , @huaxingao : PR is ready. Please take a look at it again.

Copy link
Contributor

@jackye1995 jackye1995 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

looks good to me, thanks for working on this! @ajantha-bhat please let me know if you would like to have also add fixes in the other version in this PR, or have separated PRs, up to you.

private void insertData(int filesCount) {
ThreeColumnRecord record1 = new ThreeColumnRecord(1, "foo", "detail1");
ThreeColumnRecord record2 = new ThreeColumnRecord(2, "bar", "detail2");
ThreeColumnRecord record1 = new ThreeColumnRecord(1, "foo", null);
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I missed why was this changed?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

while adding a new test cases for all possible filters, I wanted some null data so that my NOT NULL filter will not execute compaction (doesn't select data). C3 was never used in the testcases. So reused it with null data.

Copy link
Member

@RussellSpitzer RussellSpitzer left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

One question about the null's in the test insert, but i'm good for merge as well.

@ajantha-bhat
Copy link
Member Author

looks good to me, thanks for working on this! @ajantha-bhat please let me know if you would like to have also add fixes in the other version in this PR, or have separated PRs, up to you.

I think separated is better as we are following for spark PRs.

@jackye1995 jackye1995 merged commit 952b987 into apache:master Dec 22, 2021
@jackye1995
Copy link
Contributor

merged, thanks @ajantha-bhat for the fix, and thanks @RussellSpitzer and @rdblue for reviews!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

In-filter throws exception for rewrite_data_files procedure.

5 participants