-
Notifications
You must be signed in to change notification settings - Fork 3k
Implement performing row-level delta to branches #5595
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
61d90ed to
6880349
Compare
|
Er. It may make more sense to come back to row level delta after implementing branch writing for the other operations. There's more validation logic to modify and meaningful testing of row level delta is difficult without implementing branch writes for the other operations. |
| /** | ||
| * Validates that no delete files matching a filter have been added to the table since a starting | ||
| * snapshot. | ||
| * ToDo: Remove after branch writing implementation complete |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We can remove the existing validate after we implement the different write operations because we will always have an ending snapshot. This is fine because validations are all protected/private.
| AssertHelpers.assertThrows( | ||
| "Should fail to commit when validating from non-ancestor snapshot", | ||
| ValidationException.class, | ||
| "Cannot commit, missing data files", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
need to fix the message
|
Looks like @namrathamyske also has a PR for this https://github.com/apache/iceberg/pull/5234/files/ , the approach is fundamentally the same but I was thinking it makes more sense to iterate on one operation at a time rather than doing all the merging snapshot producers at once. We can have copies of the validate methods which accept an ending snapshot. That way we can validate the code and tests for a single operation at a time which seems easier for review IMO, but open to discussion |
|
Actually the approach in @namrathamyske works because for the other operations the branch write wouldn't be supported. It's just passing through the end snapshot. So in that case, that PR makes sense, we can just continue discussion on there then. |
No description provided.