aokolnychyi commented Nov 11, 2022

This PR preserves file sequence numbers when rewriting manifests in Spark 3.3.

  long sequenceNumber = row.getLong(1);
- Row file = row.getStruct(2);
- writer.existing(wrapper.wrap(file), snapshotId, sequenceNumber);
+ Long fileSequenceNumber = row.isNullAt(2) ? null : row.getLong(2);

aokolnychyi Nov 11, 2022

This action is long overdue for refactoring and optimization. Computing the column indices would be more reliable than hardcoding them, but I want to unblock the release ASAP.
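
To illustrate what computing indices could look like, here is a minimal sketch. It is not the code in this PR: `SimpleRow` and `SequenceNumbers` are hypothetical stand-ins written for this example, and the column names are assumptions. The idea is to resolve a column position from its name, so that adding a column does not shift hardcoded offsets, while still keeping a missing file sequence number as `null`. Spark's `Row` offers a comparable name-based lookup through `fieldIndex`.

```java
import java.util.List;

// Hypothetical stand-in for a row with a schema; not the Spark or Iceberg API.
final class SimpleRow {
  private final List<String> fieldNames;
  private final Object[] values;

  SimpleRow(List<String> fieldNames, Object... values) {
    if (fieldNames.size() != values.length) {
      throw new IllegalArgumentException("Field names and values differ in length");
    }
    this.fieldNames = fieldNames;
    this.values = values;
  }

  // Resolves a column position from its name and fails loudly if it is absent.
  int fieldIndex(String name) {
    int index = fieldNames.indexOf(name);
    if (index < 0) {
      throw new IllegalArgumentException("Unknown field: " + name);
    }
    return index;
  }

  boolean isNullAt(int index) {
    return values[index] == null;
  }

  long getLong(int index) {
    return (Long) values[index];
  }
}

final class SequenceNumbers {
  private SequenceNumbers() {}

  // Reads a nullable long column by name, keeping null as null
  // instead of coercing it to a default such as 0.
  static Long readNullableLong(SimpleRow row, String name) {
    int index = row.fieldIndex(name);
    return row.isNullAt(index) ? null : row.getLong(index);
  }
}
```

With these helpers, a call such as `SequenceNumbers.readNullableLong(row, "file_sequence_number")` would replace the literal `2` in the snippet above, at the cost of one name lookup per row unless the index is resolved once and reused.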

rdblue merged commit 186ee7e into apache:master on Nov 15, 2022

rdblue commented Nov 15, 2022

Thanks, @aokolnychyi!
