Skip to content

Conversation

@rajarshisarkar
Copy link
Contributor

I noticed that use-starting-sequence-number option is missing while developing #4377 and #4579. Adding support for the same in this PR.

When use-starting-sequence-number is enabled then compaction should use the sequence number of the snapshot at compaction start time for new data files, instead of using the sequence number of the newly produced snapshot. This avoids commit conflicts with updates that add newer equality deletes at a higher sequence number.


cc: @RussellSpitzer @jackye1995 @szehon-ho

@github-actions github-actions bot added the spark label May 5, 2022
@rajarshisarkar rajarshisarkar force-pushed the rewrite-data-files-with-starting-sequence-number branch from 92d29b8 to 9e4a797 Compare May 5, 2022 12:47
@RussellSpitzer RussellSpitzer merged commit bf582eb into apache:master May 5, 2022
@RussellSpitzer
Copy link
Member

Thanks @rajarshisarkar !

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants