-
Notifications
You must be signed in to change notification settings - Fork 2.5k
[HUDI-1518] Remove the logic that delete replaced file when archive #3310
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[HUDI-1518] Remove the logic that delete replaced file when archive #3310
Conversation
Codecov Report
@@ Coverage Diff @@
## master #3310 +/- ##
============================================
- Coverage 47.83% 47.78% -0.05%
- Complexity 5565 5598 +33
============================================
Files 936 938 +2
Lines 41663 41815 +152
Branches 4197 4211 +14
============================================
+ Hits 19929 19983 +54
- Misses 19960 20045 +85
- Partials 1774 1787 +13
Flags with carried forward coverage won't be shown. Click here to find out more.
Continue to review full report at Codecov.
|
|
@satishkotha do you mind taking a second look at this? Seems ok to do. |
satishkotha
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM. Couple minor comments to see if we can remove more dead code.
|
|
||
| TableFileSystemView fileSystemView = this.table.getFileSystemView(); | ||
| List<String> replacedPartitions = getReplacedPartitions(instant); | ||
| return ReplaceArchivalHelper.deleteReplacedFileGroups(context, metaClient, fileSystemView, instant, replacedPartitions); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Do we need ReplaceArchivalHelper class?
|
|
||
| // when using insert_overwrite or insert_overwrite_table | ||
| // first commit may without replaceFileIds | ||
| createReplaceMetadataWithoutReplaceFileId("000"); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Could we also remove these helper methods?
|
@hudi-bot run azure LGTM. i'll merge after azure builds pass |
|
Hi @satishkotha Thanks a lot for your review and merge! |
…pache#3310) * remove delete replaced file when archive * done * remove unsed import * remove delete replaced files when archive realted UT * code reviewed Co-authored-by: yuezhang <[email protected]>
…pache#3310) * remove delete replaced file when archive * done * remove unsed import * remove delete replaced files when archive realted UT * code reviewed Co-authored-by: yuezhang <[email protected]>
…rchive (apache#3310)" This reverts commit 2c2cab1e32e42cd59b5edee9b37dfd1c5946a9de.
…pache#3310) * remove delete replaced file when archive * done * remove unsed import * remove delete replaced files when archive realted UT * code reviewed Co-authored-by: yuezhang <[email protected]>
…rchive (apache#3310)" This reverts commit 2c2cab1e32e42cd59b5edee9b37dfd1c5946a9de.
What is the purpose of the pull request
Since #2422 Cleaner now takes care of removing replaced file groups. This call to deleteReplacedFileGroups can be removed.
Also it's a little confused that archive can delete data file even users set auto.clean false.
Brief change log
(for example:)
Verify this pull request
(Please pick either of the following options)
This pull request is a trivial rework / code cleanup without any test coverage.
(or)
This pull request is already covered by existing tests, such as (please describe tests).
(or)
This change added tests and can be verified as follows:
(example:)
Committer checklist
Has a corresponding JIRA in PR title & commit
Commit message is descriptive of the change
CI is green
Necessary doc changes done or have another open PR
For large changes, please consider breaking it into sub-tasks under an umbrella JIRA.