[HUDI-52] Enabling savepoint and restore for MOR table#4507
[HUDI-52] Enabling savepoint and restore for MOR table#4507codope merged 2 commits intoapache:masterfrom
Conversation
123962d to
2968b17
Compare
codope
left a comment
There was a problem hiding this comment.
Change looks good to me. Can you please check the CI failure? The same test failed even after retrying.
2968b17 to
cb52a5a
Compare
* Enabling restore for MOR table * Fixing savepoint for compaction commits in MOR
* Enabling restore for MOR table * Fixing savepoint for compaction commits in MOR
* Enabling restore for MOR table * Fixing savepoint for compaction commits in MOR
There was a problem hiding this comment.
I was kind of (pleasantly) surprised how simple this was.
How about these scenarios?
- If we perform a savepoint from the CLI, while there are inflight writes or pending table services?
- In line 92, we only work with the base file view? how would this correctly restore the log data?
|
Key thing to verify here is that - compaction or cleaning does not affect the file slice that is part of the save point |
|
If we perform a savepoint from the CLI, while there are inflight writes or pending table services? In line 92, we only work with the base file view? how would this correctly restore the log data? |
|
Key thing to verify here is that - compaction or cleaning does not affect the file slice that is part of the save point |
* Enabling restore for MOR table * Fixing savepoint for compaction commits in MOR
|
Tested savepoint/restore for all cases that I can think of. documented the cases here https://issues.apache.org/jira/browse/HUDI-3705 |
What is the purpose of the pull request
Adding/Enabling restore for MOR table
Brief change log
Verify this pull request
This change added tests and can be verified as follows:
TestHoodieSparkMergeOnReadTableRollback.testMORTableRestore
Verified the below cases manually (without metadata enabled) via hudi-cli
a. insert, upsert, upsert, savepoint, upsert, rollback. validate records.
b. insert, upsert, upsert, savepoint, upsert, upsert, rollback. validate records.
c. insert, upsert, upsert, savepoint, upsert, insert, rollback. validate records.
d. insert, upsert, upsert, savepoint, upsert, upsert, compact, rollback. validate records.
e. insert, upsert, upsert, savepoint, upsert, upsert, compact, upsert, rollback. validate records. upsert, validate records.
Committer checklist
Has a corresponding JIRA in PR title & commit
Commit message is descriptive of the change
CI is green
Necessary doc changes done or have another open PR
For large changes, please consider breaking it into sub-tasks under an umbrella JIRA.