[HUDI-3520] Introduce DeleteSupportSchemaPostProcessor to support adding _hoodie_is_deleted column to schema #4921
Conversation
@alexeykudinkin: Can you review this, please?
Thanks for the contribution. Left some comments.
...c/main/java/org/apache/hudi/utilities/schema/processor/DeleteSupportSchemaPostProcessor.java
...-utilities/src/main/java/org/apache/hudi/utilities/schema/processor/SchemaPostProcessor.java
@nsivabalan, please take another look when you are free.
…ing _hoodie_is_deleted column to schema
This adds a useful missing piece. Thank you for contributing this, @wangxianghu.
nsivabalan
left a comment
LGTM. I see we are adding a few schema post processors (delete in this patch, and JSON in another patch).
So, can you look into adding general support for multiple schema post processors in deltastreamer (in a follow-up patch)?
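One possible shape for that follow-up, sketched with plain Java types only. This is not Hudi code: the class name ChainedPostProcessorSketch and the method chain are hypothetical, and a generic UnaryOperator stands in for a schema post processor. The idea is simply to apply each configured processor in order, feeding the output of one into the next.

```java
import java.util.List;
import java.util.function.UnaryOperator;

// Hypothetical sketch: S stands in for a schema type, and each UnaryOperator<S>
// stands in for one schema post processor. None of these names come from Hudi.
class ChainedPostProcessorSketch {

    // Builds a single processor that applies every given processor in list order.
    static <S> UnaryOperator<S> chain(List<UnaryOperator<S>> processors) {
        return schema -> {
            S current = schema;
            for (UnaryOperator<S> processor : processors) {
                current = processor.apply(current);
            }
            return current;
        };
    }
}
```

With a design like this, the order of the list matters whenever one processor depends on fields added or removed by an earlier one.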
@nsivabalan
…ing _hoodie_is_deleted column to schema (apache#4921)
What is the purpose of the pull request
In our use case, data is read from Kafka (binlog with some preprocessing) and the schema is read from Hive, so we need to add the _hoodie_is_deleted column to the schema to support delete operations.
Brief change log
(for example:)
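A minimal sketch of the idea, under stated assumptions: it does not use Avro or Hudi classes, and it is not the code in this PR. A record schema is modelled as an ordered map from field name to type name, and the hypothetical helper addDeleteField appends a boolean _hoodie_is_deleted field when the schema does not already have one. Leaving an existing field untouched is an assumption of this sketch.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Simplified stand-in for a schema post processor. The class and method names
// are hypothetical; a real implementation would work on an Avro Schema.
class DeleteFieldSketch {

    static final String DELETE_FIELD = "_hoodie_is_deleted";

    // Returns a copy of the field map with the delete marker appended as a
    // boolean field. The input map is not modified. If the field is already
    // present, its existing type is kept (an assumption of this sketch).
    static Map<String, String> addDeleteField(Map<String, String> fields) {
        Map<String, String> result = new LinkedHashMap<>(fields);
        result.putIfAbsent(DELETE_FIELD, "boolean");
        return result;
    }

    public static void main(String[] args) {
        Map<String, String> source = new LinkedHashMap<>();
        source.put("id", "long");
        source.put("name", "string");
        System.out.println(addDeleteField(source));
    }
}
```

Copying the input rather than mutating it keeps the source schema usable elsewhere, for example when the same schema provider output is also used for reading.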
Verify this pull request
This pull request is already covered by existing tests, such as:
org.apache.hudi.utilities.schema.TestSchemaPostProcessor#testDeleteSupport
Committer checklist
Has a corresponding JIRA in PR title & commit
Commit message is descriptive of the change
CI is green
Necessary doc changes done or have another open PR
For large changes, please consider breaking it into sub-tasks under an umbrella JIRA.