[HUDI-2815] add partial overwrite payload to support partial overwrit… #4724
Conversation
@hudi-bot run azure
danny0405 left a comment
Thanks for the contribution ~ I kind of think we'd better do this after id-based schema evolution is supported; only then will we have a more lightweight solution to support per-record schemas.
Generally, carrying the schema with the record does not seem like a good solution.
boolean choosePrev = data1.equals(reducedData);
boolean choosePrev = data2.compareTo(data1) < 0;
HoodieKey reducedKey = choosePrev ? rec1.getKey() : rec2.getKey();
HoodieOperation operation = choosePrev ? rec1.getOperation() : rec2.getOperation();
Why do we need a compareTo here?
The previous logic of data2.preCombine(data1) returns either data1 or data2, ordered by their orderingVal. But if we merge/combine data1 and data2 into a new payload (reducedData), data1.equals(reducedData) is always false. In order to get the HoodieKey and HoodieOperation for the new HoodieRecord with reducedData, we need to take the latest HoodieKey and HoodieOperation from data1 and data2; compareTo replaces #preCombine for comparing their orderingVal.
@Override
public int compareTo(OverwriteWithLatestAvroPayload oldValue) {
return orderingVal.compareTo(oldValue.orderingVal);
}
@Test
public void testCompareFunction() {
GenericRecord record = new GenericData.Record(schema);
record.put("id", "1");
record.put("partition", "partition1");
record.put("ts", 0L);
record.put("_hoodie_is_deleted", false);
record.put("city", "NY0");
record.put("child", Arrays.asList("A"));
PartialOverwriteWithLatestAvroPayload payload1 = new PartialOverwriteWithLatestAvroPayload(record, 1);
PartialOverwriteWithLatestAvroPayload payload2 = new PartialOverwriteWithLatestAvroPayload(record, 2);
assertEquals(payload1.compareTo(payload2), -1);
assertEquals(payload2.compareTo(payload1), 1);
assertEquals(payload1.compareTo(payload1), 0);
}
Actually, rec1 and rec2 should have the same HoodieKey here, right? But the HoodieOperation might differ.
return new PartialOverwriteWithLatestAvroPayload(currentRecord, chooseCurrent ? this.orderingVal : oldValue.orderingVal, this.schema);
} else {
  return isDeleteRecord(insertRecord) ? this : oldValue;
}
We should be cautious with the DELETEs; should we still merge DELETE messages?
Yeah, if either record is a DELETE record, we just return it directly, no merge needed; the DELETE message deletes the old record during the Hudi write. Only when neither record is a DELETE record do we need to merge them.
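A minimal, self-contained sketch of that rule (illustrative names, not the PR's actual payload code), using plain Avro and the _hoodie_is_deleted flag that the test snippet earlier also uses:

import org.apache.avro.Schema;
import org.apache.avro.SchemaBuilder;
import org.apache.avro.generic.GenericData;
import org.apache.avro.generic.GenericRecord;

public class DeleteAwareCombineSketch {

  // Treat a record as a delete when its _hoodie_is_deleted flag is true.
  static boolean isDeleteRecord(GenericRecord record) {
    Object flag = record.get("_hoodie_is_deleted");
    return flag instanceof Boolean && (Boolean) flag;
  }

  // If either side is a DELETE, skip the field-wise merge entirely and let the
  // record pass through unchanged (the diff quoted above picks between this and
  // oldValue); only two non-delete records would actually be merged.
  static GenericRecord combine(GenericRecord incoming, GenericRecord old) {
    if (isDeleteRecord(incoming) || isDeleteRecord(old)) {
      return incoming;
    }
    // ... field-wise partial merge of two non-delete records would happen here
    return incoming;
  }

  public static void main(String[] args) {
    Schema schema = SchemaBuilder.record("Rec").fields()
        .requiredString("id")
        .requiredBoolean("_hoodie_is_deleted")
        .endRecord();

    GenericRecord upsert = new GenericData.Record(schema);
    upsert.put("id", "1");
    upsert.put("_hoodie_is_deleted", false);

    GenericRecord delete = new GenericData.Record(schema);
    delete.put("id", "1");
    delete.put("_hoodie_is_deleted", true);

    System.out.println(isDeleteRecord(combine(delete, upsert))); // true: the delete was not merged away
  }
}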
@Override
public void initializeState(FunctionInitializationContext context) {
  ValueStateDescriptor<HoodieRecordGlobalLocation> indexStateDesc =
  ValueStateDescriptor<HoodieRecord> indexStateDesc =
This would increase the state size significantly. We'd better avoid this with a better solution. Why must we store the full record instead of the index here?
Thanks @danny0405 for reviewing this.
Yeah, as mentioned in this issue #4030.
The reason why we need the full record here is to handle the case where the record's partition path has changed.
In the original logic, once the record partition path changes, we sink a delete record to the old partition file to delete the old record, and then sink the incoming record into the new partition file; the final record then only contains the info from the incoming record and misses the info from the old record. (Note that OverwriteNonDefaultsWithLatestAvroPayload also has this issue.)
So we need to retrieve the old/existing record from the base file and then merge/combine it with the incoming record. As we currently don't support looking up a record from the base file, we have to store the old/existing record somewhere, e.g. Flink state. BucketAssignFunction is the only place where we can store the old/existing record and change its location from the old partition file to the new partition file.
So the new logic is:
- store the old record (from the source, or from the base file when bootstrap is enabled) in Flink state
- once a new record arrives with the same record key but a changed partition:
  - sink a delete record to the old partition file to delete the old record
  - retrieve & copy the old record from state, change its location to the new partition, and sink it to the new partition file
  - the copied record and the incoming record will be merged by #preCombine
The drawback here is that it will increase the state size, but if we don't use state to store the full record, it seems we have no way to merge the incoming record with the existing record in the base file when the partition changes.
I also considered this problem. What I'm thinking, to avoid impacting the current logic (overwrite with latest payload), is to create an updateState abstract method and treat indexState as an abstract field; different subclasses would implement the logic with ValueState<HoodieRecordGlobalLocation> or ValueState<HoodieRecord>, or use a StateHelper to handle the state operations.
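A rough sketch of the two state layouts being discussed, with hypothetical class and state names (not the PR's actual BucketAssignFunction), just to make the trade-off concrete:

import org.apache.flink.api.common.state.ValueState;
import org.apache.flink.api.common.state.ValueStateDescriptor;
import org.apache.flink.api.common.typeinfo.TypeInformation;
import org.apache.flink.runtime.state.FunctionInitializationContext;
import org.apache.hudi.common.model.HoodieRecord;
import org.apache.hudi.common.model.HoodieRecordGlobalLocation;

public class IndexStateLayoutSketch {

  // Current layout: only the record's global location, small and fixed-size per key.
  private ValueState<HoodieRecordGlobalLocation> locationState;

  // Proposed layout: the full record, so the old data can be copied to the new
  // partition and merged with the incoming record; state size grows with payload size.
  @SuppressWarnings("rawtypes")
  private ValueState<HoodieRecord> recordState;

  @SuppressWarnings({"unchecked", "rawtypes"})
  public void initializeState(FunctionInitializationContext context) {
    locationState = context.getKeyedStateStore().getState(
        new ValueStateDescriptor<>("indexLocationState", TypeInformation.of(HoodieRecordGlobalLocation.class)));
    recordState = context.getKeyedStateStore().getState(
        new ValueStateDescriptor<>("indexRecordState", TypeInformation.of(HoodieRecord.class)));
  }
}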
Yeah, I agree that carrying the schema with the record is not a good solution; it couples the byte data with the schema and makes it hard to evolve the schema later. But it seems that we cannot merge two payloads without knowing the schema.
CC @xushiyan @alexeykudinkin, who might be working on refactoring payload interfaces, and @xiarixiaoyao, who might be working on the schema evolution story. Can you folks take a look at the patch?
Thanks @nsivabalan for following up. To summarize, the current implementation has two drawbacks:
And there are two challenges that need to be solved to support partial update.
Any thoughts on these two challenges?
nsivabalan left a comment
Left some high-level comments. Will let Danny follow up on reviews since this touches the Flink code base.
/**
 * the schema of generic record
 */
public final String schema;
This might be confusing w/ the schema arg of combineAndGetUpdateValue. Can you fix either of the names?
But in general, storing the schema along w/ the payload might have an impact on performance, and that's why the initial payload was designed that way. So do add a line here noting that payload implementations setting this schema field might have to watch out for performance.
removed this field.
GenericRecord currentRecord = (GenericRecord) currentValue;
List<Schema.Field> fields = schema.getFields();
Guess this has to be "this.schema.getFields()". As I commented earlier, it's confusing :) . Can we fix the naming of either of them?
fixed the schema name.
GenericRecord currentRecord = (GenericRecord) currentValue;
List<Schema.Field> fields = schema.getFields();
fields.forEach(field -> {
  Object value = incomingRecord.get(field.name());
Do we need to deal w/ nested fields here?
The current logic will overwrite the whole nested field if the incoming field is not null.
And I think we don't need to support partial update inside nested fields, for example for Map, List, etc. We should not merge map(1 -> 'a', 2 -> 'b') & map(1 -> '', 3 -> 'c') into map(1 -> '', 2 -> 'b', 3 -> 'c'), in case the upstream wants to delete the key '2'; if we merge them together, they cannot delete some elements. The same goes for List.
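A tiny illustration of that choice, using a hypothetical field-level helper in plain Java: a non-null incoming map replaces the old map wholesale instead of being merged key by key, so an element dropped upstream stays dropped.

import java.util.HashMap;
import java.util.Map;

public class NestedFieldOverwriteDemo {

  // Hypothetical field-level rule: take the incoming value unless it is null.
  static Object overwriteNonNull(Object oldValue, Object incomingValue) {
    return incomingValue != null ? incomingValue : oldValue;
  }

  public static void main(String[] args) {
    Map<String, String> oldMap = new HashMap<>();
    oldMap.put("1", "a");
    oldMap.put("2", "b");

    Map<String, String> incomingMap = new HashMap<>();
    incomingMap.put("1", "");
    incomingMap.put("3", "c");

    // The whole map field is replaced: the result only has keys "1" and "3",
    // so the upstream deletion of key "2" is preserved rather than merged back in.
    System.out.println(overwriteNonNull(oldMap, incomingMap));
  }
}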
@Override
public OverwriteWithLatestAvroPayload preCombine(OverwriteWithLatestAvroPayload oldValue) {
Instead of storing the schema with the payload, did you think about adding a new preCombine method as follows:
OverwriteWithLatestAvroPayload preCombine(OverwriteWithLatestAvroPayload oldValue, Schema schema);
This would make it a lot simpler, right? Since preCombine is used only to dedup records within a single batch, both records should have the same schema.
Hi @nsivabalan, thanks a lot for reviewing this.
Regarding adding a new preCombine method with Schema: I considered this, but it means the method caller needs to get the schema info first, and currently it seems we can only get the schema info from the Configuration (from the hoodie.avro.schema field). Sometimes the caller might find it hard to get the schema info, especially for FlinkWriteHelper.deduplicateRecords(List<HoodieRecord<T>> records, HoodieIndex<?, ?> index, int parallelism).
But comparing the performance, it seems that passing the schema into the method might be a better approach.
BTW, since we already have the method preCombine(T oldValue, Properties properties), how about putting the schema string into the properties and then parsing it into a Schema later, so that we don't need to create a new method any more? Otherwise, I cannot imagine when we would ever use Properties.
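A minimal sketch of that idea, assuming the caller reuses the hoodie.avro.schema key mentioned above (the class and helper names are hypothetical):

import java.util.Properties;
import org.apache.avro.Schema;

public class SchemaFromPropertiesSketch {

  // The caller stashes the writer schema string in the Properties that are
  // already handed to preCombine(T oldValue, Properties properties).
  static Properties withSchema(Schema schema) {
    Properties props = new Properties();
    props.setProperty("hoodie.avro.schema", schema.toString());
    return props;
  }

  // Inside the payload, the schema is recovered from the same Properties.
  static Schema schemaFrom(Properties props) {
    return new Schema.Parser().parse(props.getProperty("hoodie.avro.schema"));
  }

  public static void main(String[] args) {
    Schema schema = new Schema.Parser().parse(
        "{\"type\":\"record\",\"name\":\"Rec\",\"fields\":[{\"name\":\"id\",\"type\":\"string\"}]}");
    Properties props = withSchema(schema);
    System.out.println(schemaFrom(props).getField("id") != null); // true
  }
}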
try {
  Schema schema = new Schema.Parser().parse(this.schema);
  Option<IndexedRecord> incomingOption = getInsertValue(new Schema.Parser().parse(this.schema));
  Option<IndexedRecord> insertRecordOption = oldValue.getInsertValue(new Schema.Parser().parse(oldValue.schema));
insertRecordOption -> oldRecordOption
solved.
try {
  Schema schema = new Schema.Parser().parse(this.schema);
Argh, this again clashes w/ the instance variable "schema". Can we fix the naming?
solved
…payload. 2. revert change for storing full HoodieRecord in the flink state to support the case the partition path changed
Hi @danny0405, regarding the two changes:
For the first one, I removed the schema field from the payload class; instead, the schema is now passed in. BTW, do you think it is worth implementing a new bucket assign function (controlled by a feature toggle/configuration) that stores the full record to fully support partial update?
 * @return the combined value
 */
@PublicAPIMethod(maturity = ApiMaturityLevel.STABLE)
default T preCombine(T oldValue, Properties properties, Schema schema) {
I think the currently established semantic for preCombine is: you select either A or B, but you don't produce a new record based on those 2, since it's mostly used to de-dupe records in the incoming batch. I can hardly imagine the case where 2 incoming records need to be fused into something third. Can you help me understand what use-case you have in mind here?
Thanks @alexeykudinkin for reviewing this.
What we are trying to do is implement partial update. For example, let's assume the record schema is (f0 int, f1 int, f2 int). The first record value is (1, 2, 3), and the second record value is (4, 5, null), with the field f2 value being null. We hope that the result after running preCombine is (4, 5, 3), which means we need to combine/merge the two records into a third one, not only choose one of them.
Actually, what we want to implement is similar to #combineAndGetUpdateValue(IndexedRecord currentValue, Schema schema), which is used to combine the incoming record with the existing record from the base/log file.
But #preCombine will be used for combining/merging two incoming records in a batch.
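A small self-contained demo of that expectation, using plain Avro and a hypothetical mergeNonNull loop rather than the PR's actual payload code:

import org.apache.avro.Schema;
import org.apache.avro.SchemaBuilder;
import org.apache.avro.generic.GenericData;
import org.apache.avro.generic.GenericRecord;

public class PartialMergeExpectationDemo {

  // Hypothetical merge rule: a non-null field from the later record wins,
  // a null field falls back to the earlier record's value.
  static GenericRecord mergeNonNull(Schema schema, GenericRecord earlier, GenericRecord later) {
    GenericRecord merged = new GenericData.Record(schema);
    for (Schema.Field field : schema.getFields()) {
      Object laterValue = later.get(field.name());
      merged.put(field.name(), laterValue != null ? laterValue : earlier.get(field.name()));
    }
    return merged;
  }

  public static void main(String[] args) {
    Schema schema = SchemaBuilder.record("Rec").fields()
        .optionalInt("f0").optionalInt("f1").optionalInt("f2")
        .endRecord();

    GenericRecord first = new GenericData.Record(schema);
    first.put("f0", 1);
    first.put("f1", 2);
    first.put("f2", 3);

    GenericRecord second = new GenericData.Record(schema);
    second.put("f0", 4);
    second.put("f1", 5);
    second.put("f2", null);

    // Prints the expected partial-update result: f0=4, f1=5, f2=3
    System.out.println(mergeNonNull(schema, first, second));
  }
}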
Right, that's exactly my question: why do you want to implement such a semantic w/in preCombine? What use-case are you trying to accommodate here?
Essentially with this change you will introduce a way for 2 records w/in the batch to be combined into 1. But why do you need this?
After all, you can achieve the same goal if you just stop de-duping your records and then subsequently merge them against what is on disk.
Hi @alexeykudinkin, I get your point: if we have to combine two records into one, we'd better implement the combine logic somewhere else, maybe in some util or helper classes, or skip the de-duping logic, right?
Here are some reasons from my side why #preCombine might be a better place to implement this logic, or alternatively we could create a new merge method in the HoodieRecordPayload interface.
- First, from the description of the preCombine method, it is used for combining multiple records with the same HoodieKey before attempting to insert/upsert to disk. "Combine multiple records" might not mean only choosing one of them; we could also combine & merge them into a new one, it just depends on how the subclass implements the preCombine logic (please correct me if my understanding is wrong :) ). Yeah, it might be a little bit confusing that we need a Schema if we are trying to merge them.
- Second, I checked where we call the preCombine method: it is used to de-duplicate records with the same HoodieKey before insert/update to disk, especially in the Flink write case. Even though the de-dup logic is to choose the latest record, we need to ensure that one HoodieKey only maps to one record before comparing with the existing record and writing to disk, otherwise some records will be missed. For example, in HoodieMergeHandle.init(fileId, newRecordsIter), it converts the record iterator to a map and treats the recordKey as the key. So we might not be able to stop the de-duping logic and merge against what is on disk unless we change the logic there. And if we implement another class/method to handle the merge logic and switch the existing de-duping logic from calling preCombine to the new class/method, we have to add a condition to control whether we should call preCombine or not, which I don't think is a good way. Instead, we should handle it in the preCombine method via different payload implementations.
That's my thought here, and I'm glad to hear your suggestions. :)
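A small sketch of the constraint described in the second point, with generic stand-ins rather than Hudi's actual HoodieMergeHandle code: once records are keyed by record key, only one payload per key survives, so duplicates have to be combined beforehand, and preCombine is that hook.

import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.function.BinaryOperator;

public class DedupByKeySketch {

  // Collapse all payloads that share a record key; the combiner (the preCombine
  // hook) decides, or builds, the surviving payload whenever a key repeats.
  static <P> Map<String, P> dedupByRecordKey(List<Map.Entry<String, P>> keyedPayloads,
                                             BinaryOperator<P> preCombine) {
    Map<String, P> byKey = new HashMap<>();
    for (Map.Entry<String, P> entry : keyedPayloads) {
      byKey.merge(entry.getKey(), entry.getValue(), preCombine);
    }
    return byKey;
  }

  public static void main(String[] args) {
    List<Map.Entry<String, String>> batch = List.of(
        Map.entry("key1", "recordA"),
        Map.entry("key1", "recordB"));
    // String concatenation stands in for a partial merge of two same-key payloads.
    System.out.println(dedupByRecordKey(batch, (older, newer) -> older + "+" + newer));
  }
}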
Let me try to clarify a few things:
preCombine has a very specific semantic: it de-duplicates by picking the "most recent" among records in the batch. The expectation always is that, being handed 2 records, it will return either of them. It could not produce a new record. If we want to revisit this semantic, it is a far larger change that will surely require writing an RFC and a broader discussion regarding the merits of such a migration. Please also keep in mind that, as of RFC-46, there's an effort underway to abstract the whole "record combination/merging" semantic out of the RecordPayload hierarchy into a standalone Combination/Merge Engine API.
First, from the description of the preCombine method, it is used for combining multiple records with the same HoodieKey before attempting to insert/upsert to disk. "Combine multiple records" might not mean only choosing one of them; we could also combine & merge them into a new one, it just depends on how the subclass implements the preCombine logic (please correct me if my understanding is wrong :) ). Yeah, it might be a little bit confusing that we need a Schema if we are trying to merge them.
Please see my comment regarding the preCombine semantic above. I certainly agree with you that the name is confusing, but I've tried to clear up that confusion. Let me know if you have more questions about it.
Second, I checked where we call the preCombine method: it is used to de-duplicate records with the same HoodieKey before insert/update to disk, especially in the Flink write case. Even though the de-dup logic is to choose the latest record, we need to ensure that one HoodieKey only maps to one record before comparing with the existing record and writing to disk, otherwise some records will be missed. For example, in HoodieMergeHandle.init(fileId, newRecordsIter), it converts the record iterator to a map and treats the recordKey as the key. So we might not be able to stop the de-duping logic and merge against what is on disk unless we change the logic there. And if we implement another class/method to handle the merge logic and switch the existing de-duping logic from calling preCombine to the new class/method, we have to add a condition to control whether we should call preCombine or not, which I don't think is a good way. Instead, we should handle it in the preCombine method via different payload implementations.
You're bringing up good points; let's dive into them one by one. Currently we have 2 mechanisms:
- preCombine, which allows selecting the "most recent" record among those having the same key w/in the batch
- combineAndGetUpdateValue, which allows combining the previous or "historical" record (on disk) with the new incoming one (all partial merging semantics are currently implemented in this method)
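For reference, a paraphrased sketch of those two hooks (signatures simplified from the HoodieRecordPayload interface referenced in this thread, not the literal source):

import java.io.IOException;
import org.apache.avro.Schema;
import org.apache.avro.generic.IndexedRecord;
import org.apache.hudi.common.util.Option;

// Paraphrased sketch, not the literal interface source.
interface RecordPayloadHooksSketch<T> {

  // Batch-level de-duplication: given two payloads sharing a key within the
  // incoming batch, return the "most recent" one (historically: one of the two).
  T preCombine(T oldValue);

  // Merge against storage: combine the record already on disk with this
  // incoming payload; partial-merge semantics currently live here.
  Option<IndexedRecord> combineAndGetUpdateValue(IndexedRecord currentValue, Schema schema) throws IOException;
}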
You rightfully mention that one of the current invariants is that the batch gets de-duped at a certain level (b/c we have to maintain PK uniqueness on disk), and so we might need to shift that to accommodate the case that you have. And that's exactly what my question was: if you can elaborate on the use-case you have at hand and are trying to solve w/ this PR, I would be able to better understand where you're coming from and what the best path forward is for us here.
The questions I'm looking for answers to are basically the following:
- What's the nature of your use-case? (domain, record types, frequency, size, etc.)
- Where are the requirements for partial updates coming from?
Etc. I'm happy to set up 30 min to talk in person regarding this, or connect on Slack and discuss it there.
Hi @alexeykudinkin, thanks a lot for your detailed clarification.
- Regarding the design of preCombine, I'm clear now. I'm sorry, I don't know the details of RFC-46, and I also didn't find a link to RFC-46 from here; could you please share the link?
- Regarding the requirements for partial updates/overwrites, I have seen the same requirement from the community. In my case, generally, we want to build a customer profile with multiple attributes; these attributes might come from different systems, one system might only provide some attributes in an event/record, and two systems might send events/records with different attributes. We should not only choose the most recent one; we need to merge them before writing to disk. Otherwise, we have to keep all change logs and then start a new job to dedup & merge these attributes among the change logs. For example, we have 10 attributes a1-a10 (all of them optional): source system A only has a1-a5, source system B only has a6-a10, and the result we expect is that the final record contains a1-a10, not only a1-a5 or a6-a10. And because we might receive two events/records at the same time, they might be in the same batch; that's why we want to merge them before combineAndGetUpdateValue.
BTW, thanks a lot for your time, will ping you on Slack.
CC @rmahindra123, who encountered a need in preCombine to combine bits and pieces from both records and return a new one. Rajesh: do you want to go over your use-case, maybe?
@alexeykudinkin, I'm sorry that I still haven't found a suitable time to align online; may I check whether you have any thoughts or suggestions on this PR?
Option<IndexedRecord> incomingOption = getInsertValue(schema);
Option<IndexedRecord> oldRecordOption = oldValue.getInsertValue(schema);
if (incomingOption.isPresent() && oldRecordOption.isPresent()) {
In general it's better to express common functionality in a way that allows it to be re-used and adopted in other places: here, for example, we can reuse the same routine of combining 2 records into one across the 2 methods if we properly abstract it.
abstracted the merge method, but it's still in the current class.
.withDescription("Payload class used. Override this, if you like to roll your own merge logic, when upserting/inserting.\n"
    + "This will render any value set for the option in-effective");

public static final ConfigOption<Boolean> PARTIAL_OVERWRITE_ENABLED = ConfigOptions
What's the idea for this additional configuration (besides the record payload class)?
This was the feature toggle to control another change in BucketAssignFunction, to support the case where the record partition path changes, but I have removed that change, so this feature toggle can be removed as well.
Do I need to modify the preCombine in the HoodieMergedLogRecordScanner.processNextRecord method? What I understand is that when we read a log file, we need to do deduplication and also call preCombine.
if (incomingOption.isPresent() && oldRecordOption.isPresent()) {
  GenericRecord incomingRecord = (GenericRecord) incomingOption.get();
  GenericRecord oldRecord = (GenericRecord) oldRecordOption.get();
  boolean chooseIncomingRecord = this.orderingVal.compareTo(oldValue.orderingVal) > 0;
This place needs to be changed to >=, because when we do not set the preCombine field, the first record will always be used instead of the latest one.
Good point here, thx.
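A tiny illustration of the tie case with hypothetical values: when no precombine field is set, both records carry the same default ordering value, so > keeps the first-seen record while >= keeps the latest one.

public class OrderingTieDemo {
  public static void main(String[] args) {
    Integer incomingOrderingVal = 0; // default ordering value when no precombine field is set
    Integer oldOrderingVal = 0;

    boolean keepIncomingStrict = incomingOrderingVal.compareTo(oldOrderingVal) > 0;   // false: the old (first-seen) record wins
    boolean keepIncomingOrEqual = incomingOrderingVal.compareTo(oldOrderingVal) >= 0; // true: the latest record wins
    System.out.println(keepIncomingStrict + " " + keepIncomingOrEqual);
  }
}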
As I know, if we also want to achieve the partial update purpose, we just need to pass the …
Just FYI for all interested folks.
Yeah, @LinMingQiang has mentioned this one above. From my understanding, if we want to enable the "partial update" feature by defining a customized payload class, it should run "partial update" in these three cases:
So I also updated the … The current situation is that we will treat …
alvarolemos left a comment
Just added a comment, suggesting a generalization of the proposed approach :)
 * @param secondRecord the new record provide new field value
 * @return merged records
 */
private GenericRecord overwriteWithNonNullValue(Schema schema, GenericRecord firstRecord, GenericRecord secondRecord) {
I really liked the idea of having a record payload that does partial merging. However, if I understood it correctly, what's proposed here does so in a very specific way: you're favoring the incoming record's field values, unless they are null (in which case you keep the existing one). I'm not saying this is not valuable, but the idea of doing partial merging is so good that maybe we could have something more generic. I'm going to suggest a few changes in order to accomplish that:
- Make PartialOverwriteWithLatestAvroPayload an abstract class
- Instead of having mergeFunc as a parameter of the mergeRecord method, it could become an abstract method. This would lead to the removal of the overwriteWithNonNullValue method, which makes this implementation specific to your merging logic
- For the original use case (partial merge favoring non-null values), implement the proposed abstract class and implement the mergeFunc method with what you have in overwriteWithNonNullValue: (first, second) -> Objects.isNull(second) ? first : second
It's just an idea that could make what you proposed useful for many more use cases. Hope this makes sense, and thanks for bringing this idea!
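A minimal sketch of that generalization with hypothetical class names; the record-walking loop would stay in the base class, and only the per-field rule varies:

import java.util.Objects;

// Hypothetical base class: it would own the schema-driven loop over the record's
// fields (omitted here) and delegate each pair of field values to the rule below.
abstract class AbstractPartialMergePayloadSketch {
  protected abstract Object mergeField(Object oldValue, Object incomingValue);
}

// The behavior proposed in this PR (favor the incoming value unless it is null)
// then becomes just one concrete implementation of that rule.
class OverwriteNonNullPayloadSketch extends AbstractPartialMergePayloadSketch {
  @Override
  protected Object mergeField(Object oldValue, Object incomingValue) {
    return Objects.isNull(incomingValue) ? oldValue : incomingValue;
  }
}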
Hi @alvarolemos, thanks a lot for your useful suggestion. Yeah, I also considered abstracting the merge logic either by using an abstract merge method or by passing a merge function into a generic method, and I chose the latter. The reasons are as follows:
- preCombine and combineAndGetUpdateValue might have different merge/combine logic, so implementing only one abstract merge function might not be enough for both cases. For example, these two methods in OverwriteWithLatestAvroPayload have different merge/combine logic.
- In the current implementation, mergeRecord is actually a generic method (even though it's private for now); it doesn't care about the detailed merge logic and can be changed to protected/public scope if needed. Instead, overwriteWithNonNullValue is the merge implementation of the current payload, which is a wrapper of mergeFunc, and we can create two wrappers for the preCombine and combineAndGetUpdateValue scenarios if needed, which is similar to what you mentioned about implementing the detailed mergeFunc logic in a subclass. We can still inherit this class, implement the detailed mergeFunc logic, and pass it to the mergeRecord method.
- Another reason why I didn't choose to create an abstract class for now is that there would only be one subclass; we can refactor it if many cases need to inherit this class. Right now, I just want to keep it as simple as possible.
@hudi-bot run azure
@stayrascal: We landed partial payload support via #4676.
What is the purpose of the pull request
Add a customized payload PartialOverwriteWithLatestAvroPayload to support partially overwriting record fields. For example, two incoming records A(1, null, 1, 1) and B(2, 2, null, 2) will be combined into the result (2, 2, 1, 2).
Brief change log
- Add PartialOverwriteWithLatestAvroPayload to support partially overwriting records
- Add partial.overwrite.enabled to control whether partial overwrite is enabled
- Change the state type in BucketAssignFunction from HoodieRecordGlobalLocation to HoodieRecord
- Copy the HoodieRecord before storing the record in ValueState
- Add compareTo to compare two HoodieRecords and choose their sequence, because the sequence cannot be obtained from the #preCombine method if two records are merged in #preCombine
- Add a schema field (String) in the new AvroPayload, which will be used to #preCombine two records
- Update BootstrapOperator to load HoodieRecord with data instead of only HoodieKey
- Update RowDataToHoodieFunction to create the payload with the schema
Verify this pull request
- Added unit tests for the preCombine and combineAndGetUpdateValue methods
Committer checklist
- Has a corresponding JIRA in PR title & commit
- Commit message is descriptive of the change
- CI is green
- Necessary doc changes done or have another open PR
- For large changes, please consider breaking it into sub-tasks under an umbrella JIRA.