
Conversation

@dungdm93 (Contributor) commented Feb 15, 2022

This PR aims to:

  1. replace the BaseTaskWriter inner classes (BaseRollingWriter, RollingFileWriter, RollingEqDeleteWriter)
    with the respective implementations of the RollingFileWriter interface.
  2. provide a single implementation of TaskWriter that can handle both partitioned & unpartitioned data (by delegating to the PartitioningWriter)

Here is my approach, from the top down:

TaskWriter
    |
    V
PartitioningWriter
    |
    V
RollingFileWriter
    |
    V
FileWriter
    |
    V
FileAppender
  1. TaskWriter handles different kinds of records.
    • With insert-only data, TaskWriter basically calls PartitioningWriter.write. See DirectTaskWriter for more details.
    • With delta data, TaskWriter can hold 3 PartitioningWriters: insertWriter, equalityDeleteWriter and positionDeleteWriter. For each incoming record, based on its type (insert, update or delete), TaskWriter calls the corresponding writer. See FlinkTaskWriter for more details, and the sketch after this list.
  2. PartitioningWriter writes to multiple specs and partitions.
    Note that for unpartitioned tables, partition = null is passed to PartitioningWriter.write.
  3. Internally, PartitioningWriter already uses RollingFileWriter to roll over to a new file when the current one grows too large. RollingFileWriter is just a wrapper around another FileWriter.
  4. FileWriter writes to a single file.
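
To make the routing in step 1 concrete, here is a minimal sketch. Only TaskWriter, PartitioningWriter and the PartitioningWriter.write(row, spec, partition) signature come from this design; the class name, fields and routing body are illustrative assumptions, and position deletes are omitted for brevity:

import org.apache.flink.table.data.RowData;
import org.apache.iceberg.PartitionSpec;
import org.apache.iceberg.StructLike;
import org.apache.iceberg.io.DataWriteResult;
import org.apache.iceberg.io.DeleteWriteResult;
import org.apache.iceberg.io.PartitioningWriter;

class DeltaRoutingSketch {
  private final PartitioningWriter<RowData, DataWriteResult> insertWriter;
  private final PartitioningWriter<RowData, DeleteWriteResult> equalityDeleteWriter;
  private final PartitionSpec spec;

  DeltaRoutingSketch(PartitioningWriter<RowData, DataWriteResult> insertWriter,
                     PartitioningWriter<RowData, DeleteWriteResult> equalityDeleteWriter,
                     PartitionSpec spec) {
    this.insertWriter = insertWriter;
    this.equalityDeleteWriter = equalityDeleteWriter;
    this.spec = spec;
  }

  // partition is null for unpartitioned tables (see step 2 above)
  void write(RowData row, StructLike partition) {
    switch (row.getRowKind()) {
      case INSERT:
      case UPDATE_AFTER:
        insertWriter.write(row, spec, partition);         // insert path
        break;
      case UPDATE_BEFORE:
      case DELETE:
        equalityDeleteWriter.write(row, spec, partition); // delete path
        break;
    }
  }
}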

Why we need that

  • The inner classes are bound to the outer object (the BaseTaskWriter instance), which makes the code more complex and harder to read and understand.
  • Currently, the approach of separate classes for partitioned & unpartitioned data is still OK, but it's better to unify them into a single class, which means:
    • Less effort is needed for a new engine.
    • No need to create a new class when an existing engine wants to support a new use-case. For example, Flink currently only supports fan-out partitioning, which is required for the streaming execution mode; but when the execution mode is batch, people may prefer ClusteredPartitionWriter because it takes fewer resources.

Result

The 2 new TaskWriters, DirectTaskWriter and FlinkTaskWriter, cover all Flink and Spark cases.
Below is the equivalent code for Flink's current writers:

  • write INSERT-only data, unpartitioned table:
// current
taskWriter = new UnpartitionedWriter<>(...);

// new
partitioner = DirectTaskWriter.unpartition();
taskWriter = new DirectTaskWriter<>(partitioner, ...);
  • write INSERT-only data, partitioned table:
// current
class RowDataPartitionedFanoutWriter extends PartitionedFanoutWriter<RowData> {...}
taskWriter = new RowDataPartitionedFanoutWriter(...);

// new
partitioner = FlinkTaskWriter.partitionerFor(spec, schema, flinkSchema);
taskWriter = new DirectTaskWriter<>(partitioner, ...);
  • write both INSERT and DELETE data, unpartitioned table:
// current
taskWriter = new UnpartitionedDeltaWriter(...);

// new
partitioner = DirectTaskWriter.unpartition();
taskWriter = new FlinkTaskWriter(partitioner, ...);
  • write both INSERT and DELETE data, partitioned table:
// current
taskWriter = new PartitionedDeltaWriter(...);

// new
partitioner = FlinkTaskWriter.partitionerFor(spec, schema, flinkSchema);
taskWriter = new FlinkTaskWriter(partitioner, ...);

How I tested it

Passed all unit tests and ran with some sample datasets on my local machine.

@dungdm93 (Contributor Author):

cc @aokolnychyi, @rdblue, @jackye1995, @openinx, @stevenzwu, @szehon-ho, @RussellSpitzer

@dungdm93 (Contributor Author) left a comment:

It's a breaking change, but it only affects you if you have a custom implementation.
Nevertheless, those 2 interfaces were only introduced in 0.13, so the number of affected users should be negligible.

  * @return PathOffset of written row
  */
- void write(T row);
+ PathOffset write(T row);
@dungdm93 (Contributor Author):

For a delete, there can be 2 rows in the delete files: one EqualityDelete row to delete the record in a previous snapshot, and one PositionDelete row to delete the record in the current snapshot. So it's required to track the PathOffset of all records inserted in the current snapshot.
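
A hedged sketch of the bookkeeping this implies, following one common arrangement (position delete when the row was written in the current snapshot, equality delete otherwise). Only PathOffset and the write return value come from this PR; the class, map and abstract helpers are illustrative assumptions:

import java.util.HashMap;
import java.util.Map;
import org.apache.flink.table.data.RowData;
import org.apache.iceberg.StructLike;

abstract class PathOffsetBookkeepingSketch {
  // where did each row inserted in the current snapshot land?
  private final Map<StructLike, PathOffset> insertedRowMap = new HashMap<>();

  // stands in for the PR's FileWriter.write(T), which now returns a PathOffset
  abstract PathOffset writeData(RowData row);
  // hypothetical delete helpers
  abstract void writePositionDelete(PathOffset previous, RowData row);
  abstract void writeEqualityDelete(RowData row);

  void insert(RowData row, StructLike key) {
    insertedRowMap.put(key, writeData(row));
  }

  void delete(RowData row, StructLike key) {
    PathOffset previous = insertedRowMap.remove(key);
    if (previous != null) {
      writePositionDelete(previous, row);  // row was inserted in the current snapshot
    } else {
      writeEqualityDelete(row);            // row comes from a previous snapshot
    }
  }
}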

Member:

You mean each abstracted FileWriter will get a PathOffset back when appending a new row? That doesn't make sense to me, because not every writer needs this PathOffset for what it does next.

@dungdm93 (Contributor Author) commented Feb 17, 2022:

I agree that not every writer needs the PathOffset, but it's required for writing delta data (like Flink's DeltaTaskWriter).
Writers that don't need it can simply ignore the return value.

@rdblue (Contributor) commented Feb 15, 2022:

@dungdm93, I don't think I understand quite what you're trying to do in this PR from the description. Can you add some more detail about what your motivation is and what you're changing? It would probably also help to do this in several smaller PRs.

@dungdm93 dungdm93 force-pushed the refactor-task-writer branch from 1b8b040 to f40f485 Compare February 15, 2022 23:55
@dungdm93 (Contributor Author):

@rdblue sorry for my bad wording. Let me try to add more details.

@dungdm93 dungdm93 force-pushed the refactor-task-writer branch from f40f485 to 0359202 Compare February 16, 2022 04:57
@github-actions github-actions bot added the data label Feb 16, 2022
@dungdm93 (Contributor Author):

Hello @rdblue, I just updated the PR description; I hope it's clear enough to understand now.

- public void write(T row) {
+ public PathOffset write(T row) {
    appender.add(row);
+   return PathOffset.of(location, recordCount++);
Contributor:

In Iceberg, we don't use the return value of ++ operators because it is hard to read code that uses them. Can you move the increment to a separate line?

@dungdm93 (Contributor Author):

Changed to:

    long offset = recordCount++;
    return PathOffset.of(location, offset);

@RussellSpitzer (Member):

The description makes sense to me here. It will take me some time, though, to get through this whole PR; I'll try to set aside time later this week.

@dungdm93 dungdm93 force-pushed the refactor-task-writer branch from da95b4d to 18bc270 Compare February 17, 2022 08:47
@dungdm93 dungdm93 requested a review from rdblue February 17, 2022 09:06
  */
  @SuppressWarnings("unchecked")
- public void partition(StructLike row) {
+ public PartitionKey partition(StructLike row) {
Member:
I think this will produce an API compatibility issue. Why do we need to change this basic API?

@dungdm93 (Contributor Author) commented Feb 17, 2022:

This is just a side change; I made it to align with the other StructLike wrappers: StructProjection.wrap, IndexedStructLike.wrap, InternalRowWrapper.wrap, ... to name a few.
@openinx Could you please explain how this can cause an API compatibility issue?
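
(For illustration, the fluent style those wrappers allow; this call site is hypothetical, not code from the PR:)

// void return: preparing the key takes a separate statement
partitionKey.partition(row);
writer.write(row, spec, partitionKey);

// fluent return: the key can be prepared inline, as StructProjection.wrap allows
writer.write(row, spec, partitionKey.partition(row));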

@openinx (Member):

Downstream users may add the iceberg-api module to their application project. Since PartitionKey is a public API, their application artifact will include calls to void partition(StructLike row). When they upgrade iceberg-api to the next release version, the JVM will then fail to load the expected void partition(StructLike row). That breaks a user's normal upgrade process, and that's why we say it's an API compatibility issue.
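
To illustrate the point with a hypothetical downstream call site (not code from this PR):

// Compiled against iceberg-api 0.13, this call is recorded in the class file with
// the JVM method descriptor partition(Lorg/apache/iceberg/StructLike;)V --
// the return type is part of the descriptor.
PartitionKey key = new PartitionKey(spec, schema);
key.partition(row);

// If a newer iceberg-api jar only ships
// partition(Lorg/apache/iceberg/StructLike;)Lorg/apache/iceberg/PartitionKey;
// the JVM cannot link the old descriptor and throws NoSuchMethodError at runtime,
// even though recompiling from source would succeed against either version.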

@dungdm93 (Contributor Author):

Yeah, let me roll it back.
@openinx could you help me review the other changes?

@openinx (Member):

Yes, I'm currently checking the whole write path. I think I will need one or two days to understand all the writers newly introduced since 0.13.x. Replacing the old writer API with the new one is a great thing. I think we can collaborate to move this forward. Thanks.


@dungdm93 dungdm93 force-pushed the refactor-task-writer branch 2 times, most recently from 06f2708 to 3ef9ea5 Compare February 19, 2022 09:05
@dungdm93 dungdm93 force-pushed the refactor-task-writer branch from 3ef9ea5 to 86128e7 Compare February 19, 2022 15:14
@dungdm93 dungdm93 requested a review from openinx February 28, 2022 08:39
@dungdm93 dungdm93 force-pushed the refactor-task-writer branch from 86128e7 to 197b233 Compare March 1, 2022 02:49
import org.apache.iceberg.io.DefaultPartitioningWriterFactory.Type;
import org.apache.iceberg.relocated.com.google.common.base.Preconditions;

public interface PartitioningWriterFactory<T> {
Member:

Why name it PartitioningWriterFactory? I don't see any partition info in the defined interface methods.

@dungdm93 (Contributor Author):

It's a factory class used to create PartitioningWriters.
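
(For illustration, one shape such a factory could take; these method names are guesses based on the discussion, not the PR's actual interface:)

import org.apache.iceberg.deletes.PositionDelete;
import org.apache.iceberg.io.DataWriteResult;
import org.apache.iceberg.io.DeleteWriteResult;
import org.apache.iceberg.io.PartitioningWriter;

public interface PartitioningWriterFactory<T> {
  PartitioningWriter<T, DataWriteResult> newDataWriter();
  PartitioningWriter<T, DeleteWriteResult> newEqualityDeleteWriter();
  PartitioningWriter<PositionDelete<T>, DeleteWriteResult> newPositionDeleteWriter();
}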

@dungdm93 (Contributor Author):

I just added a little documentation for better understanding.

dungdm93 added 8 commits March 1, 2022 11:18
@dungdm93 dungdm93 force-pushed the refactor-task-writer branch from 197b233 to 40354bb Compare March 1, 2022 04:22
import org.apache.iceberg.StructLike;
import org.apache.iceberg.util.Tasks;

public class DirectTaskWriter<T> implements TaskWriter<T> {
@dungdm93 (Contributor Author):

@openinx do you have any naming suggestions for this class: DirectTaskWriter, AppendTaskWriter, ...?

@github-actions bot commented Aug 7, 2024:

This pull request has been marked as stale due to 30 days of inactivity. It will be closed in 1 week if no further activity occurs. If you think that’s incorrect or this pull request requires a review, please simply write any comment. If closed, you can revive the PR at any time and @mention a reviewer or discuss it on the [email protected] list. Thank you for your contributions.

@github-actions github-actions bot added the stale label Aug 7, 2024
@github-actions bot:
This pull request has been closed due to lack of activity. This is not a judgement on the merit of the PR in any way. It is just a way of keeping the PR queue manageable. If you think that is incorrect, or the pull request requires review, you can revive the PR at any time.

@github-actions github-actions bot closed this Aug 15, 2024
@dungdm93 dungdm93 deleted the refactor-task-writer branch August 15, 2024 02:45