Flink: Replace Row with RowData in flink write path. #1320
Conversation
this.targetFileSizeBytes = targetFileSizeBytes;
this.format = format;
this.appenderFactory = new FlinkFileAppenderFactory(schema, tableProperties);
this.flinkSchema = FlinkSchemaUtil.convert(schema);
In Spark, we had a bug where Spark may produce a row with a short, which is stored as an int in Iceberg. In a CTAS query, data would actually get passed to Iceberg with the short and we would end up with a ClassCastException. That's why we now pass the dataset schema when creating writers. You might want to watch out for a similar bug.
Thanks for the reminder, I think the bug you mentioned is #999. Flink doesn't support CTAS, but it does support `INSERT INTO iceberg_table SELECT * FROM table_2`. If table_2 has a TINYINT or SMALLINT column, the values in the BinaryRowData produced by the SELECT will be byte or short, so we also need the raw Flink schema (rather than the Flink schema converted from the Iceberg schema) to read the values from BinaryRowData and write those byte or short values as integers. Let me consider how to fix this.
This only appears in Spark with a CTAS query because that's the only time that Spark doesn't get a schema back from the table. When Spark has a table schema, it will automatically insert casts to the appropriate types so this problem doesn't happen. I'm not sure if Flink does that, but if it does then you wouldn't need to worry about that bug.
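To make the failure mode concrete, here is a small standalone sketch, assuming standard Flink `GenericRowData` behavior; this is not code from the PR, just an illustration of how a SMALLINT value trips up the INT accessor implied by the Iceberg-converted schema:

```java
import org.apache.flink.table.data.GenericRowData;
import org.apache.flink.table.data.RowData;

// Standalone sketch (not code from this PR): a SMALLINT column arrives as a Short,
// so it must be read through the source table's Flink type and widened explicitly.
public class ShortWideningSketch {
  public static void main(String[] args) {
    GenericRowData generic = new GenericRowData(1);
    generic.setField(0, (short) 42);   // value produced for a SMALLINT column
    RowData row = generic;

    // Reading with the source Flink type works; widening short -> int is implicit.
    int widened = row.getShort(0);
    System.out.println(widened);       // 42

    try {
      // Reading with the type converted back from Iceberg (INT) fails, because
      // the stored object is a Short, not an Integer.
      System.out.println(row.getInt(0));
    } catch (ClassCastException e) {
      System.out.println("ClassCastException: " + e.getMessage());
    }
  }
}
```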
private static PositionalGetter<?> buildGetter(LogicalType logicalType, Type type) {
  switch (type.typeId()) {
    case STRING:
      switch (logicalType.getTypeRoot()) {
I changed the parameter from the Iceberg type to Flink's logical type here because the values for TINYINT and SMALLINT are byte and short; casting a byte or short directly to Integer here would throw a ClassCastException. Using the logical type lets us widen them to integer the right way.
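A hypothetical sketch of that idea follows. The `PositionalGetter` shape is assumed from the method signature shown above, and the widening logic is illustrative rather than the PR's actual implementation:

```java
import org.apache.flink.table.data.RowData;
import org.apache.flink.table.types.logical.LogicalType;

// Sketch only: key the getter on the Flink logical type so TINYINT/SMALLINT values
// (Byte/Short inside RowData) are widened to the Integer an Iceberg INT column expects.
interface PositionalGetter<T> {
  T get(RowData row, int pos);
}

final class IntGetters {
  private IntGetters() {
  }

  static PositionalGetter<Integer> forIcebergInt(LogicalType flinkType) {
    switch (flinkType.getTypeRoot()) {
      case TINYINT:
        return (row, pos) -> (int) row.getByte(pos);   // widen byte -> int
      case SMALLINT:
        return (row, pos) -> (int) row.getShort(pos);  // widen short -> int
      case INTEGER:
        return RowData::getInt;
      default:
        throw new UnsupportedOperationException("Cannot read an INT column from " + flinkType);
    }
  }
}
```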
  return new UUID(mostSigBits, leastSigBits);
};
case VARBINARY:
  if (Type.TypeID.UUID.equals(type.typeId())) {
I think an identity check would be okay since this is an enum symbol, but either way is fine.
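As a tiny illustration of that point (the helper class here is made up), both forms behave the same because enum constants are singletons:

```java
import org.apache.iceberg.types.Type;

// Illustration only: reference comparison and equals() are interchangeable
// for an enum constant such as Type.TypeID.UUID.
final class UuidCheck {
  private UuidCheck() {
  }

  static boolean isUuid(Type type) {
    return type.typeId() == Type.TypeID.UUID;
    // equivalent: Type.TypeID.UUID.equals(type.typeId())
  }
}
```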
ByteBuffer bb = ByteBuffer.wrap(row.getBinary(pos));
long mostSigBits = bb.getLong();
long leastSigBits = bb.getLong();
return new UUID(mostSigBits, leastSigBits);
Looks like another area where we should have a util method to convert (though it shouldn't block this commit).
It's true, we could have a separate pull request for this.
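A minimal sketch of such a shared helper, assuming made-up names (`UUIDUtil.fromBytes`) that are not part of this PR:

```java
import java.nio.ByteBuffer;
import java.util.UUID;

// Sketch of the shared conversion helper suggested above; names are illustrative.
final class UUIDUtil {
  private UUIDUtil() {
  }

  static UUID fromBytes(byte[] bytes) {
    ByteBuffer bb = ByteBuffer.wrap(bytes);
    long mostSigBits = bb.getLong();
    long leastSigBits = bb.getLong();
    return new UUID(mostSigBits, leastSigBits);
  }
}
```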
TimestampType timestampType = (TimestampType) logicalType;
return (row, pos) -> {
  LocalDateTime localDateTime = row.getTimestamp(pos, timestampType.getPrecision()).toLocalDateTime();
  return DateTimeUtil.microsFromTimestamp(localDateTime);
Why not use the same logic here that is used in the other timestamp type? Both of the values are TimestampData that is returned by getTimestamp. It seems like converting directly to a microsecond value is better than going through LocalDateTime here.
Yeah, converting the TimestampData to a long could be the same in both cases. I separated them because the timestamp types are different, and we depend on TimestampType.getPrecision() or LocalZonedTimestampType.getPrecision() to get the precision (we could use the constant 6 here, but it's better to use the type's precision getter).
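For reference, the direct conversion without the intermediate LocalDateTime could look like the sketch below; the helper name is hypothetical, while the `TimestampData` accessors used are the standard `getMillisecond()` and `getNanoOfMillisecond()`:

```java
import org.apache.flink.table.data.TimestampData;

// Illustrative helper, not the code in this PR: convert TimestampData straight
// to microseconds instead of going through LocalDateTime.
final class TimestampMicros {
  private TimestampMicros() {
  }

  static long microsFrom(TimestampData ts) {
    // TimestampData exposes milliseconds plus the nanoseconds within that millisecond.
    return ts.getMillisecond() * 1_000L + ts.getNanoOfMillisecond() / 1_000L;
  }
}
```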
for (RowData row : rows) {
  Integer id = row.isNullAt(0) ? null : row.getInt(0);
  String data = row.isNullAt(1) ? null : row.getString(1).toString();
  records.add(createRecord(id, data));
This could also use the assertEquals implementation that @chenjunjiedada has added in #1266. That would be better than converting a specific record type.
Makes sense.
I have a few minor comments, but I don't think any of those should block this going in. I think it is correct. We can make the tests better in follow-ups by using the assertEquals helpers mentioned above.
Merged. Thanks for working on this, @openinx!
This patch addresses the third point in issue #1305.

After this patch, only `FlinkParquetReaders`, `FlinkParquetWriters`, `RandomData`, and `TestFlinkParquetReaderWriter` will reference the `Row` data type. Once we merge the pull requests for the RowData parquet readers (#1266) and parquet writers (#1272), there should be no other core classes that use the `Row` data type.