
Iceberg: support Parquet read with delete filter#8534

Closed
jackye1995 wants to merge 1 commit into trinodb:master from jackye1995:iceberg-delete-reader


Conversation

@jackye1995
Member

Support using Iceberg's DeleteFilter to filter delete files in the read path. This implementation supports only Parquet for now, because the Parquet reader already has the ability to generate a row ID channel; I will add ORC later if this implementation is accepted. The general idea is that:

  1. Express a Trino Page as an iterable of TrinoRows, where each row is defined by the underlying block array and a position in the page.
  2. TrinoRow implements Iceberg's StructLike, so it can be used directly with Iceberg's DeleteFilter.
  3. DeleteFilter filters the pages produced by the Parquet page source.
  4. The resulting filtered rows are used to derive the positions to keep in each page.
  5. Page.getPositions retains only the rows at those positions, completing the merge-on-read process.
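
The steps above can be sketched with minimal stand-ins (MiniPage, MiniRow, and the predicate are illustrative placeholders for Trino's Page/Block types and Iceberg's DeleteFilter, not the PR's actual classes):

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Set;
import java.util.function.Predicate;

// Minimal stand-in for a columnar page: one long column, rows addressed by position.
class MiniPage {
    final long[] column;
    MiniPage(long[] column) { this.column = column; }
    int getPositionCount() { return column.length; }
    // Analogous to Page.getPositions: retain only the given positions.
    MiniPage getPositions(int[] positions) {
        long[] kept = new long[positions.length];
        for (int i = 0; i < positions.length; i++) {
            kept[i] = column[positions[i]];
        }
        return new MiniPage(kept);
    }
}

// A "row" is just (page, position) -- the TrinoRow idea from step 1.
record MiniRow(MiniPage page, int position) {
    long get() { return page.column[position]; }
}

public class MergeOnReadSketch {
    // Steps 3-5: run every row through the delete filter, collect surviving
    // positions, then rebuild the page from those positions.
    static MiniPage applyDeleteFilter(MiniPage page, Predicate<MiniRow> isLive) {
        List<Integer> keep = new ArrayList<>();
        for (int pos = 0; pos < page.getPositionCount(); pos++) {
            if (isLive.test(new MiniRow(page, pos))) {
                keep.add(pos);
            }
        }
        int[] positions = keep.stream().mapToInt(Integer::intValue).toArray();
        return page.getPositions(positions);
    }

    public static void main(String[] args) {
        MiniPage page = new MiniPage(new long[] {10, 20, 30, 40});
        // Stand-in for an equality delete: rows with value 20 or 40 are deleted.
        Set<Long> deleted = Set.of(20L, 40L);
        MiniPage filtered = applyDeleteFilter(page, row -> !deleted.contains(row.get()));
        System.out.println(filtered.getPositionCount()); // 2
        System.out.println(filtered.column[0] + "," + filtered.column[1]); // 10,30
    }
}
```

The real implementation differs in that the predicate comes from Iceberg's DeleteFilter operating over StructLike rows, but the position-collection-then-getPositions flow is the same.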

I have not added unit tests yet; I have only tested with an internal Trino installation that supports multiple catalogs, against tables in a Glue catalog. There might be some backport errors I missed. Once we agree on the general implementation approach, I will add tests back and fix any performance issues.

@phd3 @electrum @findepi @losipiuk @caneGuy @rdblue

@cla-bot added the cla-signed label on Jul 13, 2021
@losipiuk
Member

@hashhar You've been working on this one, not sure what is the current shape. PTAL.

Member

@hashhar left a comment

Looks good at first glance. Both position and equality deletes are supported with this change.

One question: the FileIO ends up using the Iceberg Parquet readers to read the delete files instead of the Trino native Parquet reader. This is different from the normal read path. How difficult would it be to use the Trino Parquet reader for reading and applying the deletes?

I've yet to look at the calls into the Iceberg library code to see if something more is needed.

 <properties>
     <air.main.basedir>${project.parent.basedir}</air.main.basedir>
-    <dep.iceberg.version>0.11.0</dep.iceberg.version>
+    <dep.iceberg.version>0.11.1</dep.iceberg.version>
Member

Is this to make the MetadataColumns available?

.collect(toImmutableList());
Schema deleteReadSchema = new Schema(deleteReadFields);
TrinoDeleteFilter deleteFilter = new TrinoDeleteFilter(fileIo, split.getTask(), deleteReadSchema, deleteReadSchema);
getColumns(deleteFilter.requiredSchema(), typeManager).stream()
Member

If I understand correctly, deleteFilter.requiredSchema will always be a superset of the columns we request (the columns argument to this method). So do we need to create the initial regularColumns at all? Can we just assign the result of this stream to regularColumns?

@findepi
Member

findepi commented Jul 14, 2021

> One question - the FileIO ends up using the Iceberg Parquet readers to read the delete instead of the Trino native parquet reader.

@hashhar did you have implementation for positional deletes with Trino Parquet reader?
did you already create a PR for this?

@caneGuy
Contributor

caneGuy commented Jul 15, 2021

Using TrinoRow to reconstruct Page and Blocks looks good!

@EmbeddedSoftwareChenXiangLing

Hello @jackye1995, I am very interested in using Trino to read Iceberg tables with delete filters, and have compiled your branch to test it. However, an exception was thrown: io.trino.spi.TrinoException: row index unavailable.

It seems the row index column is empty when parquetPageSource.getNextPage executes, but I am not sure how to fix it to match your design. Is this the complete code for reading Iceberg v2 tables? Could you guide me on solving this problem?

Thank you for your reply!

@jackye1995
Member Author

For anyone subscribed to this PR: I was mostly focused on multi-catalog support in the past few weeks, and will now start work on this one.

@dijiekstra

Looking forward to this feature. We are currently using Iceberg v2 tables, writing binlog data via Flink, and want to read them and complete our ETL through Trino.

private final String tableName;
private final TableType tableType;
private final Optional<Long> snapshotId;
private final byte[] serializedSchema;
Member

I think there already was an idea to add schema to IcebergTableHandle and it was rejected (?) for some reason.

@phd3 do you remember?

public Schema getSchema()
{
if (schema == null) {
schema = deserializeFromBytes(serializedSchema);
Member

Unsafe publication of the Schema object, since this.schema is not volatile.
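
A minimal sketch of the concern and one common fix: make the lazily-initialized field volatile so a reader thread never observes a partially constructed object. The class and deserializeFromBytes below are illustrative stand-ins, not the PR's actual code:

```java
import java.nio.charset.StandardCharsets;

public class LazySchemaHolder {
    private final byte[] serializedSchema;
    // volatile guarantees that once a thread sees a non-null reference,
    // it also sees the fully constructed object (safe publication).
    private volatile String schema;

    public LazySchemaHolder(byte[] serializedSchema) {
        this.serializedSchema = serializedSchema;
    }

    public String getSchema() {
        String result = schema;
        if (result == null) {
            // Benign race: two threads may both deserialize, but each
            // produces an equivalent immutable object, so either is fine.
            result = deserializeFromBytes(serializedSchema);
            schema = result;
        }
        return result;
    }

    // Stand-in for the real Schema deserialization.
    private static String deserializeFromBytes(byte[] bytes) {
        return new String(bytes, StandardCharsets.UTF_8);
    }
}
```

Without volatile, the Java Memory Model permits another thread to see a non-null schema reference pointing at an object whose fields are not yet visible.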

}

@Override
protected InputFile getInputFile(String s)
Member

s -> path

}

@Override
public <T> T get(int i, Class<T> aClass)
Member

Is it necessary to implement equality-based deletes?

value = aClass.cast(type.getDouble(block, position));
}
else if (type.equals(TIME_MICROS)) {
value = aClass.cast(type.getLong(block, position) / PICOSECONDS_PER_MICROSECOND);
Member

This logic could ideally be in a shared function, doing mapping reverse to io.trino.plugin.iceberg.IcebergTypes#convertIcebergValueToTrino
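
The unit conversion itself is simple arithmetic; the point of the comment is to keep such mappings in one place, next to their inverses. A sketch of the pair of conversions for the TIME case above (Trino represents TIME values in picoseconds of day, Iceberg in microseconds; method names are illustrative):

```java
public class TimeConversions {
    // Trino TIME values are picoseconds of day; Iceberg time values
    // are microseconds, so the two directions divide and multiply.
    static final long PICOSECONDS_PER_MICROSECOND = 1_000_000L;

    static long trinoTimeToIcebergMicros(long picos) {
        return picos / PICOSECONDS_PER_MICROSECOND;
    }

    static long icebergMicrosToTrinoTime(long micros) {
        return micros * PICOSECONDS_PER_MICROSECOND;
    }

    public static void main(String[] args) {
        long picos = 12_345L * PICOSECONDS_PER_MICROSECOND;
        System.out.println(trinoTimeToIcebergMicros(picos)); // 12345
    }
}
```

Pairing each Trino-to-Iceberg conversion with its inverse in a single class (as suggested, mirroring io.trino.plugin.iceberg.IcebergTypes#convertIcebergValueToTrino) keeps the two directions from drifting apart.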

@Override
public <T> void set(int i, T t)
{
throw new TrinoException(NOT_SUPPORTED, "writing to TrinoRow is not supported");
Member

TrinoException should be used only when we know what is the reason of the failure.

throw new UnsupportedOperationException();

@Override
protected InputFile getInputFile(String s)
{
return fileIO.newInputFile(s);
Member

is this method used?

@findepi
Member

findepi commented Nov 22, 2021

How can we test this?

@jackye1995
Member Author

close in favor of #10075

@jackye1995 jackye1995 closed this Nov 26, 2021

7 participants