
Refactor Iceberg table statistics to be deterministic#9906

Merged
findepi merged 1 commit into trinodb:master from alexjo2144:iceberg/stats-refactor
Jan 21, 2022

Conversation

@alexjo2144
Member

Fixes #9716

Existing table statistics were non-deterministic because they depended on the order that data files were loaded from the Iceberg API. This hopefully cleans the code up a bit and makes it more consistent.

@cla-bot cla-bot bot added the cla-signed label Nov 8, 2021
@alexjo2144 alexjo2144 added the WIP label Nov 8, 2021
@findepi
Member

findepi commented Nov 9, 2021

@alexjo2144 the build is red

@alexjo2144 alexjo2144 force-pushed the iceberg/stats-refactor branch 3 times, most recently from d4515b5 to 248943a on January 11, 2022 20:46
@alexjo2144 alexjo2144 requested review from findepi and phd3 January 12, 2022 20:54
@alexjo2144
Member Author

@findepi finally had some time to get back to this. Mind taking a look?

@alexjo2144 alexjo2144 removed the WIP label Jan 12, 2022
@alexjo2144 alexjo2144 requested a review from jirassimok January 12, 2022 20:55
Member

@Immutable, final

Member

defensive copy

Comment on lines 127 to 128
Member

defensive copy

Member

add empty line between immutable and mutable state

Member

acceptDataFile ?

Comment on lines 196 to 199
Member

computeIfAbsent (without another get) gives you 0.5x map access and skips IcebergStatisticsBuilder allocation.
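A minimal sketch of the pattern the reviewer is pointing at; StringBuilder stands in here for the PR's statistics-builder class, purely for illustration:

```java
import java.util.HashMap;
import java.util.Map;

public class ComputeIfAbsentDemo
{
    public static void main(String[] args)
    {
        Map<Integer, StringBuilder> builders = new HashMap<>();

        // computeIfAbsent performs a single map access and only invokes the
        // factory lambda when the key is missing, unlike the two-step
        // containsKey/put (or get, check, put) pattern, which always
        // touches the map at least twice and may allocate needlessly.
        builders.computeIfAbsent(1, key -> new StringBuilder()).append("a");
        builders.computeIfAbsent(1, key -> new StringBuilder()).append("b");

        System.out.println(builders.get(1)); // prints "ab"
    }
}
```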

Member

Functions.compose -> plain lambda entry -> ...

Member

add // invalidate

Comment on lines 224 to 229
Member

You're doing 3 map lookups (contains, get, merge).
You can do this in one shot:

nullCounts.merge(id, nullCount, (existingCount, newCount) ->
        existingCount.isPresent() && newCount.isPresent() ? Optional.of(existingCount.get() + newCount.get()) : Optional.empty());

(you can extract lambda body to a method like sumOfOptionals)
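A sketch of what that extraction might look like; sumOfOptionals is the hypothetical helper name suggested above, and the map shape is illustrative:

```java
import java.util.HashMap;
import java.util.Map;
import java.util.Optional;

public class SumOfOptionalsDemo
{
    // Combine two optional counts: the sum when both are present, empty otherwise.
    static Optional<Long> sumOfOptionals(Optional<Long> first, Optional<Long> second)
    {
        return first.isPresent() && second.isPresent()
                ? Optional.of(first.get() + second.get())
                : Optional.empty();
    }

    public static void main(String[] args)
    {
        Map<Integer, Optional<Long>> nullCounts = new HashMap<>();
        nullCounts.put(7, Optional.of(3L));

        // One map access instead of contains + get + merge.
        nullCounts.merge(7, Optional.of(2L), SumOfOptionalsDemo::sumOfOptionals);
        System.out.println(nullCounts.get(7)); // prints "Optional[5]"

        // A missing count on either side invalidates the total.
        nullCounts.merge(7, Optional.empty(), SumOfOptionalsDemo::sumOfOptionals);
        System.out.println(nullCounts.get(7)); // prints "Optional.empty"
    }
}
```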

Member

This method is also equivalent to this:

nullCounts.merge(id, nullCount, (oldCount, newCount) -> 
        oldCount.flatMap(oldValue ->
                newCount.map(newValue -> newValue + oldValue)));

Though after looking at this, it's definitely less clear about the intent; I would still use this form if extracting a method to add optionals.

Actually, instead of just addition, maybe mergeOptionals would be better (I see somewhere else in this file it could be used). Here, it would be used like mergeOptionals(oldCount, newCount, Long::sum).

/**
 * Apply a function to the values in two optionals, returning an optional containing the result.
 * If either argument is empty, return empty.
 */
<A, B, C> Optional<C> mergeOptionals(Optional<A> a, Optional<B> b, BiFunction<A, B, C> mergeFunction)
{
    return a.flatMap(aa -> b.map(bb -> mergeFunction.apply(aa, bb)));
}

Member

return a.flatMap(aa -> b.map(bb -> mergeFunction.apply(aa, bb)));

i thought about that, but i don't find it readable; that's why i suggested using ?:

Member

where did these go to?

Member Author

Before my changes the ORC version of this table was deemed to have invalid column metrics, thus the NULL min/max/null count rows. I think that was a bug though, and the stats are the same now for both Parquet and ORC, with a few exceptions below.

Member

@jirassimok jirassimok left a comment

I don't fully understand how this works, but overall it looks pretty good.

Member

Should the builder's method return this?

Comment on lines 154 to 155
Member

Do these need to be declared so long before they're used?

Member Author

Any closer and they'd be inside the loop

Comment on lines 160 to 187
Member

I see partitionValue.ifPresentOrElse here to avoid Optional.get.

Or maybe just an ifPresent, leaving the updateNullCountStats call outside:

partitionValues.get(id).ifPresent(partition -> {
    // ...
    updateMinMaxStats(...);
});
updateNullCountStats(id, partitionValue.map(v -> 0).orElseGet(dataFile::recordCount));

Member Author

I see what you're getting at. I kinda like the separation as it is because there's a "this partition value is non-null" block, and a "this partition value is null" block.

Comment on lines 180 to 181
Member

I'm not a fan of these null checks.

Why not inline upperBounds and lowerBounds here as optionals (changing the signature of convertBounds)?

Object lowerBound = convertBounds(idToTypeMapping, dataFile.lowerBounds())
    .map(bounds -> convertIcebergValueToTrino(column.type(), bounds))
    .orElse(null);

Member

Actually, maybe the bounds should be Optional themselves, rather than nullable.

Comment on lines 253 to 259
Member

This could be a stream.

idToMetricMap.entrySet().stream().map(...).collect(toImmutableMap(Entry::getKey, Entry::getValue))
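A minimal sketch of the suggested stream rewrite; it uses the JDK's Collectors.toUnmodifiableMap in place of Guava's toImmutableMap, and the map contents and transformation are made up for illustration:

```java
import java.util.Map;
import java.util.stream.Collectors;

public class StreamCollectDemo
{
    public static void main(String[] args)
    {
        Map<Integer, Long> idToMetricMap = Map.of(1, 10L, 2, 20L);

        // A build-up loop with repeated put calls, rewritten as a single
        // stream pipeline that collects directly into an immutable map.
        Map<Integer, Long> doubled = idToMetricMap.entrySet().stream()
                .collect(Collectors.toUnmodifiableMap(
                        Map.Entry::getKey,
                        entry -> entry.getValue() * 2));

        System.out.println(doubled.get(2)); // prints "40"
    }
}
```

Note that, like ImmutableMap, toUnmodifiableMap throws on duplicate keys, so the same caveat about key uniqueness raised below applies here too.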

Member

Or make the method return an Optional (also noted in an earlier comment).

Member

If it is, ImmutableMap will throw an exception.

Comment on lines 295 to 325
Member

This could use ifPresent, or even better, it could use map or mergeOptionals (suggested above).

this.min = this.min.map(currentMin ->
        min != null && compareTrinoValue(min, currentMin) < 0 ? min : currentMin);

this.min = mergeOptionals(this.min, Optional.ofNullable(min), (currentValue, newValue) ->
        compareTrinoValue(newValue, currentValue) < 0 ? newValue : currentValue);

(If you make the bound variables Optionals as I suggested in an earlier comment, then I think mergeOptionals is best here. Otherwise, I think the map version is better.)

Member Author

These miss a case where we want to invalidate the stats. For example, if the first file has stats for a column and then the second file does not, we should treat that the same as if the order is reversed.

@alexjo2144
Member Author

Comments addressed in the fixup; however, I realized this doesn't work with tables that have gone through some schema evolution. I need to work on a fix for that before re-review.

Member

@findepi findepi left a comment

comments to the second fixup

Member

Are Iceberg Types safe to use as map keys?

(my initial thought was to have List<> columnTrinoType and correlate with columns based on list index.)

Member

Mapping them here is useful when you have many columns of same type, with stats.
And it only matters when # files isn't huge.

Doing this in ColumnStatistics::new is simpler and IMO sufficient, as you do lookup per column.
(originally you had lookup per column x file)

Member

Why Optional?
How does empty() differ from empty map?

Member

This is what breaks the schema evolution.
I notice that you didn't have this check previously and it was deemed OK.

Maybe we want if (identityPartitionFieldIds.contains(id) && partitionValues.containsKey(id)), so that we still try to take min/max from file stats?

Actually, is identityPartitionFieldIds.contains(id) important?
if partitionValues.containsKey(id) should be enough. The current table partitioning is not important when calculating the stats.

Member Author

Actually, is identityPartitionFieldIds.contains(id) important?

I was using that as a proxy for checking whether the partition has a transform. If the partitioning is on hour(ts) we can't use the partition information to calculate max(ts), but you're right that we can't use the current partitioning; it needs to be the spec for that file.

Member

lowerBounds, upperBounds are unnecessarily Optional. "No entry" (null) is treated the same as "no map at all".

Member

Do typeToComparisonHandle.get(type) only if constructing new ColumnStatistics

Member

i missed that previously -- the ColumnStatistics captures initial bounds, so on the first round the call to updateMinMax is a no-op.

You can skip the call in a somewhat verbose manner with Map.compute

columnStatistics.compute(id, (ignored, columnStatistics) -> {
    if (columnStatistics == null) {
        columnStatistics = new ColumnStatistics(lowerBound, upperBound, comparisonHandle);
    }
    else {
        columnStatistics.updateMinMax(lowerBound, upperBound);
    }
    return columnStatistics;
});

(or document that you're doing what you're doing currently)

Member

comparisonHandle is immutable state, so fits better as first arg (as you put the field order)

Member

i'd remove Optional.of().
also Optional.empty() -> "Empty"

(the fact that value is wrapped in an Optional is obvious, and doesn't need to be talked about)

@alexjo2144
Member Author

Linking a thread I started on the iceberg slack channel on how to deal with schema evolution vs missing metrics, still don't have a clear answer for how to tell the two apart though https://apache-iceberg.slack.com/archives/C025PH0G1D4/p1642177083095300

@alexjo2144
Member Author

AC thanks

@findepi
Member

findepi commented Jan 21, 2022

Member

@findepi findepi left a comment

@alexjo2144 please squash (sans rebase)

Member

Fix bug in Iceberg partition schema evolution

this will be squashed, right?

also, should we have a test, with two partitioning transformations over a column?

Member Author

Yeah, I'll put it in a separate PR though

@nineinchnick
Member

nineinchnick commented Jan 21, 2022

> @nineinchnick what is Pull Request Labeler / Test Report failure? https://github.com/trinodb/trino/pull/9906/checks?check_run_id=4862862822 / https://github.com/trinodb/trino/runs/4862862822

@findepi the Test Report is a check added by a separate workflow that's triggered by, but not associated with, the PR, so it doesn't have its own workflow header and looks like it belongs to the Pull Request Labeler workflow. This is a GitHub UI issue. The check contains annotations with a summary of failures in jobs of the ci workflow, so you don't have to go into individual jobs and search for failures.

It's marked as failed, because there were other failures. Marking it as successful would be a false positive if it contained error annotations.

@findepi
Member

findepi commented Jan 21, 2022

@nineinchnick could it be attached to the ci flow somehow?
it's confusing to see Pull Request Labeler fail.

@alexjo2144 alexjo2144 force-pushed the iceberg/stats-refactor branch from 64bbf63 to 712277b on January 21, 2022 15:28
@alexjo2144
Member Author

Squashed. Thanks @findepi

@findepi findepi merged commit df61aea into trinodb:master Jan 21, 2022
@github-actions github-actions bot added this to the 369 milestone Jan 21, 2022
@alexjo2144 alexjo2144 deleted the iceberg/stats-refactor branch February 7, 2022 14:43
@vincentpoon
Member

We have millions of files and find SELECT * from table$partitions hangs in the acceptDataFile here.

@findepi
Member

findepi commented Feb 18, 2022

@vincentpoon this PR probably didn't change how the stats are calculated, just made the code saner & "more correct"
or, do you mean that what you're observing is a behavioral regression?

in any case, let's have an issue


Development

Successfully merging this pull request may close these issues.

Iceberg table statistics are non-deterministic

5 participants