Test all file formats in TestSparkCompatibility by lxynov · Pull Request #6699 · trinodb/trino

lxynov · 2021-01-23T06:29:57Z

Trino started failing to read Spark-created ORC Iceberg tables for some reason. See CI result for reference

Update: Fixed after iceberg.use-file-size-from-metadata=false is set. Thanks to @phd3

phd3

@lxynov One possible reason behind this is #6174 , can you please try with iceberg.use-file-size-from-metadata=false ( PR #6539 )?

lxynov · 2021-01-26T02:53:54Z

Thanks @phd3! It was indeed because the Spark writer populated incorrect file sizes in Iceberg metadata. Now the test passes. I've just requested a review from you since you've probably reviewed this part when you were reviewing #4776

phd3

Thanks, just some minor comments.

...ces/docker/presto-product-tests/conf/environment/singlenode-spark-iceberg/iceberg.properties

testing/trino-product-tests/src/main/java/io/trino/tests/iceberg/TestSparkCompatibility.java

lxynov · 2021-01-30T03:26:20Z

@phd3 Changed TestSparkCompatibility to include formats in temporary table names to circumvent concurrency issues. This PR should be good to go. cc @electrum

phd3 · 2021-02-06T01:12:43Z

testing/trino-product-tests/src/main/java/io/trino/tests/iceberg/TestSparkCompatibility.java

+        String baseTableName = "test_spark_reads_presto_partitioned_table_" + format;
        String prestoTableName = prestoTableName(baseTableName);
-        onPresto().executeQuery(format("CREATE TABLE %s (_string VARCHAR, _bigint BIGINT) WITH (partitioning = ARRAY['_string'])", prestoTableName));
+        onPresto().executeQuery(format("CREATE TABLE %s (_string VARCHAR, _bigint BIGINT) WITH (partitioning = ARRAY['_string'], format = '" + format + "')", prestoTableName));


Suggested change

onPresto().executeQuery(format("CREATE TABLE %s (_string VARCHAR, _bigint BIGINT) WITH (partitioning = ARRAY['_string'], format = '" + format + "')", prestoTableName));

onPresto().executeQuery(format("CREATE TABLE %s (_string VARCHAR, _bigint BIGINT) WITH (partitioning = ARRAY['_string'], format = '%s')", prestoTableName, format));

sorry if it was confusing, meant a change like the above in #6699 (comment)

ebyhr · 2022-07-13T01:37:32Z

Superseded by #8751

cla-bot bot added the cla-signed label Jan 23, 2021

phd3 reviewed Jan 25, 2021

View reviewed changes

lxynov force-pushed the TestSparkCompatibility branch 2 times, most recently from ee8bf92 to 465f8a8 Compare January 25, 2021 23:36

lxynov changed the title ~~[WIP] Test all file formats in TestSparkCompatibility~~ Test all file formats in TestSparkCompatibility Jan 26, 2021

lxynov mentioned this pull request Jan 26, 2021

Add Avro support to Iceberg Connector #4776

Closed

lxynov requested a review from phd3 January 26, 2021 02:50

phd3 reviewed Jan 26, 2021

View reviewed changes

lxynov added 2 commits January 27, 2021 09:40

Fix typo

f00c464

Drop table at the end of tests

a2db7ca

lxynov force-pushed the TestSparkCompatibility branch from 465f8a8 to 63ac92b Compare January 27, 2021 17:47

Test all file formats in TestSparkCompatibility

4dc1f6a

lxynov force-pushed the TestSparkCompatibility branch from 63ac92b to 4dc1f6a Compare January 29, 2021 23:25

phd3 self-requested a review February 3, 2021 04:25

phd3 reviewed Feb 6, 2021

View reviewed changes

findepi force-pushed the master branch from 8538e49 to 1f896ea Compare July 30, 2021 22:13

ebyhr closed this Jul 13, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Test all file formats in TestSparkCompatibility#6699

Test all file formats in TestSparkCompatibility#6699
lxynov wants to merge 3 commits intotrinodb:masterfrom
lxynov:TestSparkCompatibility

lxynov commented Jan 23, 2021 •

edited

Loading

Uh oh!

phd3 left a comment

Uh oh!

lxynov commented Jan 26, 2021

Uh oh!

phd3 left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

lxynov commented Jan 30, 2021

Uh oh!

phd3 Feb 6, 2021

Uh oh!

ebyhr commented Jul 13, 2022

Uh oh!

Reviewers

Assignees

Labels

Milestone

Development

Uh oh!

3 participants

	onPresto().executeQuery(format("CREATE TABLE %s (_string VARCHAR, _bigint BIGINT) WITH (partitioning = ARRAY['_string'], format = '" + format + "')", prestoTableName));
	onPresto().executeQuery(format("CREATE TABLE %s (_string VARCHAR, _bigint BIGINT) WITH (partitioning = ARRAY['_string'], format = '%s')", prestoTableName, format));

Conversation

lxynov commented Jan 23, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

phd3 left a comment

Choose a reason for hiding this comment

Uh oh!

lxynov commented Jan 26, 2021

Uh oh!

phd3 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

lxynov commented Jan 30, 2021

Uh oh!

phd3 Feb 6, 2021

Choose a reason for hiding this comment

Uh oh!

ebyhr commented Jul 13, 2022

Uh oh!

Reviewers

Assignees

Labels

Milestone

Development

Uh oh!

3 participants

lxynov commented Jan 23, 2021 •

edited

Loading