Skip to content

Showcase incompatibilities between optimized parquet writer and hive#10613

Merged
findepi merged 2 commits intotrinodb:masterfrom
findinpath:showcase-incompatibilities-between-optimized-parquet-writer-and-hive
Jan 27, 2022
Merged

Showcase incompatibilities between optimized parquet writer and hive#10613
findepi merged 2 commits intotrinodb:masterfrom
findinpath:showcase-incompatibilities-between-optimized-parquet-writer-and-hive

Conversation

@findinpath
Copy link
Copy Markdown
Contributor

No description provided.

@findinpath findinpath force-pushed the showcase-incompatibilities-between-optimized-parquet-writer-and-hive branch from c423590 to 206f1dc Compare January 14, 2022 16:40
Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Could be worth having two tests, one with a short timestamp precision and one with a long timestamp precision.

@findinpath findinpath force-pushed the showcase-incompatibilities-between-optimized-parquet-writer-and-hive branch from 206f1dc to 6d2d359 Compare January 25, 2022 06:52
@findepi findepi merged commit 6028e90 into trinodb:master Jan 27, 2022
@github-actions github-actions bot added this to the 370 milestone Jan 27, 2022

// Hive expects `FIXED_LEN_BYTE_ARRAY` for decimal values irrespective of the Parquet specification which allows `INT32`, `INT64` for short precision decimal types
assertQueryFailure(() -> onHive().executeQuery("SELECT a_decimal FROM " + tableName))
.hasMessageMatching(".* org.apache.parquet.io.ParquetDecodingException: Can not read value at 1 in block 0 in file .*");
Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think this is consistently failing with missing org.apache. in the package name:

tests               | 2022-01-31 12:13:37 INFO: FAILURE     /    io.trino.tests.product.hive.TestHiveCompatibility.testSmallDecimalFieldWrittenByOptimizedParquetWriterCannotBeReadByHive (Groups: storage_formats_detailed) took 0.7 seconds
tests               | 2022-01-31 12:13:37 SEVERE: Failure cause:
tests               | java.lang.AssertionError: 
tests               | Expecting message:
tests               |   <"java.io.IOException: parquet.io.ParquetDecodingException: Can not read value at 1 in block 0 in file hdfs://hadoop-master:9000/user/hive/warehouse/parquet_table_small_decimal_created_in_trino/20220131_062836_01082_j2ikr_8c200c2a-4add-4f35-a3ed-b70c4732e54b">
tests               | to match regex:
tests               |   <".* org.apache.parquet.io.ParquetDecodingException: Can not read value at 1 in block 0 in file .*">
tests               | but did not.

Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

fixed now

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Development

Successfully merging this pull request may close these issues.

4 participants