Add support for writing partitionValues_parsed for the add entries in the Delta Lake checkpoint #19662
@ebyhr please run the PR with secrets.
/test-with-secrets sha=d49d727bc2d0a1aac8968aaa6a9bbb7af724781d
The CI workflow run with tests that require additional secrets finished as failure: https://github.com/trinodb/trino/actions/runs/6818424653
Could you rebase on master to resolve conflicts?
CI hit #16315.
Is TIMESTAMP_NTZ type disallowed as a partition column? It would be nice to create the table in Spark and cover the type if it's supported.
Added a corresponding test in TestDeltaLakeBasic.
Thanks, but TestDeltaLakeBasic doesn't ensure that the field in the checkpoint is readable by Spark.
@ebyhr AC
Description
Write the `partitionValues_parsed` field for the `add` entries in the Delta Lake checkpoint file.

This information can be used to massively reduce the amount of data read from the checkpoint file when a `SELECT` query on the Delta Lake table has a partition filter. See #19588 for details.

Fixes #19586
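
For readers unfamiliar with the field, here is a minimal sketch of the idea, not the code from this PR. In the Delta Lake protocol, `partitionValues` on an `add` entry is a string-to-string map, while `partitionValues_parsed` stores the same values in the native types of the partition columns. The class name, the `ColumnType` enum, and the `parse` and `matchesEquality` helpers below are hypothetical and exist only for illustration; the real writer lives in `CheckpointWriter` and handles Trino types.

```java
import java.time.LocalDate;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.Objects;

public class PartitionValuesParsedSketch {
    // Hypothetical subset of partition column types, for illustration only.
    public enum ColumnType { INTEGER, BIGINT, DATE, STRING }

    // Convert the string-encoded partitionValues map into typed values,
    // which is the idea behind the partitionValues_parsed struct.
    public static Map<String, Object> parse(
            Map<String, String> partitionValues,
            Map<String, ColumnType> partitionSchema) {
        Map<String, Object> parsed = new LinkedHashMap<>();
        for (Map.Entry<String, ColumnType> column : partitionSchema.entrySet()) {
            String raw = partitionValues.get(column.getKey());
            // A missing partition value is kept as null.
            parsed.put(column.getKey(), raw == null ? null : parseValue(raw, column.getValue()));
        }
        return parsed;
    }

    private static Object parseValue(String raw, ColumnType type) {
        switch (type) {
            case INTEGER:
                return Integer.valueOf(raw);
            case BIGINT:
                return Long.valueOf(raw);
            case DATE:
                return LocalDate.parse(raw); // expects ISO format, e.g. 2023-11-09
            default:
                return raw;
        }
    }

    // Evaluate a simple equality partition filter against the typed values,
    // without re-parsing strings for every entry.
    public static boolean matchesEquality(Map<String, Object> parsed, String column, Object expected) {
        return Objects.equals(parsed.get(column), expected);
    }

    public static void main(String[] args) {
        Map<String, ColumnType> schema = new LinkedHashMap<>();
        schema.put("year", ColumnType.INTEGER);
        schema.put("day", ColumnType.DATE);

        Map<String, Object> parsed = parse(Map.of("year", "2023", "day", "2023-11-09"), schema);
        System.out.println(parsed);
        System.out.println(matchesEquality(parsed, "year", 2023));
    }
}
```

Because the parsed values are stored as typed columns in the checkpoint, a reader with a partition filter can in principle compare against them directly instead of decoding the string map for every entry. How Trino uses the field on the read side is described in #19588.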
Additional context and related issues
Release notes
(x) This is not user-visible or is docs only, and no release notes are required.