Skip to content

Do not hide errors, which trigger manual delta check point file read#28808

Merged
ebyhr merged 1 commit intotrinodb:masterfrom
vlad-lyutenko:vlad-lyutenko/fix-delta-log
Mar 25, 2026
Merged

Do not hide errors, which trigger manual delta check point file read#28808
ebyhr merged 1 commit intotrinodb:masterfrom
vlad-lyutenko:vlad-lyutenko/fix-delta-log

Conversation

@vlad-lyutenko
Copy link
Copy Markdown
Contributor

@vlad-lyutenko vlad-lyutenko commented Mar 23, 2026

Description

It could be situation, like access deny or other permission failure error, which actually should not trigger manual check point read, because we can not be sure, which exceptions could appear here - at least we should not silently hide them, after we collect more of such situations we could add exclusion rules here

io.trino.plugin.deltalake.DeltaLakeMetadata.getTableHandle (DeltaLakeMetadata.java:831)
io.trino.plugin.deltalake.DeltaLakeMetadata.getTableHandle (DeltaLakeMetadata.java:406)
io.trino.plugin.objectstore.TracingObjectStoreConnectorMetadata.getTableHandle (TracingObjectStoreConnectorMetadata.java:157)
io.trino.plugin.objectstore.ObjectStoreMetadata.getTableHandleInOrder (ObjectStoreMetadata.java:250)
io.trino.plugin.objectstore.ObjectStoreMetadata.getTableHandle (ObjectStoreMetadata.java:278)
io.trino.plugin.base.classloader.ClassLoaderSafeConnectorMetadata.getTableHandle (ClassLoaderSafeConnectorMetadata.java:1342)
io.trino.tracing.TracingConnectorMetadata.getTableHandle (TracingConnectorMetadata.java:144)
io.trino.metadata.MetadataManager.lambda$getTableHandle$0 (MetadataManager.java:303)
java.util.Optional.flatMap (Optional.java:289)
io.trino.metadata.MetadataManager.getTableHandle (MetadataManager.java:294)
io.trino.metadata.MetadataManager.getRedirectionAwareTableHandle (MetadataManager.java:2017)
io.trino.tracing.TracingMetadata.getRedirectionAwareTableHandle (TracingMetadata.java:1639)
io.trino.sql.analyzer.StatementAnalyzer$Visitor.getTableHandle (StatementAnalyzer.java:6084)
io.trino.sql.analyzer.StatementAnalyzer$Visitor.visitTable (StatementAnalyzer.java:2334)
io.trino.sql.analyzer.StatementAnalyzer$Visitor.visitTable (StatementAnalyzer.java:534)
io.trino.sql.tree.Table.accept (Table.java:70)
io.trino.sql.tree.AstVisitor.process (AstVisitor.java:27)
io.trino.sql.analyzer.StatementAnalyzer$Visitor.process (StatementAnalyzer.java:553)
io.trino.sql.analyzer.StatementAnalyzer$Visitor.analyzeFrom (StatementAnalyzer.java:5108)
io.trino.sql.analyzer.StatementAnalyzer$Visitor.visitQuerySpecification (StatementAnalyzer.java:3181)
io.trino.sql.analyzer.StatementAnalyzer$Visitor.visitQuerySpecification (StatementAnalyzer.java:534)
io.trino.sql.tree.QuerySpecification.accept (QuerySpecification.java:155)
io.trino.sql.tree.AstVisitor.process (AstVisitor.java:27)
io.trino.sql.analyzer.StatementAnalyzer$Visitor.process (StatementAnalyzer.java:553)
io.trino.sql.analyzer.StatementAnalyzer$Visitor.process (StatementAnalyzer.java:561)
io.trino.sql.analyzer.StatementAnalyzer$Visitor.visitQuery (StatementAnalyzer.java:1607)
io.trino.sql.analyzer.StatementAnalyzer$Visitor.visitQuery (StatementAnalyzer.java:534)
io.trino.sql.tree.Query.accept (Query.java:130)
io.trino.sql.tree.AstVisitor.process (AstVisitor.java:27)
io.trino.sql.analyzer.StatementAnalyzer$Visitor.process (StatementAnalyzer.java:553)
io.trino.sql.analyzer.StatementAnalyzer.analyze (StatementAnalyzer.java:513)
io.trino.sql.analyzer.StatementAnalyzer.analyze (StatementAnalyzer.java:502)
io.trino.sql.analyzer.Analyzer.analyze (Analyzer.java:102)
io.trino.sql.analyzer.Analyzer.lambda$analyze$0 (Analyzer.java:91)
io.trino.sql.analyzer.Analyzer.analyze (Analyzer.java:91)
io.trino.execution.SqlQueryExecution.analyze (SqlQueryExecution.java:359)
io.trino.execution.SqlQueryExecution.<init> (SqlQueryExecution.java:253)
io.trino.execution.SqlQueryExecution$SqlQueryExecutionFactory.createQueryExecution (SqlQueryExecution.java:1033)
io.trino.dispatcher.LocalDispatchQueryFactory.lambda$createDispatchQuery$0 (LocalDispatchQueryFactory.java:163)
...
com.google.common.util.concurrent.TrustedListenableFutureTask$TrustedFutureInterruptibleTask.runInterruptibly (TrustedListenableFutureTask.java:128)
com.google.common.util.concurrent.InterruptibleTask.run (InterruptibleTask.java:74)
com.google.common.util.concurrent.TrustedListenableFutureTask.run (TrustedListenableFutureTask.java:80)
java.util.concurrent.ThreadPoolExecutor.runWorker (ThreadPoolExecutor.java:1090)
java.util.concurrent.ThreadPoolExecutor$Worker.run (ThreadPoolExecutor.java:614)
java.lang.Thread.run (Thread.java:1474)


io.trino.plugin.deltalake.CorruptedDeltaLakeTableHandle.createException (CorruptedDeltaLakeTableHandle.java:44)
io.trino.plugin.deltalake.DeltaLakeMetadata.checkValidTableHandle (DeltaLakeMetadata.java:5142)
io.trino.plugin.deltalake.DeltaLakeMetadata.getTableMetadata (DeltaLakeMetadata.java:973)
io.trino.plugin.objectstore.TracingObjectStoreConnectorMetadata.getTableMetadata (TracingObjectStoreConnectorMetadata.java:256)
io.trino.plugin.objectstore.ObjectStoreMetadata.getTableMetadata (ObjectStoreMetadata.java:593)
io.trino.spi.connector.ConnectorMetadata.getTableSchema (ConnectorMetadata.java:237)
io.trino.plugin.base.classloader.ClassLoaderSafeConnectorMetadata.getTableSchema (ClassLoaderSafeConnectorMetadata.java:271)
io.trino.tracing.TracingConnectorMetadata.getTableSchema (TracingConnectorMetadata.java:234)
io.trino.metadata.MetadataManager.getTableSchema (MetadataManager.java:489)
io.trino.tracing.TracingMetadata.getTableSchema (TracingMetadata.java:311)
io.trino.sql.analyzer.StatementAnalyzer$Visitor.visitTable (StatementAnalyzer.java:2346)
io.trino.sql.analyzer.StatementAnalyzer$Visitor.visitTable (StatementAnalyzer.java:534)
io.trino.sql.tree.Table.accept (Table.java:70)
io.trino.sql.tree.AstVisitor.process (AstVisitor.java:27)
io.trino.sql.analyzer.StatementAnalyzer$Visitor.process (StatementAnalyzer.java:553)
io.trino.sql.analyzer.StatementAnalyzer$Visitor.analyzeFrom (StatementAnalyzer.java:5108)
io.trino.sql.analyzer.StatementAnalyzer$Visitor.visitQuerySpecification (StatementAnalyzer.java:3181)
io.trino.sql.analyzer.StatementAnalyzer$Visitor.visitQuerySpecification (StatementAnalyzer.java:534)
io.trino.sql.tree.QuerySpecification.accept (QuerySpecification.java:155)
io.trino.sql.tree.AstVisitor.process (AstVisitor.java:27)
io.trino.sql.analyzer.StatementAnalyzer$Visitor.process (StatementAnalyzer.java:553)
io.trino.sql.analyzer.StatementAnalyzer$Visitor.process (StatementAnalyzer.java:561)
io.trino.sql.analyzer.StatementAnalyzer$Visitor.visitQuery (StatementAnalyzer.java:1607)
io.trino.sql.analyzer.StatementAnalyzer$Visitor.visitQuery (StatementAnalyzer.java:534)
io.trino.sql.tree.Query.accept (Query.java:130)
io.trino.sql.tree.AstVisitor.process (AstVisitor.java:27)
io.trino.sql.analyzer.StatementAnalyzer$Visitor.process (StatementAnalyzer.java:553)
io.trino.sql.analyzer.StatementAnalyzer.analyze (StatementAnalyzer.java:513)
io.trino.sql.analyzer.StatementAnalyzer.analyze (StatementAnalyzer.java:502)
io.trino.sql.analyzer.Analyzer.analyze (Analyzer.java:102)
io.trino.sql.analyzer.Analyzer.lambda$analyze$0 (Analyzer.java:91)
io.trino.sql.analyzer.Analyzer.analyze (Analyzer.java:91)
io.trino.execution.SqlQueryExecution.analyze (SqlQueryExecution.java:359)
io.trino.execution.SqlQueryExecution.<init> (SqlQueryExecution.java:253)
io.trino.execution.SqlQueryExecution$SqlQueryExecutionFactory.createQueryExecution (SqlQueryExecution.java:1033)
io.trino.dispatcher.LocalDispatchQueryFactory.lambda$createDispatchQuery$0 (LocalDispatchQueryFactory.java:163)
...
com.google.common.util.concurrent.TrustedListenableFutureTask$TrustedFutureInterruptibleTask.runInterruptibly (TrustedListenableFutureTask.java:128)
com.google.common.util.concurrent.InterruptibleTask.run (InterruptibleTask.java:74)
com.google.common.util.concurrent.TrustedListenableFutureTask.run (TrustedListenableFutureTask.java:80)
java.util.concurrent.ThreadPoolExecutor.runWorker (ThreadPoolExecutor.java:1090)
java.util.concurrent.ThreadPoolExecutor$Worker.run (ThreadPoolExecutor.java:614)
java.lang.Thread.run (Thread.java:1474)

Additional context and related issues

Release notes

(x) This is not user-visible or is docs only, and no release notes are required.

@cla-bot cla-bot bot added the cla-signed label Mar 23, 2026
@vlad-lyutenko vlad-lyutenko requested a review from ebyhr March 23, 2026 14:37
@github-actions github-actions bot added the delta-lake Delta Lake connector label Mar 23, 2026
@wendigo wendigo requested a review from chenjian2664 March 23, 2026 14:41
// it could be situation, like access deny or other permission failure error, which actually should not trigger manual check point read
// because we can not be sure, which exceptions could appear here, at least we should not silently hide them
// TODO after we collect more of such situations we could add exclusion rules here
log.warn(e, "Failed to read Delta Lake last checkpoint file %s, falling back to manual checkpoint discovery", checkpointPath);
Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why warning log? What is the expected action when users see this message?

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

hi @ebyhr, because of this comment :

// but some file system implementations
// will throw different exceptions if the checkpoint is not found

we can not be 100% sure that this will bring us to the point, where reading check point file manually will fail.

But generally should.

Imagine situation (before this PR), some changes on directory access permission

  • we got exception here - strict permission error
  • silently hide it (because we think that it's legal and we just need to find latest check point manually) and return Optional.empty() here
  • we try to find it and read
  • and got completely another exception smth like Metadata not found in transaction log for, because we hide original permission error.

So with this change user will be able to understand what is original cause.
Maybe log level should be error, but in this case some legit exceptional situation will be highlighted as error.

@ebyhr
Copy link
Copy Markdown
Member

ebyhr commented Mar 23, 2026

Could you fix error-prone failure?

Error:  /home/runner/work/trino/trino/plugin/trino-delta-lake/src/main/java/io/trino/plugin/deltalake/transactionlog/TransactionLogParser.java:[306,60] [UnnecessarilyFullyQualified] This fully qualified name is unambiguous to the compiler if imported.

It could be situation, like access deny or other permission failure error,
which actually should not trigger manual check point read,
because we can not be sure, which exceptions could appear here -
at least we should not silently hide them, after we collect more of such situations
we could add exclusion rules here
@vlad-lyutenko vlad-lyutenko force-pushed the vlad-lyutenko/fix-delta-log branch from 4edf22a to 3a530ff Compare March 24, 2026 10:32
@ebyhr ebyhr merged commit 1f29e53 into trinodb:master Mar 25, 2026
26 checks passed
@github-actions github-actions bot added this to the 481 milestone Mar 25, 2026
@ebyhr ebyhr mentioned this pull request Mar 27, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

cla-signed delta-lake Delta Lake connector

Development

Successfully merging this pull request may close these issues.

3 participants