7hong (Contributor) commented Nov 29, 2024

Description

When using an Iceberg table, a new xxx-metadata.json file is generated on every commit. Iceberg itself can clean up previous metadata files automatically through the following table properties:

write.metadata.delete-after-commit.enabled = true
write.metadata.previous-versions-max = 10

However, Trino does not clean up these metadata files automatically, so a large number of metadata.json files accumulate on HDFS.

Unlike #20863, this change follows the existing Iceberg properties (`write.metadata.delete-after-commit.enabled` and `write.metadata.previous-versions-max`). Rather than adding a new configuration, it stays compatible with Iceberg.
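To illustrate the retention semantics of the two properties, here is a minimal Python sketch. It is not the code in this PR: the function name `split_metadata_files` and the file names are hypothetical, and it assumes the previous metadata files are listed oldest first, excluding the current one.

```python
from typing import List, Tuple


def split_metadata_files(
    previous_files: List[str],
    previous_versions_max: int,
    delete_after_commit: bool,
) -> Tuple[List[str], List[str]]:
    """Decide which previous metadata files stay tracked and which get deleted.

    previous_files: previous metadata files, oldest first (assumption: the
        current metadata file is not in this list).
    previous_versions_max: value of write.metadata.previous-versions-max.
    delete_after_commit: value of write.metadata.delete-after-commit.enabled.
    Returns (tracked, to_delete).
    """
    if previous_versions_max < 1:
        raise ValueError("previous_versions_max must be at least 1")

    # Number of oldest entries that no longer fit in the tracked history.
    overflow = max(0, len(previous_files) - previous_versions_max)
    dropped = previous_files[:overflow]
    tracked = previous_files[overflow:]

    # Files that fall out of the tracked history are only physically
    # removed when delete-after-commit is enabled.
    to_delete = dropped if delete_after_commit else []
    return tracked, to_delete


# Hypothetical example: 12 previous versions, keep at most 10.
files = [f"{i:05d}-metadata.json" for i in range(12)]
tracked, to_delete = split_metadata_files(files, 10, True)
print(to_delete)
```

With the settings shown above (`previous-versions-max = 10`), only the two oldest files in this example are selected for deletion. With `delete-after-commit.enabled = false`, nothing is deleted.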

Relates to #14128

Release notes

## Iceberg
* Remove the oldest tracked version metadata files after commit when `write.metadata.delete-after-commit.enabled` is set to `true`. ({issue} #19582)

@cla-bot cla-bot bot added the cla-signed label Nov 29, 2024
@github-actions github-actions bot added the iceberg Iceberg connector label Nov 29, 2024
@7hong 7hong requested a review from ebyhr November 29, 2024 08:13
@ebyhr ebyhr changed the title from "Delete the oldest tracked version metadata files after commit" to "Delete the oldest tracked version metadata files after commit in Iceberg" Nov 29, 2024
@7hong 7hong force-pushed the master branch 3 times, most recently from de4e401 to 1ce6ba0 Compare November 30, 2024 11:21
@7hong 7hong force-pushed the master branch 2 times, most recently from 0b6a250 to d5bc9ae Compare December 1, 2024 04:18
@ebyhr ebyhr force-pushed the master branch 3 times, most recently from 74b52b4 to 447af1b Compare December 1, 2024 05:07
ebyhr previously approved these changes Dec 1, 2024
@7hong 7hong force-pushed the dev branch 8 times, most recently from ab952ce to 6deed07 Compare December 5, 2024 10:01
@ademille

Any more progress on this? I'd love to get this change.

@7hong 7hong force-pushed the dev branch 5 times, most recently from 7ede4d6 to b94871a Compare December 13, 2024 02:05
@7hong 7hong force-pushed the dev branch 3 times, most recently from 4eb13af to e9757d7 Compare December 13, 2024 04:41
7hong (Contributor, Author) commented Dec 13, 2024

@ebyhr I added a test in BaseIcebergConnectorSmokeTest.

ebyhr (Member) commented Dec 24, 2024

/test-with-secrets sha=b8f9227fcc6a6fd89aeac3cc5602aadbc01ca369

github-actions bot commented Dec 24, 2024

The CI workflow run with tests that require additional secrets finished as failure: https://github.com/trinodb/trino/actions/runs/12474870064

@ebyhr ebyhr force-pushed the dev branch 2 times, most recently from 8801c61 to 873f3e9 Compare December 24, 2024 03:10
ebyhr (Member) commented Dec 24, 2024

/test-with-secrets sha=873f3e9c4aa460c5095724224838069813408ad8

github-actions bot commented Dec 24, 2024

The CI workflow run with tests that require additional secrets finished as failure: https://github.com/trinodb/trino/actions/runs/12476325378

ebyhr (Member) commented Dec 24, 2024

/test-with-secrets sha=782ffcf711fa69a86fd68d504a7362fe11e60fa4

The CI workflow run with tests that require additional secrets has been started: https://github.com/trinodb/trino/actions/runs/12476584429
