@dramaticlly
Contributor

close #7975

Since #6682 provided a major deletion performance improvement for the S3 and Hadoop FileIO implementations, I think we can add SupportsBulkOperations to ResolvingFileIO. It currently loads the delegate FileIO dynamically based on the path scheme, and both delegate FileIOs (S3FileIO and HadoopFileIO) support bulk deletion.

If a custom FileIO is configured on the catalog and does not support bulk operations, deletion falls back to standard sequential deletion.
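
For illustration, here is a minimal sketch of the dispatch described above. It is not the code in this PR: `FileIO` and `SupportsBulkOperations` are simplified stand-ins declared locally (not the real Iceberg interfaces), and `ResolvingSketch`, `RecordingIO`, and `RecordingBulkIO` are hypothetical names. It assumes paths are grouped by URI scheme, bulk deletion is used when the delegate supports it, and the sequential per-file path is used otherwise.

```java
import java.net.URI;
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Simplified stand-ins for illustration only; not the real Iceberg interfaces.
interface FileIO {
  void deleteFile(String path);
}

interface SupportsBulkOperations extends FileIO {
  void deleteFiles(Iterable<String> paths);
}

// Hypothetical delegate without bulk support: records each single delete.
class RecordingIO implements FileIO {
  final List<String> calls = new ArrayList<>();

  @Override
  public void deleteFile(String path) {
    calls.add("single:" + path);
  }
}

// Hypothetical delegate with bulk support: records one entry per batch.
class RecordingBulkIO extends RecordingIO implements SupportsBulkOperations {
  @Override
  public void deleteFiles(Iterable<String> paths) {
    List<String> batch = new ArrayList<>();
    paths.forEach(batch::add);
    calls.add("bulk:" + batch.size());
  }
}

// Hypothetical resolver: picks a delegate by URI scheme and dispatches deletes.
class ResolvingSketch implements SupportsBulkOperations {
  private final Map<String, FileIO> ioByScheme;

  ResolvingSketch(Map<String, FileIO> ioByScheme) {
    this.ioByScheme = ioByScheme;
  }

  private FileIO io(String path) {
    String scheme = URI.create(path).getScheme();
    FileIO io = ioByScheme.get(scheme);
    if (io == null) {
      throw new IllegalArgumentException("No FileIO for scheme: " + scheme);
    }
    return io;
  }

  @Override
  public void deleteFile(String path) {
    io(path).deleteFile(path);
  }

  @Override
  public void deleteFiles(Iterable<String> paths) {
    // Group paths by the delegate that owns their scheme.
    Map<FileIO, List<String>> grouped = new LinkedHashMap<>();
    for (String path : paths) {
      grouped.computeIfAbsent(io(path), k -> new ArrayList<>()).add(path);
    }
    for (Map.Entry<FileIO, List<String>> entry : grouped.entrySet()) {
      FileIO io = entry.getKey();
      if (io instanceof SupportsBulkOperations) {
        // Delegate supports bulk deletion: hand over the whole group.
        ((SupportsBulkOperations) io).deleteFiles(entry.getValue());
      } else {
        // Fallback: delete one file at a time.
        entry.getValue().forEach(io::deleteFile);
      }
    }
  }
}
```

With one bulk-capable delegate and one plain delegate registered, a mixed list of paths results in a single batched call on the first and one call per file on the second.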

CC @rdblue @nastra @RussellSpitzer @szehon-ho

@github-actions github-actions bot added the core label Jul 3, 2023
Contributor

@nastra nastra left a comment

I think having this makes a lot of sense, just left a small comment

Contributor

@nastra nastra left a comment

LGTM with one comment

@dramaticlly
Contributor Author

@RussellSpitzer @nastra @bryanck updated per your suggestions, I'd appreciate another look.

@nastra nastra merged commit f4d830b into apache:master Aug 14, 2023
nastra pushed a commit to nastra/iceberg that referenced this pull request Aug 15, 2023
@dramaticlly dramaticlly deleted the resolveFileIO branch January 26, 2025 04:06


Successfully merging this pull request may close these issues.

SupportsBulkOperations for ResolvingFileIO

4 participants