Cache file listings per transaction #11650

Merged
sopel39 merged 3 commits into trinodb:master from starburstdata:ks/cache_listing
Apr 5, 2022
Conversation

Member

@sopel39 sopel39 commented Mar 24, 2022

Related issues, pull requests, and links

# Hive
* Improve performance of queries that scan the same table multiple times. ({issue}`issuenumber`)

@cla-bot cla-bot bot added the cla-signed label Mar 24, 2022
@sopel39 sopel39 force-pushed the ks/cache_listing branch 2 times, most recently from c89ded2 to c49ef9b Compare March 24, 2022 10:07
Member Author

@sopel39 sopel39 left a comment

@sopel39 sopel39 requested a review from findinpath March 24, 2022 22:32
Member

@raunaqmorarka raunaqmorarka left a comment

lgtm % minor comments

@raunaqmorarka
Member

This might have some implications for fault-tolerant execution. E.g., if a file listing is cached and one of the listed files is deleted later on, every retry of the query will fail, because the file is present in the listing cache but not on the actual file system.
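The concern above can be illustrated with a small, self-contained simulation. This is only a sketch, not Trino code: `InMemoryFileSystem`, `run_attempt`, and the `/t/...` paths are hypothetical names made up for the example. It shows that once a listing is captured and reused, retries keep hitting the deleted file, whereas a fresh listing would not.

```python
class InMemoryFileSystem:
    """Hypothetical stand-in for a real file system (illustration only)."""

    def __init__(self, files):
        self._files = dict(files)

    def list(self, directory):
        # Return every path directly under the given directory prefix.
        return sorted(p for p in self._files if p.startswith(directory + "/"))

    def read(self, path):
        if path not in self._files:
            raise FileNotFoundError(path)
        return self._files[path]

    def delete(self, path):
        del self._files[path]


def run_attempt(fs, listing):
    """Simulate one query attempt: read every file named in the listing."""
    return [fs.read(path) for path in listing]


fs = InMemoryFileSystem({"/t/a": b"1", "/t/b": b"2"})
cached_listing = fs.list("/t")  # listed once, then reused by every retry
fs.delete("/t/b")               # the file disappears after the listing was cached

outcomes = []
for _ in range(3):
    try:
        run_attempt(fs, cached_listing)
        outcomes.append("ok")
    except FileNotFoundError:
        outcomes.append("failed")

# A retry that re-lists the directory only sees the files that still exist.
fresh_result = run_attempt(fs, fs.list("/t"))
```

In this toy model every retry that reuses `cached_listing` fails in the same way, which is the behaviour the comment warns about. How the actual change interacts with retries is not stated in this conversation.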

@findepi findepi requested a review from arhimondr March 25, 2022 10:48
@sopel39 sopel39 requested a review from findinpath March 29, 2022 11:25
Member Author

@sopel39 sopel39 left a comment

ac

@sopel39 sopel39 requested review from findepi and findinpath March 30, 2022 15:01
@sopel39 sopel39 force-pushed the ks/cache_listing branch from f6c63c1 to 2bf8495 Compare April 1, 2022 12:20
@sopel39 sopel39 requested a review from findepi April 1, 2022 12:21
@sopel39 sopel39 force-pushed the ks/cache_listing branch from 2bf8495 to 1b20288 Compare April 1, 2022 12:23
@sopel39 sopel39 force-pushed the ks/cache_listing branch from 1b20288 to 4f8da02 Compare April 1, 2022 12:37
Member

@findepi findepi left a comment

% previous comments

@sopel39 sopel39 force-pushed the ks/cache_listing branch 2 times, most recently from eb63898 to 17d02ce Compare April 1, 2022 16:46
Cache concurrent directory listing when same
table is being scanned during single query
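
As a rough illustration of the idea in the commit message above, the following sketch shares one directory listing among concurrent callers within a single scope (for example, one transaction). It is an assumption-laden model, not the code merged in this pull request: `TransactionScopedListingCache`, `list_files`, and `fake_list` are hypothetical names, and the real implementation lives in Trino's Java code base.

```python
import threading
from concurrent.futures import Future


class TransactionScopedListingCache:
    """Hypothetical cache meant to live for one transaction (illustration only)."""

    def __init__(self, list_directory):
        # list_directory is any callable mapping a path to an iterable of file names.
        self._list_directory = list_directory
        self._lock = threading.Lock()
        self._listings = {}  # path -> Future holding the listing

    def list_files(self, path):
        with self._lock:
            future = self._listings.get(path)
            owner = future is None
            if owner:
                # The first caller for this path registers a Future; later
                # callers wait on it instead of listing the directory again.
                future = Future()
                self._listings[path] = future
        if owner:
            try:
                future.set_result(tuple(self._list_directory(path)))
            except Exception as error:
                # Drop failed entries so a later call can try again.
                with self._lock:
                    self._listings.pop(path, None)
                future.set_exception(error)
        return future.result()


calls = []


def fake_list(path):
    """Hypothetical listing function that records how often it is invoked."""
    calls.append(path)
    return ["part-0.orc", "part-1.orc"]


cache = TransactionScopedListingCache(fake_list)
first = cache.list_files("/warehouse/orders")
second = cache.list_files("/warehouse/orders")
```

Creating a new cache object per transaction and discarding it afterwards keeps the listing from outliving the scope it was made for. That lifetime is a design choice assumed here for the sketch.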

4 participants