Conversation

@flashJd (Contributor) commented Aug 7, 2022

When we run an incremental query on a Hudi table, the query falls back to a full table scan if:

1. files referenced in the metadata have been deleted;
2. we read from the earliest instant;
3. the start commit is archived;
4. the end commit is archived.

In that case, the endInstant parameter in getInputSplits() is the latest instant, which causes the scan to read the latest file slice (which may grow larger as time goes by), open it, and filter the records using instantRange.

Consider a query where read.start-commit is archived and read.end-commit is in the active timeline: today this becomes a full table scan. But we can set the endInstant parameter to read.end-commit instead of the latest instant, so that less data is read. Moreover, if there is an upsert between read.end-commit and the latest instant and we use the latest instant as endInstant, we lose the inserted data between read.start-commit and read.end-commit (the data was upserted, so the original records are missing from the latest instant).

Consider another query where read.start-commit and read.end-commit are both archived: this also becomes a full table scan. If read.end-commit is long gone and has been cleaned, but there is a savepoint after it, we can use that savepoint to incrementally query the table, without caring about the data inserted or upserted after the savepoint.

The core idea is to make the file slice we search for adjacent to read.end-commit.
P.S. This PR is based on #6096.
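The first scenario above (clamping endInstant to read.end-commit instead of always using the latest instant) can be sketched as follows. This is a minimal illustration under stated assumptions, not the actual getInputSplits() code: the helper name and timestamp values are hypothetical; the only real premise is that Hudi instant times are timestamp strings that order lexicographically.

```java
// Hypothetical sketch: since Hudi instant times are timestamp strings
// that compare lexicographically, clamping endInstant to read.end-commit
// reduces to a string comparison.
public class EndInstantChoice {
    static String chooseEndInstant(String readEndCommit, String latestInstant) {
        // Use read.end-commit when it precedes the latest instant, so the
        // scan targets the smaller, older file slice instead of the newest one.
        return readEndCommit.compareTo(latestInstant) <= 0 ? readEndCommit : latestInstant;
    }

    public static void main(String[] args) {
        System.out.println(chooseEndInstant("20220807120000", "20220809090000"));
    }
}
```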

@flashJd (Contributor, Author) commented Aug 7, 2022

@danny0405 can you help review it? Thanks.

@danny0405 (Contributor)

If the end commit is active and is used as the filtering instant for the scan, shouldn't getLatestFileSliceBeforeOrOn work here?

@flashJd (Contributor, Author) commented Aug 8, 2022

> If the end commit is active and is used as the filtering instant for the scan, shouldn't getLatestFileSliceBeforeOrOn work here?

Yes, it works, but it can't handle the case where the end commit is archived: with getLatestFileSliceBeforeOrOn the result may be a savepointed slice, or no file slice at all (already cleaned), which is not correct. getLatestFileSliceAfterOrOn can handle it.

If the end commit is active, getLatestFileSliceAfterOrOn may return either the right file slice or the next one.

If getLatestFileSliceAfterOrOn finds no file slice, we must then search backwards, since there may be an earlier file slice inside the incremental query range (it must be the latest file slice in that file group). We should check whether that file slice is before read.start-commit: if so, we skip it; otherwise we use it. Hence the name getLatestFileSliceAfterOrOnThenBefore.
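The search just described can be sketched like this. It is an illustrative model, not the actual Hudi FileSystemView code: each file slice in a file group is represented only by the base instant time that created it, sorted ascending, and instant times compare lexicographically.

```java
import java.util.Arrays;
import java.util.List;

// Illustrative sketch (all names hypothetical) of the "after-or-on, then
// before" search: prefer the first slice at or after the end commit, then
// fall back to the last slice before it when that slice still overlaps
// the incremental query range.
public class SliceSearch {
    static String afterOrOnThenBefore(List<String> slices, String startCommit, String endCommit) {
        for (String s : slices) {
            if (s.compareTo(endCommit) >= 0) {
                return s;                      // the "after-or-on" branch
            }
        }
        if (slices.isEmpty()) {
            return null;                       // file group has been fully cleaned
        }
        String last = slices.get(slices.size() - 1);
        // the "then before" branch: skip the slice if it precedes read.start-commit,
        // i.e. it lies entirely outside the incremental query range
        return last.compareTo(startCommit) < 0 ? null : last;
    }

    public static void main(String[] args) {
        List<String> slices = Arrays.asList("10", "30", "58");
        System.out.println(afterOrOnThenBefore(slices, "20", "40"));
    }
}
```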

@danny0405 (Contributor) commented Aug 9, 2022

> > If the end commit is active and is used as the filtering instant for the scan, shouldn't getLatestFileSliceBeforeOrOn work here?
>
> Yes, it works, but it can't handle the case where the end commit is archived: with getLatestFileSliceBeforeOrOn the result may be a savepointed slice, or no file slice at all (already cleaned), which is not correct. getLatestFileSliceAfterOrOn can handle it.
>
> If the end commit is active, getLatestFileSliceAfterOrOn may return either the right file slice or the next one.
>
> If getLatestFileSliceAfterOrOn finds no file slice, we must then search backwards, since there may be an earlier file slice inside the incremental query range (it must be the latest file slice in that file group). We should check whether that file slice is before read.start-commit: if so, we skip it; otherwise we use it. Hence the name getLatestFileSliceAfterOrOnThenBefore.

In general, let's not make things complex here. The only right way to do multi-versioning is through the timeline instant: one instant has its counterpart fs view. We should not dig into file slices of different versions for one snapshot query, even if there is a performance gain.

If the start/end commits are both archived, they have very probably been cleaned as well; we have two choices here:

1. read the savepoint commit with a greater timestamp, if one exists
2. read the latest commit

If the start commit is archived but the end commit is active, it is also possible that active instants with timestamps smaller than the end commit have been cleaned. We should check for that before using the end commit as the fs view version filter, which makes things more complex too.

So I would give -1 if this is only an improvement and not a bug fix.

@flashJd (Contributor, Author) commented Aug 9, 2022

> > > If the end commit is active and is used as the filtering instant for the scan, shouldn't getLatestFileSliceBeforeOrOn work here?
> >
> > Yes, it works, but it can't handle the case where the end commit is archived: with getLatestFileSliceBeforeOrOn the result may be a savepointed slice, or no file slice at all (already cleaned), which is not correct. getLatestFileSliceAfterOrOn can handle it.
> > If the end commit is active, getLatestFileSliceAfterOrOn may return either the right file slice or the next one.
> > If getLatestFileSliceAfterOrOn finds no file slice, we must then search backwards, since there may be an earlier file slice inside the incremental query range (it must be the latest file slice in that file group). We should check whether that file slice is before read.start-commit: if so, we skip it; otherwise we use it. Hence the name getLatestFileSliceAfterOrOnThenBefore.
>
> In general, let's not make things complex here. The only right way to do multi-versioning is through the timeline instant: one instant has its counterpart fs view. We should not dig into file slices of different versions for one snapshot query, even if there is a performance gain.
>
> If the start/end commits are both archived, they have very probably been cleaned as well; we have two choices here:
>
> 1. read the savepoint commit with a greater timestamp, if one exists
> 2. read the latest commit
>
> If the start commit is archived but the end commit is active, it is also possible that active instants with timestamps smaller than the end commit have been cleaned. We should check for that before using the end commit as the fs view version filter, which makes things more complex too.
>
> So I would give -1 if this is only an improvement and not a bug fix.

*(image: sketch of the example timeline)*
If we read the latest commit, the upsert between read.end-commit and the latest commit will shadow the insert between read.start-commit and read.end-commit, so the incremental query loses data. Is this a bug or not?

I drew a sketch above: suppose instant35 is a savepoint and we retain 40 commits for cleaning. Given that timeline:

1) if read.end-commit <= instant30, we can use instant35 (the savepoint) for the incremental query
2) if instant30 < read.end-commit <= instant35, we use [slice35, slice58, slice35, slice30] for the query
3) if instant35 < read.end-commit <= instant58, we can use instant60 (the earliest uncleaned instant) for the query
4) if instant58 < read.end-commit <= instant60, we use [slice60, slice61?, slice60, slice30] for the query
5) if read.end-commit > instant60, we keep the original getLatestFileSliceBeforeOrOn logic

As you said, "we should not dig into file slices for different versions for one snapshot query". I understand this to mean we should use one instant's fs view (file-slice combination), not recombine slices as in cases 2) and 4).

Then consider another solution: take the savepoint instant or the earliest uncleaned instant just after read.end-commit, and use that instant as the parameter of getLatestFileSliceBeforeOrOn. The problem is how to easily get the earliest uncleaned instant just after read.end-commit, since we don't have a timeline that is data-active.
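Under the assumptions above, that alternative could look like this. It is a hypothetical helper, not a Hudi API: it presumes the set of savepointed plus uncleaned active instants is already available, which is exactly the open question in the paragraph above.

```java
import java.util.Arrays;
import java.util.Optional;
import java.util.SortedSet;
import java.util.TreeSet;

// Hypothetical sketch: merge savepointed instants with uncleaned active
// instants, then pick the earliest one at or after read.end-commit and
// hand it to getLatestFileSliceBeforeOrOn.
public class QueryInstantResolver {
    static Optional<String> instantForEndCommit(SortedSet<String> savepointsAndUncleaned,
                                                String readEndCommit) {
        // tailSet(x) holds every instant >= x, in timestamp order
        return savepointsAndUncleaned.tailSet(readEndCommit).stream().findFirst();
    }

    public static void main(String[] args) {
        // Candidates from the sketch: savepoint 35, uncleaned instants 60 and 61.
        SortedSet<String> candidates = new TreeSet<>(Arrays.asList("35", "60", "61"));
        System.out.println(instantForEndCommit(candidates, "30").orElse("none"));
    }
}
```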

Can we just align FirstNonSavepointActiveCommit with EarliestNonSavepointUncleanedCommit? That would simplify much of the processing logic.

@hudi-bot (Collaborator) commented Aug 9, 2022

CI report:

Bot commands: @hudi-bot supports the following commands:
  • @hudi-bot run azure: re-run the last Azure build

@flashJd (Contributor, Author) commented Aug 11, 2022

@danny0405 looking forward to your reply.

@yihua added the engine:flink (Flink integration), reader-core, and priority:high (Significant impact; potential bugs) labels on Sep 6, 2022
@github-actions bot added the size:M (PR with lines of changes in (100, 300]) label on Feb 26, 2024
@yihua (Contributor) left a comment


@danny0405 it seems that Hudi 1.0 already solves the problem with completion time and new file slicing logic. Let me know if this is something specific to Hudi on Flink.

@danny0405 (Contributor) commented Sep 11, 2024

Yeah, we have switched to completion-time semantics for Flink since 1.0, here:

`String maxInstantTime = timeline.getInstantsAsStream()`

but Spark still has this issue.

@yihua (Contributor) commented Sep 12, 2024

Makes sense. We have HUDI-8141 and HUDI-7227 to track the fix for incremental queries on Spark. Closing this PR, which targets Flink.


Labels

engine:flink (Flink integration) · priority:high (Significant impact; potential bugs) · size:M (PR with lines of changes in (100, 300])

Projects

Status: 🚧 Needs Repro
Status: ✅ Done


4 participants