@TJX2014 TJX2014 commented Oct 11, 2022

Change Logs

The Flink writer should not consider an aborted instant as a candidate to write to, because:
1. When the stream writer reuses an aborted instant, that instant may be older than the coordinator's current instant.
2. In org.apache.hudi.sink.common.AbstractStreamWriteFunction#instantToWrite, the writer may obtain two different instants (an older one and a newer one) but send only the older one to the coordinator. The coordinator then treats the newer instant as dirty and deletes it, which causes a FileNotFoundException.
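
The difference between the two selection rules can be sketched as follows. This is a minimal, simplified model rather than Hudi's implementation: `InstantSelectionSketch`, `CkpMessage`, `State`, and both method names are hypothetical stand-ins, and the message list is assumed to be ordered oldest to newest.

```java
import java.util.List;

// Hypothetical, simplified model; none of these types are Hudi's real classes.
class InstantSelectionSketch {

  enum State { INFLIGHT, COMPLETED, ABORTED }

  static final class CkpMessage {
    final String instant;
    final State state;

    CkpMessage(String instant, State state) {
      this.instant = instant;
      this.state = state;
    }

    boolean isComplete() { return state == State.COMPLETED; }
    boolean isInflight() { return state == State.INFLIGHT; }
  }

  // Treats every non-completed message as pending, so an aborted instant can be reused.
  static String lastPendingIncludingAborted(List<CkpMessage> messages) {
    for (int i = messages.size() - 1; i >= 0; i--) {
      CkpMessage msg = messages.get(i);
      if (!msg.isComplete()) {
        return msg.instant;
      }
    }
    return null;
  }

  // Accepts only inflight messages; aborted instants are skipped.
  static String lastInflightOnly(List<CkpMessage> messages) {
    for (int i = messages.size() - 1; i >= 0; i--) {
      CkpMessage msg = messages.get(i);
      if (msg.isInflight()) {
        return msg.instant;
      }
    }
    return null;
  }

  public static void main(String[] args) {
    // Oldest to newest. The coordinator is assumed to have started a newer
    // instant "003" whose message the writer has not seen yet.
    List<CkpMessage> seenByWriter = List.of(
        new CkpMessage("001", State.ABORTED),
        new CkpMessage("002", State.COMPLETED));

    System.out.println(lastPendingIncludingAborted(seenByWriter)); // stale "001"
    System.out.println(lastInflightOnly(seenByWriter));            // null: keep polling
  }
}
```

In this scenario the first rule hands the writer the stale instant "001", while the second returns nothing, so the writer would wait until the coordinator's newer instant becomes visible.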

Impact

Bug fix.

Risk level: none | low | medium | high
none

Documentation Update

none

Contributor's checklist

  • Read through contributor's guide
  • Change Logs and Impact were stated clearly
  • Adequate tests were added if applicable
  • CI passed

TJX2014 commented Oct 11, 2022

@danny0405 Please help review this.

if (!ckpMsg.isComplete()) {
  // Do not consider 'aborted' instants: reusing one makes the writer's instant inconsistent with the coordinator's
  if (ckpMsg.isInflight()) {
    return ckpMsg.getInstant();
Contributor

We never abort a checkpoint in the master code now. Can you double-check this logic?

Contributor Author

Thanks. Agreed, the coordinator no longer aborts; I will check the other situations that can produce an aborted instant.

@yihua added the labels writer-core, engine:flink (Flink integration), and priority:critical (Production degraded; pipelines stalled) on Oct 15, 2022
@bvaradar self-assigned this on Feb 18, 2023
@danny0405
Contributor

Closing because this is no longer valid since #7620.
