Skip to content

Conversation

@dongkelun
Copy link
Contributor

Tips

What is the purpose of the pull request

(For example: This pull request adds quick-start document.)

Brief change log

(for example:)

  • Modify AnnotationLocation checkstyle rule in checkstyle.xml

Verify this pull request

(Please pick either of the following options)

This pull request is a trivial rework / code cleanup without any test coverage.

(or)

This pull request is already covered by existing tests, such as (please describe tests).

(or)

This change added tests and can be verified as follows:

(example:)

  • Added integration tests for end-to-end.
  • Added HoodieClientWriteTest to verify the change.
  • Manually verified the change by running a job locally.

Committer checklist

  • Has a corresponding JIRA in PR title & commit

  • Commit message is descriptive of the change

  • CI is green

  • Necessary doc changes done or have another open PR

  • For large changes, please consider breaking it into sub-tasks under an umbrella JIRA.

@dongkelun dongkelun force-pushed the HUDI-3158 branch 3 times, most recently from 1a657de to 08c493b Compare January 5, 2022 15:28
@dongkelun
Copy link
Contributor Author

@xushiyan @nsivabalan hello, can you please take a review?

@xushiyan
Copy link
Member

xushiyan commented Jan 6, 2022

@dongkelun the warn log comes from clustering planning. can you help clarify how would this change avoid the repeated warn logs?

@dongkelun
Copy link
Contributor Author

@dongkelun the warn log comes from clustering planning. can you help clarify how would this change avoid the repeated warn logs?

Hello, the reason for the warning is that the content of replaceCommitRequestedInstantinstant is empty,There are many places to call this method, such as HoodieSparkTable.create:

    if (refreshTimeline) {
      hoodieSparkTable.getHoodieView().sync();
    }

There are also many places to call HoodieSparkTable.create,Therefore, it is not easy to reduce the warning log. It is better to avoid this warning directly from the source.INSERT_OVERWRITE's commitActionType is REPLACE_COMMIT_ACTION,It creates an empty replaceCommitRequestedInstantinstant in the startCommitWithTime method,We can avoid this warning from the source by changing it to non empty

@nsivabalan nsivabalan added the priority:critical Production degraded; pipelines stalled label Jan 7, 2022
@nsivabalan
Copy link
Contributor

@codope : Can you review this PR. May be there could be some follow ups in other places where we do create similar empty replace commit instants. I

Copy link
Member

@codope codope left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@dongkelun Rather than introduce a behavior change, I would prefer to remove the warn log itself. Is there any reason we still want to keep it?
Also, while you're at it, can you make the method getRequestedReplaceMetadata private? I don't see it being used anywhere other than the same class.

@nsivabalan
Copy link
Contributor

I would vote to remote the log.warn msg only(ClusteringUtils L 93)) using a diff patch which we can pull into 0.10.1. we can continue discussions in this patch as to whats the right fix is.

@nsivabalan
Copy link
Contributor

#4553

@xushiyan xushiyan removed the priority:critical Production degraded; pipelines stalled label Jan 11, 2022
@dongkelun
Copy link
Contributor Author

@dongkelun Rather than introduce a behavior change, I would prefer to remove the warn log itself. Is there any reason we still want to keep it? Also, while you're at it, can you make the method getRequestedReplaceMetadata private? I don't see it being used anywhere other than the same class.

OK, I have made the method getRequestedReplaceMetadata in classes MetadataConversionUtils and ClusteringUtils private

@dongkelun
Copy link
Contributor Author

dongkelun commented Jan 11, 2022

@nsivabalan @codope
I found a previous issuse:HUDI-1739 in 'MetadataConversionUtils. getRequestedReplaceMetadata ` is the same as this issuse

#2784

Copy link
Contributor

@nsivabalan nsivabalan left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM.

@nsivabalan
Copy link
Contributor

@xushiyan @codope : if you folks don't have any other comments, I can go ahead and land this.

Copy link
Member

@codope codope left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good. Just one minor comment.

@xushiyan
Copy link
Member

@dongkelun @nsivabalan @codope this PR has pivoted and it's mostly not about warn log any more. Can we file a different ticket tailored to this PR and link to it?

@dongkelun dongkelun changed the title [HUDI-3158] Reduce warn logs in Spark SQL INSERT OVERWRITE [HUDI-3252] Avoid creating empty requestedReplaceCommit in the startCommit method Jan 15, 2022
@dongkelun
Copy link
Contributor Author

@dongkelun @nsivabalan @codope this PR has pivoted and it's mostly not about warn log any more. Can we file a different ticket tailored to this PR and link to it?

I have created a new issue HUDI-3252 and modified the PR link to it. Do you think this description is accurate?

@nsivabalan
Copy link
Contributor

looks good to me. I have closed the previous ticket btw.

@hudi-bot
Copy link
Collaborator

CI report:

Bot commands @hudi-bot supports the following commands:
  • @hudi-bot run azure re-run the last Azure build

@nsivabalan nsivabalan merged commit 20e7983 into apache:master Jan 17, 2022
@vinishjail97 vinishjail97 mentioned this pull request Jan 24, 2022
5 tasks
vingov pushed a commit to vingov/hudi that referenced this pull request Jan 26, 2022
liusenhua pushed a commit to liusenhua/hudi that referenced this pull request Mar 1, 2022
vingov pushed a commit to vingov/hudi that referenced this pull request Apr 3, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants