Skip to content

Conversation

@yihua
Copy link
Contributor

@yihua yihua commented Apr 7, 2022

What is the purpose of the pull request

The Hive sync fails with ClassNotFoundException on a Hudi table if metadata table is enabled on the read path, because hudi-hive-sync-bundle does not package HBase classes which are needed for reading metadata table. This PR fixes hudi-hive-sync-bundle to include HBase dependencies and shading.

Brief change log

(for example:)

Verify this pull request

This pull request is already covered by existing IT tests. The PR is also verified by running the hive sync with hudi-hive-sync-bundle jar in the docker demo setup. The hive sync finishes successfully. From the logs, metadata table files partition can be read without problem after this change, with DEFAULT_METADATA_ENABLE_FOR_READERS flipped to true in HoodieMetadataConfig.

Committer checklist

  • Has a corresponding JIRA in PR title & commit

  • Commit message is descriptive of the change

  • CI is green

  • Necessary doc changes done or have another open PR

  • For large changes, please consider breaking it into sub-tasks under an umbrella JIRA.

@yihua yihua added the priority:blocker Production down; release blocker label Apr 7, 2022
@hudi-bot
Copy link
Collaborator

hudi-bot commented Apr 8, 2022

CI report:

Bot commands @hudi-bot supports the following commands:
  • @hudi-bot run azure re-run the last Azure build

@yihua yihua merged commit 672974c into apache:master Apr 8, 2022
xushiyan pushed a commit that referenced this pull request Apr 14, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

priority:blocker Production down; release blocker

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants