@aditiwari01
Contributor

What is the purpose of the pull request

Fixing Hive getSchema for RT tables.
Refer to issue #2802 for more details.

Verify this pull request:

Unit tests are yet to be explored. I have tested this on local and cluster spark-hive setups.

Committer checklist

  • Has a corresponding JIRA in PR title & commit

  • Commit message is descriptive of the change

  • CI is green

  • Necessary doc changes done or have another open PR

  • For large changes, please consider breaking it into sub-tasks under an umbrella JIRA.

@codope codope changed the title from "[Issue: #2802] Fixing Hive getSchema for RT tables" to "[HUDI-3130] Fixing Hive getSchema for RT tables" on Dec 31, 2021
@nsivabalan nsivabalan changed the title from "[HUDI-3130] Fixing Hive getSchema for RT tables" to "[HUDI-3130] Fixing Hive getSchema for RT tables addressing different partitions having different schemas" on Jan 3, 2022
@nsivabalan nsivabalan added the priority:high Significant impact; potential bugs label Jan 4, 2022
@nsivabalan nsivabalan removed their assignment Jan 6, 2022
@nsivabalan nsivabalan added the priority:critical Production degraded; pipelines stalled label Feb 8, 2022
@nsivabalan nsivabalan force-pushed the hiveSchema branch 2 times, most recently from 5933187 to 2461ceb on February 25, 2022 21:14
@nsivabalan
Contributor

@codope: the patch is ready for review. I have pushed commits to address the feedback.

@nsivabalan nsivabalan added status:triaged Issue has been reviewed and categorized and removed priority:high Significant impact; potential bugs labels Feb 28, 2022
Member

@codope codope left a comment

Looks good. One minor comment. Also, it would be good to add a test for a multi-partitioned table with schema evolution in one of the partitions.

@nsivabalan
Contributor

Sure, I will add a test for schema evolution.

@nsivabalan
Contributor

@codope: I have added a test for schema evolution across partitions. Please take a look.
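
For readers following along, the scenario being tested can be sketched roughly as below. This is only an illustration, not the test that was added: the class name, the `project` helper, and the column names `id`, `name`, and `age` are all hypothetical, and the sketch assumes the reader resolves columns against the latest table-level schema and fills columns missing from an older partition with null, which may differ from what the patch actually does.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration only; none of these names come from Hudi.
final class EvolvedSchemaProjectionSketch {

  // Projects a record onto the table-level field list.
  // Fields the record does not carry (older partition schema) become null.
  static List<Object> project(Map<String, Object> record, List<String> tableFields) {
    List<Object> row = new ArrayList<>();
    for (String field : tableFields) {
      row.add(record.get(field)); // Map.get returns null for an absent key
    }
    return row;
  }

  public static void main(String[] args) {
    // Assumed latest table schema: "age" was added after partition 1 was written.
    List<String> tableFields = Arrays.asList("id", "name", "age");

    Map<String, Object> oldPartitionRecord = new HashMap<>();
    oldPartitionRecord.put("id", 1);
    oldPartitionRecord.put("name", "a");

    Map<String, Object> newPartitionRecord = new HashMap<>();
    newPartitionRecord.put("id", 2);
    newPartitionRecord.put("name", "b");
    newPartitionRecord.put("age", 30);

    System.out.println(project(oldPartitionRecord, tableFields)); // [1, a, null]
    System.out.println(project(newPartitionRecord, tableFields)); // [2, b, 30]
  }
}
```

The point of the sketch is that both partitions yield rows of the same width, even though only one of them was written with the evolved schema.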

@hudi-bot
Collaborator

hudi-bot commented Mar 4, 2022

CI report:

Bot commands

@hudi-bot supports the following commands:
  • @hudi-bot run azure: re-run the last Azure build

@codope codope merged commit 051ad0b into apache:master Mar 6, 2022
   * @throws Exception
   */
  public Schema getTableAvroSchema() throws Exception {
    return getTableAvroSchema(metaClient.getTableConfig().populateMetaFields());
Contributor

@aditiwari01 @codope folks, can you please elaborate why this has been changed? Why are we assuming that Table's schema should have meta-fields?
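
To make the question concrete, here is a minimal sketch of the delegation shown in the hunk above. It uses a hypothetical stand-in class with plain field-name lists instead of Avro `Schema` objects, and the `populateMetaFields` boolean is an assumed simplification of the table-config lookup; it is not Hudi's `TableSchemaResolver`. Only the five meta column names are taken from Hudi itself.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical stand-in for a schema resolver; NOT Hudi's actual class.
final class MetaFieldSchemaSketch {

  // Hudi's record-level meta columns.
  static final List<String> META_FIELDS = Arrays.asList(
      "_hoodie_commit_time",
      "_hoodie_commit_seqno",
      "_hoodie_record_key",
      "_hoodie_partition_path",
      "_hoodie_file_name");

  private final List<String> dataFields;    // user-defined columns
  private final boolean populateMetaFields; // assumed stand-in for the table-config flag

  MetaFieldSchemaSketch(List<String> dataFields, boolean populateMetaFields) {
    this.dataFields = dataFields;
    this.populateMetaFields = populateMetaFields;
  }

  // No-arg overload: defers to the table-level setting, mirroring the hunk above.
  List<String> getTableSchema() {
    return getTableSchema(populateMetaFields);
  }

  // Explicit overload: the caller decides whether meta columns are included.
  List<String> getTableSchema(boolean includeMetaFields) {
    List<String> fields = new ArrayList<>();
    if (includeMetaFields) {
      fields.addAll(META_FIELDS);
    }
    fields.addAll(dataFields);
    return fields;
  }

  public static void main(String[] args) {
    List<String> data = Arrays.asList("id", "name");
    System.out.println(new MetaFieldSchemaSketch(data, true).getTableSchema().size());  // 7
    System.out.println(new MetaFieldSchemaSketch(data, false).getTableSchema().size()); // 2
  }
}
```

Under these assumptions, the no-arg call returns meta columns only when the table is configured to populate them, while callers that need a specific shape can still use the explicit overload.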

vingov pushed a commit to vingov/hudi that referenced this pull request Apr 3, 2022
…partitions having different schemas (apache#4468)

* Fixing Hive getSchema for RT tables

* Addressing feedback

* temp diff

* fixing tests after spark datasource read support for metadata table is merged to master

* Adding multi-partition schema evolution tests to HoodieRealTimeRecordReader

Co-authored-by: Aditya Tiwari <[email protected]>
Co-authored-by: sivabalan <[email protected]>
stayrascal pushed a commit to stayrascal/hudi that referenced this pull request Apr 12, 2022
…partitions having different schemas (apache#4468)
Labels

  • priority:critical (Production degraded; pipelines stalled)
  • status:triaged (Issue has been reviewed and categorized)
