Conversation

@yihua yihua commented Oct 15, 2022

Change Logs

This change applies only to the Hudi Streamer flow. An earlier fix in the Spark streamer layer corrected the handling of defaults and null values, so the Spark Avro schema post processor is no longer required. This PR removes the Spark Avro schema post processor and its corresponding config, which also avoids a schema mismatch for the ENUM type.
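
A rough illustration of how an ENUM mismatch can arise, assuming the post processor round-trips the Avro schema through a Spark schema (that round trip is an assumption here, not something stated in this PR). Spark's Avro support reads an Avro enum as a string, so converting back yields a plain string field. The sketch below is a toy model in plain Python: the mapping tables and the `round_trip` helper are hypothetical stand-ins, not Hudi or Spark APIs.

```python
# Toy model only: plain dicts stand in for Avro and Spark schema types.
# These tables are illustrative, not a real Hudi or Spark API.
AVRO_TO_SPARK = {
    "string": "StringType",
    "int": "IntegerType",
    "long": "LongType",
    "enum": "StringType",  # an Avro enum is read as a string on the Spark side
}
SPARK_TO_AVRO = {
    "StringType": "string",
    "IntegerType": "int",
    "LongType": "long",
}


def avro_kind(field_type):
    """Return the Avro type name for a primitive name or a complex-type dict."""
    return field_type["type"] if isinstance(field_type, dict) else field_type


def round_trip(avro_fields):
    """Convert Avro fields to Spark-style type names and back again."""
    result = []
    for field in avro_fields:
        spark_type = AVRO_TO_SPARK[avro_kind(field["type"])]
        result.append({"name": field["name"], "type": SPARK_TO_AVRO[spark_type]})
    return result


source_fields = [
    {"name": "id", "type": "long"},
    {
        "name": "status",
        "type": {"type": "enum", "name": "Status", "symbols": ["ACTIVE", "INACTIVE"]},
    },
]
processed_fields = round_trip(source_fields)

# Fields whose Avro type changed after the round trip.
mismatched = [
    before["name"]
    for before, after in zip(source_fields, processed_fields)
    if avro_kind(before["type"]) != avro_kind(after["type"])
]
print(mismatched)
```

In this model only the `status` field changes type, because the enum information does not survive the trip through the string type. Skipping the round trip leaves the source schema as provided.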

Impact

Removes redundant processing

Risk level

low

Documentation Update

Config default updates will be automatically done through release docs.

Contributor's checklist

  • Read through contributor's guide
  • Change Logs and Impact were stated clearly
  • Adequate tests were added if applicable
  • CI passed

@yihua yihua added area:schema Schema evolution and data types priority:critical Production degraded; pipelines stalled labels Oct 15, 2022
@nsivabalan nsivabalan added the status:in-progress Work in progress label Nov 2, 2022
@nsivabalan nsivabalan added the release-0.12.2 Patches targetted for 0.12.2 label Dec 6, 2022
@codope codope removed the release-0.12.2 Patches targetted for 0.12.2 label Dec 7, 2022
@nsivabalan nsivabalan added priority:high Significant impact; potential bugs and removed priority:critical Production degraded; pipelines stalled labels Feb 8, 2023
@github-actions github-actions bot added the size:XS PR with lines of changes in <= 10 label Feb 26, 2024
@yihua yihua changed the title [DO NOT MERGE][HUDI-5034] Disable avro schema postprocessor by default [HUDI-5034] Disable Spark avro schema postprocessor by default Sep 11, 2024
@yihua yihua force-pushed the HUDI-5034-disable-schema-postprocessor branch from e3f7b86 to 6c37673 Compare September 11, 2024 19:40
@apache apache deleted a comment from hudi-bot Sep 11, 2024
@yihua yihua removed the status:in-progress Work in progress label Sep 11, 2024
@yihua yihua force-pushed the HUDI-5034-disable-schema-postprocessor branch from 6c37673 to 8e727e3 Compare September 19, 2024 21:37
@github-actions github-actions bot added size:M PR with lines of changes in (100, 300] and removed size:XS PR with lines of changes in <= 10 labels Sep 19, 2024

yihua commented Sep 19, 2024

@hudi-bot run azure

@yihua yihua changed the title [HUDI-5034] Disable Spark avro schema postprocessor by default [HUDI-5034] Remove Spark Avro schema post processor Sep 19, 2024

yihua commented Sep 20, 2024

@hudi-bot run azure

@hudi-bot

CI report:

Bot commands

@hudi-bot supports the following commands:
  • @hudi-bot run azure: re-run the last Azure build

yihua commented Sep 21, 2024

Triggering an update of the Azure CI status in GH checks.

@yihua yihua merged commit 0680539 into apache:master Sep 21, 2024
linliu-code pushed a commit to linliu-code/hudi that referenced this pull request Jan 7, 2026
yihua added a commit that referenced this pull request Jan 8, 2026
Labels

area:schema Schema evolution and data types priority:high Significant impact; potential bugs size:M PR with lines of changes in (100, 300]

Projects

Status: 👤 User Action
Status: ✅ Done

6 participants