Skip to content

Conversation

@alexeykudinkin
Copy link
Contributor

@alexeykudinkin alexeykudinkin commented Aug 10, 2022

Change Logs

This PR cleans up considerable amount of Spark's (internal) resolution logic that has been copied over into Hudi components, while in reality there's no actual need for that.

Changelog: TBA

Issues this will be addressing:

Impact

None

Contributor's checklist

  • Read through contributor's guide
  • Change Logs and Impact were stated clearly
  • Adequate tests were added if applicable
  • CI passed

@alexeykudinkin alexeykudinkin force-pushed the ak/spk-sql-clup branch 4 times, most recently from 8d8cda2 to 24501af Compare August 17, 2022 04:36
@alexeykudinkin alexeykudinkin changed the title [WIP] Cleaning up Hudi custom Spark Rules [HUDI-4503] Cleaning up Hudi custom Spark Rules Aug 19, 2022
@alexeykudinkin alexeykudinkin changed the title [HUDI-4503] Cleaning up Hudi custom Spark Rules [HUDI-4690][HUDI-4503] Cleaning up Hudi custom Spark Rules Aug 22, 2022
@alexeykudinkin alexeykudinkin changed the title [HUDI-4690][HUDI-4503] Cleaning up Hudi custom Spark Rules [WIP][HUDI-4690][HUDI-4503] Cleaning up Hudi custom Spark Rules Aug 23, 2022
@alexeykudinkin alexeykudinkin force-pushed the ak/spk-sql-clup branch 3 times, most recently from 88f855d to d37c994 Compare August 31, 2022 02:11
@alexeykudinkin alexeykudinkin changed the title [WIP][HUDI-4690][HUDI-4503] Cleaning up Hudi custom Spark Rules [WIP][HUDI-4690][HUDI-4503][Stacked on 6550] Cleaning up Hudi custom Spark Rules Aug 31, 2022
@alexeykudinkin alexeykudinkin force-pushed the ak/spk-sql-clup branch 2 times, most recently from bf18e26 to 1f3b814 Compare August 31, 2022 06:07
@yihua yihua added priority:critical Production degraded; pipelines stalled engine:spark Spark integration labels Aug 31, 2022
@alexeykudinkin alexeykudinkin force-pushed the ak/spk-sql-clup branch 5 times, most recently from 533eb9b to 392b7ce Compare September 2, 2022 19:20
@yihua yihua assigned nsivabalan and codope and unassigned nsivabalan Sep 12, 2022
createRdd(df, structName, recordNamespace, toScalaOption(targetSchema))
}

if (latestTableSchema.isPresent && reconcileToLatestSchema) {
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This conditional got removed as useless

Alexey Kudinkin added 2 commits January 6, 2023 09:28
…ly, as otherwise assignments might

get resolved before we do their adaptation;
Handle unresolved plans in `resolveHudiTable`
def customResolutionRules: Seq[RuleBuilder] = {
val rules: ListBuffer[RuleBuilder] = ListBuffer(
// Default rules
session => HoodieResolveReferences(session),
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

All custom resolution logic has been removed (we now rely on Spark to resolve all of its own commands)

Alexey Kudinkin added 3 commits January 6, 2023 23:06
 - Apply to producing side based on whether output contains meta-fields (rather than whether it resolves to Hudi table)
 - Handle producing side in one pass (ie both resolved/unresolved)
@hudi-bot
Copy link
Collaborator

hudi-bot commented Jan 8, 2023

CI report:

Bot commands @hudi-bot supports the following commands:
  • @hudi-bot run azure re-run the last Azure build

@alexeykudinkin
Copy link
Contributor Author

CI is green:

Screenshot 2023-01-08 at 8 07 53 PM

https://dev.azure.com/apache-hudi-ci-org/apache-hudi-ci/_build/results?buildId=14167&view=results

@alexeykudinkin alexeykudinkin added priority:critical Production degraded; pipelines stalled and removed priority:blocker Production down; release blocker labels Jan 12, 2023
@alexeykudinkin alexeykudinkin added priority:blocker Production down; release blocker and removed priority:critical Production degraded; pipelines stalled labels Feb 6, 2023
@alexeykudinkin
Copy link
Contributor Author

Closing this in favor of rebased & squashed #7871

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

area:sql SQL interfaces engine:spark Spark integration priority:blocker Production down; release blocker

Projects

Status: 🏁 Triaged
Archived in project

7 participants