[HUDI-1850][HUDI-3234] Fixing read of a empty table but with failed write #2903

nsivabalan · 2021-04-30T05:22:26Z

What is the purpose of the pull request

*Fixed read of an empty table (failed write) with proper information.

Brief change log

Fixed read of an empty table (failed write) with proper information.

Verify this pull request

Manually verified the change by running a job locally.

Stacktrace before the fix:
find in the attached jira.

Result after the fix:

val df = spark.read.format("hudi").load("/tmp/hudi_trips_cow")
df: org.apache.spark.sql.DataFrame = []

Committer checklist

Has a corresponding JIRA in PR title & commit
Commit message is descriptive of the change
CI is green
Necessary doc changes done or have another open PR
For large changes, please consider breaking it into sub-tasks under an umbrella JIRA.

codecov-commenter · 2021-04-30T10:32:03Z

Codecov Report

Merging #2903 (712b446) into master (16e90d3) will decrease coverage by 20.21%.
The diff coverage is n/a.

@@              Coverage Diff              @@
##             master    #2903       +/-   ##
=============================================
- Coverage     47.62%   27.41%   -20.22%     
+ Complexity     5502     1285     -4217     
=============================================
  Files           930      381      -549     
  Lines         41268    15107    -26161     
  Branches       4137     1304     -2833     
=============================================
- Hits          19655     4141    -15514     
+ Misses        19865    10667     -9198     
+ Partials       1748      299     -1449

Flag	Coverage Δ
hudicli	`?`
hudiclient	`21.05% <ø> (-13.54%)`	⬇️
hudicommon	`?`
hudiflink	`?`
hudihadoopmr	`?`
hudisparkdatasource	`?`
hudisync	`5.28% <ø> (-49.20%)`	⬇️
huditimelineservice	`?`
hudiutilities	`58.60% <ø> (+0.03%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

Impacted Files	Coverage Δ
...main/java/org/apache/hudi/metrics/HoodieGauge.java	`0.00% <0.00%> (-100.00%)`	⬇️
.../org/apache/hudi/hive/NonPartitionedExtractor.java	`0.00% <0.00%> (-100.00%)`	⬇️
.../java/org/apache/hudi/metrics/MetricsReporter.java	`0.00% <0.00%> (-100.00%)`	⬇️
...a/org/apache/hudi/metrics/MetricsReporterType.java	`0.00% <0.00%> (-100.00%)`	⬇️
...rg/apache/hudi/client/bootstrap/BootstrapMode.java	`0.00% <0.00%> (-100.00%)`	⬇️
...he/hudi/hive/HiveStylePartitionValueExtractor.java	`0.00% <0.00%> (-100.00%)`	⬇️
...pache/hudi/client/utils/ConcatenatingIterator.java	`0.00% <0.00%> (-100.00%)`	⬇️
...che/hudi/config/HoodieMetricsPrometheusConfig.java	`0.00% <0.00%> (-100.00%)`	⬇️
.../hudi/execution/bulkinsert/BulkInsertSortMode.java	`0.00% <0.00%> (-100.00%)`	⬇️
...able/action/compact/CompactionTriggerStrategy.java	`0.00% <0.00%> (-100.00%)`	⬇️
... and 617 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 16e90d3...712b446. Read the comment docs.

hudi-common/src/main/java/org/apache/hudi/common/table/TableSchemaResolver.java

hudi-spark-datasource/hudi-spark/src/main/scala/org/apache/hudi/DefaultSource.scala

vinothchandar

I wonder what the right behavior here should be. Should we error out or return an empty dataframe?

hudi-common/src/main/java/org/apache/hudi/common/table/TableSchemaResolver.java

hudi-spark-datasource/hudi-spark/src/main/scala/org/apache/hudi/DefaultSource.scala

nsivabalan · 2021-05-14T23:44:26Z

I wonder what the right behavior here should be. Should we error out or return an empty dataframe?

If someone tries to read a hudi table from a path that does not even exist, we get this exception as of now.

scala> val tripsSnapshotDF = spark.
     |   read.
     |   format("hudi").
     |   load(basePath + "/*/*/*/*")
org.apache.hudi.exception.TableNotFoundException: Hoodie table not found in path Unable to find a hudi table for the user provided paths.
  at org.apache.hudi.DataSourceUtils.getTablePath(DataSourceUtils.java:81)
  at org.apache.hudi.DefaultSource.createRelation(DefaultSource.scala:97)
  at org.apache.hudi.DefaultSource.createRelation(DefaultSource.scala:65)
  at org.apache.spark.sql.execution.datasources.DataSource.resolveRelation(DataSource.scala:318)
  at org.apache.spark.sql.DataFrameReader.loadV1Source(DataFrameReader.scala:223)
  at org.apache.spark.sql.DataFrameReader.load(DataFrameReader.scala:211)
  at org.apache.spark.sql.DataFrameReader.load(DataFrameReader.scala:178)
  ... 64 elided

So, I Guess the current fix should be fine. WDYT.

hudi-spark-datasource/hudi-spark/src/main/scala/org/apache/hudi/DefaultSource.scala

nsivabalan · 2021-06-08T16:55:09Z

my bad. I have no idea why I complicated it so much. Fixed it.

vinothchandar · 2021-06-21T04:56:42Z

...asource/hudi-spark/src/test/scala/org/apache/hudi/functional/HoodieSparkSqlWriterSuite.scala

+      case e: InvalidTableException =>
+        assertTrue(e.getMessage.contains("Invalid Hoodie Table"))
+    } finally {
+      spark.stop()


@nsivabalan why are stopping the spark session here? is it not shared outside of a single test?

yes, we initialize at the beginning of each test. I know we have to fix this entire class for reuse in general. @hmit is working on this refactoring.

nsivabalan · 2021-06-22T20:27:27Z

SQL DML extension tests are failing. I need time to check those out. Will update once I am able to make CI happy.

vinothchandar · 2021-06-27T14:55:57Z

@nsivabalan moving this back to ready for review. Please update when the tests are fixed

nsivabalan · 2021-07-29T19:51:35Z

This patch needs to be redone a bit. Since w/ sql dml, create relation will be called upfront, the empty table check has to be moved to sql dml layer. I will sync up with @pengzhiwei2018 on how to go about this.

nsivabalan · 2021-07-29T21:19:20Z

Guess we can remove the release blocker and critical label. Don't think this is very critical. I understand its nice to have.

pengzhiwei2018 · 2021-07-30T09:22:33Z

This patch needs to be redone a bit. Since w/ sql dml, create relation will be called upfront, the empty table check has to be moved to sql dml layer. I will sync up with @pengzhiwei2018 on how to go about this.

Why should we throw an exception for query empty table? I think return an empty list of rows is more reasonable. When user create table and query the table, it is not friendly to throws an exception. Other data format in spark, like parquet, delta, query empty table also return empty rows.

nsivabalan · 2021-07-30T13:41:24Z

@pengzhiwei2018 : yes, you are right. we should return empty rows. Would you mind taking it up since this involves changes in sql dml. Or I need to look into the code to see where to fit this in. I will reach out to you if I need any help.

nsivabalan · 2022-01-13T00:30:11Z

@xushiyan : patch is good to review.

nsivabalan · 2022-01-18T12:56:20Z

@YannByron : Can you review the patch when you get a chance. should be small one.

YannByron · 2022-01-18T13:28:53Z

@YannByron : Can you review the patch when you get a chance. should be small one.

it's a very simple and forcible solution. There should have been BaseRelation s to think about it and solve it, i think.
But it works. LGTM.

nsivabalan · 2022-01-19T01:48:45Z

@YannByron : there are some failures in sql-dml related tests. TestMergeInfo etc. after we create the table, first merge into fails bcoz, with the proposed fix, we return an empty relation which return NIL schema.
https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_apis/build/builds/5326/logs/23

2022-01-18T15:06:16.2111934Z TestMergeIntoTable:
2022-01-18T15:06:16.3784048Z - Test MergeInto Basic *** FAILED ***
2022-01-18T15:06:16.3785444Z   org.apache.spark.sql.AnalysisException: Cannot resolve 'h0.id in (`s0.id` = `h0.id`), the input columns is: [id#5461, name#5462, price#5463, ts#5464, flag#5465];
2022-01-18T15:06:16.3786516Z   at org.apache.spark.sql.hudi.analysis.HoodieResolveReferences.org$apache$spark$sql$hudi$analysis$HoodieResolveReferences$$resolveExpressionFrom(HoodieAnalysis.scala:387)
2022-01-18T15:06:16.3787449Z   at org.apache.spark.sql.hudi.analysis.HoodieResolveReferences$$anonfun$apply$1.applyOrElse(HoodieAnalysis.scala:200)
2022-01-18T15:06:16.3788267Z   at org.apache.spark.sql.hudi.analysis.HoodieResolveReferences$$anonfun$apply$1.applyOrElse(HoodieAnalysis.scala:122)
2022-01-18T15:06:16.3789124Z   at org.apache.spark.sql.catalyst.plans.logical.AnalysisHelper$$anonfun$resolveOperatorsUp$1$$anonfun$apply$1.apply(AnalysisHelper.scala:90)
2022-01-18T15:06:16.3790035Z   at org.apache.spark.sql.catalyst.plans.logical.AnalysisHelper$$anonfun$resolveOperatorsUp$1$$anonfun$apply$1.apply(AnalysisHelper.scala:90)
2022-01-18T15:06:16.3790833Z   at org.apache.spark.sql.catalyst.trees.CurrentOrigin$.withOrigin(TreeNode.scala:70)
2022-01-18T15:06:16.3791607Z   at org.apache.spark.sql.catalyst.plans.logical.AnalysisHelper$$anonfun$resolveOperatorsUp$1.apply(AnalysisHelper.scala:89)
2022-01-18T15:06:16.3792441Z   at org.apache.spark.sql.catalyst.plans.logical.AnalysisHelper$$anonfun$resolveOperatorsUp$1.apply(AnalysisHelper.scala:86)
2022-01-18T15:06:16.3793269Z   at org.apache.spark.sql.catalyst.plans.logical.AnalysisHelper$.allowInvokingTransformsInAnalyzer(AnalysisHelper.scala:194)
2022-01-18T15:06:16.3794064Z   at org.apache.spark.sql.catalyst.plans.logical.AnalysisHelper$class.resolveOperatorsUp(AnalysisHelper.scala:86)
2022-01-18T15:06:16.3794661Z   ...

So, I guess we might need some fix on sql-dml classes.
Can you help put in a fix for this. Feel free to open up a new PR if need be.

YannByron · 2022-01-19T02:53:28Z

@nsivabalan All the failed DML is for an empty table but with failed write first?
If yes, that's because EmptyRelation returns Empty Schema, and Spark can't resolve the attributes in conditions and assignments.

If the table is created by Spark-SQL, the right schema can be returned correctly. If not, throwing an exception is right and can't fix it.
if you don't mind, i'll submit a pr to your branch.

YannByron · 2022-01-19T03:19:53Z

@nsivabalan
nsivabalan#10

two TODO we need to consider:

maybe unified schema persistence. for now, we have metadata schema, hoodie.table.create.schema just for spark-sql and file(parquet, .log) schema. Need to simplify this.
consider whether all the side effects need to be cleaned up when fail to write at the first time by dataframe. including table info in metastore, and directories and files in filesystem.

nsivabalan · 2022-01-19T15:58:33Z

@YannByron : yeah. feel free to put up a patch with all fixes required. once you have your patch, we can close this one.

good to think about the unified schema. but wondering, for an empty table, does it really require to unify the schemas. why can't sql-dml layer intercept empty table and take action appropriately.
I don't have much context into sql-dml classes. so may be you can help me understand better.

YannByron · 2022-01-20T10:31:15Z

@nsivabalan
you can merge this nsivabalan#10 into your current branch, and re-test.

why can't sql-dml layer intercept empty table and take action appropriately.

�spark-sql also calls the DefaultSource.createRelation to get the schema info and valid file list. For an empty table, with this pr, sql layer can use the EmptyRelation to solve the failure of read this table.

The pr I submitted to your firstWriteFailReadFix branch should fix the sql analysis failure, like org.apache.spark.sql.AnalysisException: Cannot resolve 'h0.id in (s0.id=h0.id), the input columns is: [id#5461, name#5462, price#5463, ts#5464, flag#5465];.

For table created by spark-sql, there is the right schema persisted in hoodie.properties. Even if data fails to be written at the next commit, the schema should be retrieved correctly.

nsivabalan · 2022-01-20T11:44:29Z

awesome, thanks a ton. I will work on it and update the patch.

hudi-bot · 2022-01-20T23:52:05Z

CI report:

ab45189 Azure: SUCCESS

Bot commands

@hudi-bot supports the following commands:

@hudi-bot run azure re-run the last Azure build

nsivabalan · 2022-01-21T06:56:14Z

@YannByron : can you review the patch.

YannByron · 2022-01-22T04:12:03Z

@YannByron : can you review the patch.

LGTM.

…rite (apache#2903)

nsivabalan added the priority:critical Production degraded; pipelines stalled label Apr 30, 2021

leesf reviewed May 1, 2021

View reviewed changes

hudi-common/src/main/java/org/apache/hudi/common/table/TableSchemaResolver.java Outdated Show resolved Hide resolved

leesf reviewed May 1, 2021

View reviewed changes

hudi-common/src/main/java/org/apache/hudi/common/table/TableSchemaResolver.java Outdated Show resolved Hide resolved

leesf reviewed May 1, 2021

View reviewed changes

hudi-spark-datasource/hudi-spark/src/main/scala/org/apache/hudi/DefaultSource.scala Outdated Show resolved Hide resolved

nsivabalan force-pushed the firstWriteFailReadFix branch from 7f8c4ef to fa044aa Compare May 5, 2021 13:15

vinothchandar assigned vinothchandar and leesf May 6, 2021

vinothchandar requested changes May 10, 2021

View reviewed changes

hudi-common/src/main/java/org/apache/hudi/common/table/TableSchemaResolver.java Outdated Show resolved Hide resolved

hudi-spark-datasource/hudi-spark/src/main/scala/org/apache/hudi/DefaultSource.scala Outdated Show resolved Hide resolved

nsivabalan force-pushed the firstWriteFailReadFix branch from 297875e to fcb7ab7 Compare May 14, 2021 23:59

vinothchandar requested changes May 25, 2021

View reviewed changes

hudi-spark-datasource/hudi-spark/src/main/scala/org/apache/hudi/DefaultSource.scala Outdated Show resolved Hide resolved

hudi-spark-datasource/hudi-spark/src/main/scala/org/apache/hudi/DefaultSource.scala Outdated Show resolved Hide resolved

nsivabalan force-pushed the firstWriteFailReadFix branch from fcb7ab7 to f474fba Compare June 8, 2021 16:54

vinothchandar reviewed Jun 21, 2021

View reviewed changes

vinothchandar added the priority:blocker Production down; release blocker label Jun 21, 2021

vinothchandar force-pushed the firstWriteFailReadFix branch from f474fba to 0b48bd5 Compare June 27, 2021 04:54

vinothchandar force-pushed the firstWriteFailReadFix branch from 0b48bd5 to 712b446 Compare July 8, 2021 03:44

nsivabalan force-pushed the firstWriteFailReadFix branch from 712b446 to 2984599 Compare July 29, 2021 19:43

nsivabalan removed the priority:blocker Production down; release blocker label Aug 4, 2021

vinothchandar assigned xushiyan and unassigned vinothchandar and leesf Sep 7, 2021

xushiyan added the pr:author-action label Oct 24, 2021

nsivabalan added the priority:high Significant impact; potential bugs label Jan 6, 2022

nsivabalan changed the title ~~[HUDI-1850] Fixing read of a empty table but with failed write~~ [HUDI-1850][HUDI-3234] Fixing read of a empty table but with failed write Jan 12, 2022

nsivabalan force-pushed the firstWriteFailReadFix branch 3 times, most recently from 83ca285 to fcafe23 Compare January 13, 2022 00:29

nsivabalan force-pushed the firstWriteFailReadFix branch from fcafe23 to 295deba Compare January 18, 2022 12:58

nsivabalan force-pushed the firstWriteFailReadFix branch from 5da3cac to 6fa140c Compare January 18, 2022 23:39

nsivabalan and others added 4 commits January 20, 2022 16:28

Fixing read of a empty table but with failed write

33f2ffe

Fixing read of an empty table with failed first commit

4b3ee63

Fixing build failure

33fb226

Fixing sql dml tests

ab45189

nsivabalan force-pushed the firstWriteFailReadFix branch from f0f4cba to ab45189 Compare January 20, 2022 21:28

nsivabalan merged commit f7a7796 into apache:master Jan 23, 2022

vinishjail97 mentioned this pull request Jan 24, 2022

FixIgnoreKey nsivabalan/hudi#11

Closed

5 tasks

alexeykudinkin pushed a commit to onehouseinc/hudi that referenced this pull request Jan 25, 2022

[HUDI-1850][HUDI-3234] Fixing read of a empty table but with failed w…

c1016ec

…rite (apache#2903)

vingov pushed a commit to vingov/hudi that referenced this pull request Jan 26, 2022

[HUDI-1850][HUDI-3234] Fixing read of a empty table but with failed w…

fbc8b12

…rite (apache#2903)

liusenhua pushed a commit to liusenhua/hudi that referenced this pull request Mar 1, 2022

[HUDI-1850][HUDI-3234] Fixing read of a empty table but with failed w…

51eecfa

…rite (apache#2903)

vingov pushed a commit to vingov/hudi that referenced this pull request Apr 3, 2022

[HUDI-1850][HUDI-3234] Fixing read of a empty table but with failed w…

e84279e

…rite (apache#2903)

[HUDI-1850][HUDI-3234] Fixing read of a empty table but with failed write #2903

[HUDI-1850][HUDI-3234] Fixing read of a empty table but with failed write #2903

Uh oh!

Conversation

nsivabalan commented Apr 30, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What is the purpose of the pull request

Brief change log

Verify this pull request

Committer checklist

Uh oh!

codecov-commenter commented Apr 30, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Uh oh!

Uh oh!

Uh oh!

vinothchandar left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

nsivabalan commented May 14, 2021

Uh oh!

Uh oh!

Uh oh!

nsivabalan commented Jun 8, 2021

Uh oh!

vinothchandar Jun 21, 2021

Choose a reason for hiding this comment

Uh oh!

nsivabalan Jun 22, 2021

Choose a reason for hiding this comment

Uh oh!

nsivabalan commented Jun 22, 2021

Uh oh!

vinothchandar commented Jun 27, 2021

Uh oh!

nsivabalan commented Jul 29, 2021

Uh oh!

nsivabalan commented Jul 29, 2021

Uh oh!

pengzhiwei2018 commented Jul 30, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

nsivabalan commented Jul 30, 2021

Uh oh!

nsivabalan commented Jan 13, 2022

Uh oh!

nsivabalan commented Jan 18, 2022

Uh oh!

YannByron commented Jan 18, 2022

Uh oh!

nsivabalan commented Jan 19, 2022

Uh oh!

YannByron commented Jan 19, 2022

Uh oh!

YannByron commented Jan 19, 2022

Uh oh!

nsivabalan commented Jan 19, 2022

Uh oh!

YannByron commented Jan 20, 2022

Uh oh!

nsivabalan commented Jan 20, 2022

Uh oh!

hudi-bot commented Jan 20, 2022

CI report:

Uh oh!

nsivabalan commented Jan 21, 2022

Uh oh!

YannByron commented Jan 22, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

nsivabalan commented Apr 30, 2021 •

edited

Loading

codecov-commenter commented Apr 30, 2021 •

edited

Loading

pengzhiwei2018 commented Jul 30, 2021 •

edited

Loading