[SPARK-46535][SQL][3.4] Fix NPE when describe extended a column without col stats #48160

saitharun15 · 2024-09-19T06:03:41Z

What changes were proposed in this pull request?

Backport [#44524 ] to 3.4 for [SPARK-46535][SQL] Fix NPE when describe extended a column without col stats

Why are the changes needed?

Currently executing DESCRIBE TABLE EXTENDED a column without col stats with v2 table will throw a null pointer exception.

Cannot invoke "org.apache.spark.sql.connector.read.colstats.ColumnStatistics.min()" because the return value of "scala.Option.get()" is null
java.lang.NullPointerException: Cannot invoke "org.apache.spark.sql.connector.read.colstats.ColumnStatistics.min()" because the return value of "scala.Option.get()" is null
	at org.apache.spark.sql.execution.datasources.v2.DescribeColumnExec.run(DescribeColumnExec.scala:63)
	at org.apache.spark.sql.execution.datasources.v2.V2CommandExec.result$lzycompute(V2CommandExec.scala:43)
	at org.apache.spark.sql.execution.datasources.v2.V2CommandExec.result(V2CommandExec.scala:43)
	at org.apache.spark.sql.execution.datasources.v2.V2CommandExec.executeCollect(V2CommandExec.scala:49)
	at org.apache.spark.sql.execution.QueryExecution$$anonfun$eagerlyExecuteCommands$1.$anonfun$applyOrElse$1(QueryExecution.scala:98)
	at org.apache.spark.sql.execution.SQLExecution$.$anonfun$withNewExecutionId$6(SQLExecution.scala:118)
	at org.apache.spark.sql.execution.SQLExecution$.withSQLConfPropagated(SQLExecution.scala:195)
	at org.apache.spark.sql.execution.SQLExecution$.$anonfun$withNewExecutionId$1(SQLExecution.scala:103)

Does this PR introduce any user-facing change?

How was this patch tested?

Add a new test describe extended (formatted) a column without col stats

Was this patch authored or co-authored using generative AI tooling?

No

…ut col stats

saitharun15 · 2024-09-19T06:07:10Z

@MaxGekk @yaooqinn @LuciferYang @guykhazma , this is a backport to 3.4 for #44524 can u please review this

MaxGekk

@saitharun15 Please, enable GAs in your fork.

LuciferYang · 2024-09-19T07:18:55Z

sql/core/src/test/scala/org/apache/spark/sql/execution/command/v2/DescribeTableSuite.scala

 import org.apache.spark.sql.types.StringType
 import org.apache.spark.util.Utils

+


please remove this empty line.

saitharun15 · 2024-09-19T07:25:17Z

@MaxGekk I have enabled GAs in my fork

dongjoon-hyun

+1, LGTM. Thank you, @saitharun15 , @LuciferYang , @MaxGekk .

Although @saitharun15 needs to setup his fork more properly, we can see that SQL module tests passed in the CIs.

For Scala linter, I verified manually.

$ dev/scalastyle
Using SPARK_LOCAL_IP=localhost
Scalastyle checks passed.

Merged to branch-3.4 for Apache Spark 3.4.4.

…ut col stats ### What changes were proposed in this pull request? Backport [#44524 ] to 3.4 for [[SPARK-46535]](https://issues.apache.org/jira/browse/SPARK-46535)[SQL] Fix NPE when describe extended a column without col stats ### Why are the changes needed? Currently executing DESCRIBE TABLE EXTENDED a column without col stats with v2 table will throw a null pointer exception. ``` Cannot invoke "org.apache.spark.sql.connector.read.colstats.ColumnStatistics.min()" because the return value of "scala.Option.get()" is null java.lang.NullPointerException: Cannot invoke "org.apache.spark.sql.connector.read.colstats.ColumnStatistics.min()" because the return value of "scala.Option.get()" is null at org.apache.spark.sql.execution.datasources.v2.DescribeColumnExec.run(DescribeColumnExec.scala:63) at org.apache.spark.sql.execution.datasources.v2.V2CommandExec.result$lzycompute(V2CommandExec.scala:43) at org.apache.spark.sql.execution.datasources.v2.V2CommandExec.result(V2CommandExec.scala:43) at org.apache.spark.sql.execution.datasources.v2.V2CommandExec.executeCollect(V2CommandExec.scala:49) at org.apache.spark.sql.execution.QueryExecution$$anonfun$eagerlyExecuteCommands$1.$anonfun$applyOrElse$1(QueryExecution.scala:98) at org.apache.spark.sql.execution.SQLExecution$.$anonfun$withNewExecutionId$6(SQLExecution.scala:118) at org.apache.spark.sql.execution.SQLExecution$.withSQLConfPropagated(SQLExecution.scala:195) at org.apache.spark.sql.execution.SQLExecution$.$anonfun$withNewExecutionId$1(SQLExecution.scala:103) ``` ### Does this PR introduce _any_ user-facing change? ### How was this patch tested? Add a new test describe extended (formatted) a column without col stats ### Was this patch authored or co-authored using generative AI tooling? No Closes #48160 from saitharun15/SPARK-46535-branch-3.4. Lead-authored-by: saitharun15 <[email protected]> Co-authored-by: Sai Tharun <[email protected]> Signed-off-by: Dongjoon Hyun <[email protected]>

[SPARK-46535][SQL][3.4] Fix NPE when describe extended a column witho…

1a8bbb4

…ut col stats

github-actions bot added the SQL label Sep 19, 2024

MaxGekk reviewed Sep 19, 2024

View reviewed changes

LuciferYang reviewed Sep 19, 2024

View reviewed changes

remove extra line

988b502

saitharun15 requested a review from MaxGekk September 19, 2024 15:00

dongjoon-hyun approved these changes Sep 19, 2024

View reviewed changes

dongjoon-hyun closed this Sep 19, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[SPARK-46535][SQL][3.4] Fix NPE when describe extended a column without col stats #48160

[SPARK-46535][SQL][3.4] Fix NPE when describe extended a column without col stats #48160

Uh oh!

saitharun15 commented Sep 19, 2024

Uh oh!

saitharun15 commented Sep 19, 2024

Uh oh!

MaxGekk left a comment

Uh oh!

LuciferYang Sep 19, 2024

Uh oh!

saitharun15 Sep 19, 2024

Uh oh!

saitharun15 commented Sep 19, 2024

Uh oh!

dongjoon-hyun left a comment •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

		import org.apache.spark.sql.types.StringType
		import org.apache.spark.util.Utils

[SPARK-46535][SQL][3.4] Fix NPE when describe extended a column without col stats #48160

[SPARK-46535][SQL][3.4] Fix NPE when describe extended a column without col stats #48160

Uh oh!

Conversation

saitharun15 commented Sep 19, 2024

What changes were proposed in this pull request?

Why are the changes needed?

Does this PR introduce any user-facing change?

How was this patch tested?

Was this patch authored or co-authored using generative AI tooling?

Uh oh!

saitharun15 commented Sep 19, 2024

Uh oh!

MaxGekk left a comment

Choose a reason for hiding this comment

Uh oh!

LuciferYang Sep 19, 2024

Choose a reason for hiding this comment

Uh oh!

saitharun15 Sep 19, 2024

Choose a reason for hiding this comment

Uh oh!

saitharun15 commented Sep 19, 2024

Uh oh!

dongjoon-hyun left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

dongjoon-hyun left a comment •

edited

Loading