
Conversation

@rxin
Contributor

rxin commented Aug 9, 2015

There are a few changes in this pull request:

  1. Moved all data sources to execution.datasources, except the public JDBC APIs.
  2. To maintain backward compatibility with (1), added a backward-compatibility translation map to data source resolution (see the sketch after this list).
  3. Moved the ui and metric packages into execution.
  4. Added more documentation on some internal classes.
  5. Renamed DataSourceRegister.format -> shortName (also sketched below).
  6. Added the "override" modifier on shortName.
  7. Removed IntSQLMetric.
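
To make items 2 and 5 concrete, a rough sketch under assumed names (ResolutionSketch, MyRegister, resolveProvider, and the map entries are illustrative, not the exact code from the patch):

  import org.apache.spark.sql.sources.DataSourceRegister

  object ResolutionSketch {
    // Item 2: old (pre-move) provider names are translated to the new
    // execution.datasources locations, so existing user code that passes
    // the old fully qualified names keeps working.
    val backwardCompatibilityMap: Map[String, String] = Map(
      "org.apache.spark.sql.jdbc" ->
        "org.apache.spark.sql.execution.datasources.jdbc.DefaultSource",
      "org.apache.spark.sql.json" ->
        "org.apache.spark.sql.execution.datasources.json.DefaultSource"
    )

    // Resolution first consults the translation map, then falls back to
    // treating the name as a class name or a registered short name.
    def resolveProvider(name: String): String =
      backwardCompatibilityMap.getOrElse(name, name)
  }

  // Item 5: a provider now overrides shortName() -- previously format() --
  // so users can write .format("myformat") instead of the full class name.
  class MyRegister extends DataSourceRegister {
    override def shortName(): String = "myformat"
  }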

@SparkQA

SparkQA commented Aug 9, 2015

Test build #40259 has finished for PR 8056 at commit 122864a.

  • This patch fails MiMa tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@rxin
Contributor Author

rxin commented Aug 9, 2015

cc @zsxwing for review.

@zsxwing
Member

zsxwing commented Aug 9, 2015

@rxin could you remove @VisibleForTesting from SQLListener.scala, because of the issue @shivaram mentioned in #7774 (comment)?

@SparkQA

SparkQA commented Aug 9, 2015

Test build #40263 has finished for PR 8056 at commit 8f4dc20.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@zsxwing
Member

zsxwing commented Aug 9, 2015

Although the SQL UI only displays execution info now, moving the ui package into the execution package doesn't look like a good idea. We may add other information to the SQL tab later, and then we would need to move the ui package back.

@rxin
Contributor Author

rxin commented Aug 9, 2015

@zsxwing removed the VisibleForTesting tag.

We can move the ui package around later -- it's also OK if it contains non-execution stuff. Not that big of a deal...

@SparkQA

SparkQA commented Aug 9, 2015

Test build #40275 has finished for PR 8056 at commit c3a4ba4.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@SparkQA

SparkQA commented Aug 10, 2015

Test build #40280 has finished for PR 8056 at commit 9d83ba2.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@zsxwing
Member

zsxwing commented Aug 10, 2015

The failure is because ParquetIOSuite has a hard-coded name.

@zsxwing
Member

zsxwing commented Aug 10, 2015

Is it intentional to keep DDLSourceLoadSuite.scala and ResolvedDataSourceSuite.scala under org.apache.spark.sql.sources?

@SparkQA

SparkQA commented Aug 10, 2015

Test build #40283 has finished for PR 8056 at commit 3dfc06c.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@SparkQA

SparkQA commented Aug 10, 2015

Test build #1419 has finished for PR 8056 at commit 3dfc06c.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@rxin
Contributor Author

rxin commented Aug 10, 2015

Not intentional -- but not that big of a deal for them to be there since I care more about the public API visibility here.

@SparkQA

SparkQA commented Aug 10, 2015

Test build #40306 has finished for PR 8056 at commit 9df4801.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@rxin
Contributor Author

rxin commented Aug 10, 2015

OK, I'm going to merge this since the remaining failure is from a flaky test.

@rxin
Contributor Author

rxin commented Aug 10, 2015

cc @liancheng for the change also.

asfgit pushed a commit that referenced this pull request Aug 10, 2015
There are a few changes in this pull request:

1. Moved all data sources to execution.datasources, except the public JDBC APIs.
2. In order to maintain backward compatibility from 1, added a backward compatibility translation map in data source resolution.
3. Moved ui and metric package into execution.
4. Added more documentation on some internal classes.
5. Renamed DataSourceRegister.format -> shortName.
6. Added "override" modifier on shortName.
7. Removed IntSQLMetric.

Author: Reynold Xin <[email protected]>

Closes #8056 from rxin/SPARK-9763 and squashes the following commits:

9df4801 [Reynold Xin] Removed hardcoded name in test cases.
d9babc6 [Reynold Xin] Shorten.
e484419 [Reynold Xin] Removed VisibleForTesting.
171b812 [Reynold Xin] MimaExcludes.
2041389 [Reynold Xin] Compile ...
79dda42 [Reynold Xin] Compile.
0818ba3 [Reynold Xin] Removed IntSQLMetric.
c46884f [Reynold Xin] Two more fixes.
f9aa88d [Reynold Xin] [SPARK-9763][SQL] Minimize exposure of internal SQL classes.

(cherry picked from commit 40ed2af)
Signed-off-by: Reynold Xin <[email protected]>
asfgit closed this in 40ed2af Aug 10, 2015
@rama-mullapudi

JdbcUtils.scala has a typo in the schemaString function: the decimal type case has extra closing braces (}), which causes CREATE TABLE with a decimal column to fail, because the statement sent to the database looks like this:

CREATE TABLE foo (TKT_GID DECIMAL(10},0}) NOT NULL)

Below is the code:

  def schemaString(df: DataFrame, url: String): String = {
    ...
    case t: DecimalType => s"DECIMAL(${t.precision}},${t.scale}})"
    ...
  }
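
For reference, in an s-interpolated string a } that follows a ${...} placeholder is emitted as a literal character, which is where the stray braces in the generated DDL come from. A minimal sketch of the fix, dropping the extra braces:

  // Corrected interpolation: emits DECIMAL(precision,scale) with no literal braces.
  case t: DecimalType => s"DECIMAL(${t.precision},${t.scale})"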

@rxin
Contributor Author

rxin commented Aug 15, 2015

@rama-mullapudi thanks. Can you submit a patch to fix that?

I think this PR only moved the code around.

CodingCat pushed a commit to CodingCat/spark that referenced this pull request Aug 17, 2015

  val dataSchema =
    StructType(schema.filterNot(f => partitionColumns.contains(f.name))).asNullable

@rxin Following the discussion on SPARK-9763, I am actually wondering why we convert the StructType with asNullable, which sets all the contained StructFields to be nullable. This will cause problems when a StructField is not allowed to be nullable, but the HadoopFsRelationProvider automatically sets it to be nullable. Is it because all the fields in HadoopFsRelationProvider have to be nullable? Thanks!
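
For context, asNullable recursively marks every field of the schema as nullable. A minimal hand-rolled sketch of the top-level effect (NullableSketch and makeNullable are illustrative helpers, since Spark's asNullable is not part of the public API):

  import org.apache.spark.sql.types._

  object NullableSketch {
    // Every StructField is copied with nullable = true, regardless of what
    // the user declared. (Spark's real asNullable also recurses into nested
    // struct, array, and map types.)
    def makeNullable(schema: StructType): StructType =
      StructType(schema.fields.map(_.copy(nullable = true)))

    val declared = StructType(Seq(StructField("id", IntegerType, nullable = false)))
    val widened = makeNullable(declared) // "id" is now nullable = true
  }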
