Skip to content

Conversation

@cloud-fan
Copy link
Contributor

What changes were proposed in this pull request?

reorganize the packages of DS v2 interfaces/classes:

  1. org.spark.sql.connector.catalog: put TableCatalog, Table and other related interfaces/classes
  2. org.spark.sql.connector.expression: put Expression, Transform and other related interfaces/classes
  3. org.spark.sql.connector.read: put ScanBuilder, Scan and other related interfaces/classes
  4. org.spark.sql.connector.write: put WriteBuilder, BatchWrite and other related interfaces/classes

Why are the changes needed?

Data Source V2 has evolved a lot. It's a bit weird that Expression is in org.spark.sql.catalog.v2 and Table is in org.spark.sql.sources.v2.

Does this PR introduce any user-facing change?

No

How was this patch tested?

existing tests

@SparkQA
Copy link

SparkQA commented Sep 5, 2019

Test build #110192 has finished for PR 25700 at commit 38becbf.

  • This patch fails Scala style tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@cloud-fan
Copy link
Contributor Author

most of the changes are import change.

cc @rdblue @brkyvz @jose-torres @gengliangwang

@SparkQA
Copy link

SparkQA commented Sep 5, 2019

Test build #110193 has finished for PR 25700 at commit 9b8969f.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@SparkQA
Copy link

SparkQA commented Sep 9, 2019

Test build #110350 has finished for PR 25700 at commit 0e28b2d.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

* )
* </pre>
*/
@Experimental
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm okay with this, but we should be more careful to make sure these annotations are included earlier in future PRs.

@rdblue
Copy link
Contributor

rdblue commented Sep 11, 2019

+1

The renames look good to me. As long as there are no added changes, then rebasing and committing this when tests pass is okay with me.

* A mix-in interface for {@link Table} delete support. Data sources can implement this
* interface to provide the ability to delete data from tables that matches filter expressions.
*/
@Experimental
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

do we also need to add the since tags?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

currently none of the DS v2 interfaces have the since tag. We can add them all together later.

* A thread-safe manager for [[CatalogPlugin]]s. It tracks all the registered catalogs, and allow
* the caller to look up a catalog by name.
*/
private[sql]
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

do we not want to make this API public, similar to the catalog implementation we had before?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

yea, CatalogManager is a completely internal stuff.

Copy link
Contributor

@brkyvz brkyvz left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM. Hoping this is the last ever package change for DSV2...

@SparkQA
Copy link

SparkQA commented Sep 11, 2019

Test build #110483 has finished for PR 25700 at commit 188e5ae.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@SparkQA
Copy link

SparkQA commented Sep 12, 2019

Test build #110507 has finished for PR 25700 at commit 7c2f7cc.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@cloud-fan
Copy link
Contributor Author

thanks for the review, merging to master!

@cloud-fan cloud-fan closed this in 053dd85 Sep 12, 2019
PavithraRamachandran pushed a commit to PavithraRamachandran/spark that referenced this pull request Sep 15, 2019
### What changes were proposed in this pull request?

reorganize the packages of DS v2 interfaces/classes:
1. `org.spark.sql.connector.catalog`: put `TableCatalog`, `Table` and other related interfaces/classes
2. `org.spark.sql.connector.expression`: put `Expression`, `Transform` and other related interfaces/classes
3. `org.spark.sql.connector.read`: put `ScanBuilder`, `Scan` and other related interfaces/classes
4. `org.spark.sql.connector.write`: put `WriteBuilder`, `BatchWrite` and other related interfaces/classes

### Why are the changes needed?

Data Source V2 has evolved a lot. It's a bit weird that `Expression` is in `org.spark.sql.catalog.v2` and `Table` is in `org.spark.sql.sources.v2`.

### Does this PR introduce any user-facing change?

No

### How was this patch tested?

existing tests

Closes apache#25700 from cloud-fan/package.

Authored-by: Wenchen Fan <[email protected]>
Signed-off-by: Wenchen Fan <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants