[SPARK-29517][SQL] TRUNCATE TABLE should look up catalog/table like v2 commands #26174

viirya · 2019-10-19T07:01:43Z

What changes were proposed in this pull request?

Add TruncateTableStatement and make TRUNCATE TABLE go through the same catalog/table resolution framework of v2 commands.

Why are the changes needed?

It's important to make all the commands have the same table resolution behavior, to avoid confusing end-users. e.g.

USE my_catalog
DESC t // success and describe the table t from my_catalog
TRUNCATE TABLE t // report table not found as there is no table t in the session catalog

Does this PR introduce any user-facing change?

yes. When running TRUNCATE TABLE, Spark fails the command if the current catalog is set to a v2 catalog, or the table name specified a v2 catalog.

How was this patch tested?

Unit tests.

SparkQA · 2019-10-19T10:41:40Z

Test build #112312 has finished for PR 26174 at commit ae50eec.

This patch passes all tests.
This patch merges cleanly.
This patch adds the following public classes (experimental):
case class TruncateTableStatement(

viirya · 2019-10-19T16:19:32Z

cc @cloud-fan @rdblue @imback82

imback82

LGTM

cloud-fan · 2019-10-21T06:51:54Z

sql/core/src/main/scala/org/apache/spark/sql/catalyst/analysis/ResolveSessionCatalog.scala

        v1TableName.asTableIdentifier,
        "MSCK REPAIR TABLE")
+
+    case TruncateTableStatement(tableName, partitionSpec) =>


shall we add a truncateTable method to TableCatalog? cc @rdblue

No. This is not a catalog operation, it is a table operation. We already have 2 ways to truncate a table: if the table implements SupportsTruncate or SupportsOverwrite. A truncate command should build a write and commit without any commit messages.

We can also add a short-cut trait on Table to avoid the builder.

rdblue · 2019-10-21T20:47:43Z

Looks good to me if we don't want to add the implementation in the same PR.

viirya · 2019-10-21T21:21:18Z

I can add the V2 part in this PR or separate one. @cloud-fan

cloud-fan · 2019-10-22T11:08:06Z

Since it's unclear how to implement a v2 truncate table, let's leave it first.

cloud-fan · 2019-10-22T11:17:41Z

thanks, merging to master!

rdblue · 2019-10-22T16:54:11Z

Since it's unclear how to implement a v2 truncate table, let's leave it first.

How is this unclear? There is a truncate API in v2.

cloud-fan · 2019-10-22T17:05:34Z

I was referring to We can also add a short-cut trait on Table to avoid the builder.

It's unclear if we should submit a job to do truncate, or add a short-cut trait.

rdblue · 2019-10-22T17:08:27Z

Even if we add a short-cut trait, Spark should fall back the the builder method if the table supports it.

TRUNCATE TABLE should do multi-catalog resolution.

ae50eec

dongjoon-hyun added the SQL label Oct 19, 2019

imback82 approved these changes Oct 20, 2019

View reviewed changes

cloud-fan reviewed Oct 21, 2019

View reviewed changes

cloud-fan closed this in b4844ee Oct 22, 2019

viirya deleted the SPARK-29517 branch December 27, 2023 18:37

[SPARK-29517][SQL] TRUNCATE TABLE should look up catalog/table like v2 commands #26174

[SPARK-29517][SQL] TRUNCATE TABLE should look up catalog/table like v2 commands #26174

Uh oh!

Conversation

viirya commented Oct 19, 2019

What changes were proposed in this pull request?

Why are the changes needed?

Does this PR introduce any user-facing change?

How was this patch tested?

Uh oh!

SparkQA commented Oct 19, 2019

Uh oh!

viirya commented Oct 19, 2019

Uh oh!

imback82 left a comment

Choose a reason for hiding this comment

Uh oh!

cloud-fan Oct 21, 2019

Choose a reason for hiding this comment

Uh oh!

rdblue Oct 21, 2019

Choose a reason for hiding this comment

Uh oh!

rdblue commented Oct 21, 2019

Uh oh!

viirya commented Oct 21, 2019

Uh oh!

cloud-fan commented Oct 22, 2019

Uh oh!

cloud-fan commented Oct 22, 2019

Uh oh!

rdblue commented Oct 22, 2019

Uh oh!

cloud-fan commented Oct 22, 2019

Uh oh!

rdblue commented Oct 22, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants