[SPARK-19970][SQL][BRANCH-2.1] Table owner should be USER instead of PRINCIPAL in kerberized clusters #17363

dongjoon-hyun · 2017-03-20T17:41:02Z

What changes were proposed in this pull request?

In the kerberized hadoop cluster, when Spark creates tables, the owner of tables are filled with PRINCIPAL strings instead of USER names. This is inconsistent with Hive and causes problems when using ROLE in Hive. We had better to fix this.

BEFORE

scala> sql("create table t(a int)").show
scala> sql("desc formatted t").show(false)
...
|Owner:                      |spark@EXAMPLE.COM                                         |       |

AFTER

scala> sql("create table t(a int)").show
scala> sql("desc formatted t").show(false)
...
|Owner:                      |spark                                         |       |

How was this patch tested?

Manually do create table and desc formatted because this happens in Kerberized clusters.

…PRINCIPAL in kerberized clusters

dongjoon-hyun · 2017-03-20T18:03:04Z

sql/hive/src/main/scala/org/apache/spark/sql/hive/client/HiveClientImpl.scala

    }
    hiveTable.setPartCols(partCols.asJava)
-    hiveTable.setOwner(conf.getUser)
+    hiveTable.setOwner(state.getAuthenticator().getUserName())


@vanzin . I made a backport for branch-2.1 of #17311 . This one uses state as you advised.

SparkQA · 2017-03-20T19:20:55Z

Test build #74891 has finished for PR 17363 at commit 1328f1d.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

dongjoon-hyun · 2017-03-20T19:42:08Z

This is also tested manually in kerberized cluster, @vanzin .

BTW, Spark 1.6 has the same issue at ClientWrapper.

According to the Spark dev email, there exists demands on more Apache Spark 1.6.X. May I create a backport for branch-1.6? How do you think about that?

For Spark 1.6, it happens only with CREATE TABLE .. AS SELECT statement.

dongjoon-hyun · 2017-03-22T04:56:03Z

Hi, @vanzin .
Could you review this backport when you have some time?

dongjoon-hyun · 2017-03-22T18:00:06Z

Retest this please

SparkQA · 2017-03-22T19:40:06Z

Test build #75055 has finished for PR 17363 at commit 1328f1d.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

vanzin · 2017-03-23T21:55:07Z

Merging to 2.1 / 2.0.

I don't really expect any more 1.6 releases at this point, so I wouldn't bother.

…PRINCIPAL in kerberized clusters ## What changes were proposed in this pull request? In the kerberized hadoop cluster, when Spark creates tables, the owner of tables are filled with PRINCIPAL strings instead of USER names. This is inconsistent with Hive and causes problems when using [ROLE](https://cwiki.apache.org/confluence/display/Hive/SQL+Standard+Based+Hive+Authorization) in Hive. We had better to fix this. **BEFORE** ```scala scala> sql("create table t(a int)").show scala> sql("desc formatted t").show(false) ... |Owner: |sparkEXAMPLE.COM | | ``` **AFTER** ```scala scala> sql("create table t(a int)").show scala> sql("desc formatted t").show(false) ... |Owner: |spark | | ``` ## How was this patch tested? Manually do `create table` and `desc formatted` because this happens in Kerberized clusters. Author: Dongjoon Hyun <[email protected]> Closes #17363 from dongjoon-hyun/SPARK-19970-2.

vanzin · 2017-03-23T21:56:05Z

Failed to merge to 2.0. :-/

vanzin · 2017-03-23T21:56:19Z

@dongjoon-hyun you'll have to manually close this.

dongjoon-hyun · 2017-03-23T22:25:55Z

Oo. Thank you so much, @vanzin !

[SPARK-19970][SQL][BRANCH-2.1] Table owner should be USER instead of …

1328f1d

…PRINCIPAL in kerberized clusters

dongjoon-hyun commented Mar 20, 2017

View reviewed changes

dongjoon-hyun closed this Mar 23, 2017

dongjoon-hyun mentioned this pull request Mar 23, 2017

[SPARK-19970][SQL][BRANCH-1.6] Table owner should be USER instead of PRINCIPAL in kerberized clusters #17366

Closed

dongjoon-hyun deleted the SPARK-19970-2 branch January 7, 2019 07:03

HyukjinKwon mentioned this pull request Feb 20, 2019

[SPARK-26929][SQL]fix table owner use user instead of principal when create table through spark-sql or beeline #23837

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[SPARK-19970][SQL][BRANCH-2.1] Table owner should be USER instead of PRINCIPAL in kerberized clusters #17363

[SPARK-19970][SQL][BRANCH-2.1] Table owner should be USER instead of PRINCIPAL in kerberized clusters #17363

Uh oh!

dongjoon-hyun commented Mar 20, 2017

Uh oh!

dongjoon-hyun Mar 20, 2017 •

edited

Loading

Uh oh!

SparkQA commented Mar 20, 2017

Uh oh!

dongjoon-hyun commented Mar 20, 2017 •

edited

Loading

Uh oh!

dongjoon-hyun commented Mar 22, 2017

Uh oh!

dongjoon-hyun commented Mar 22, 2017

Uh oh!

SparkQA commented Mar 22, 2017

Uh oh!

vanzin commented Mar 23, 2017

Uh oh!

vanzin commented Mar 23, 2017

Uh oh!

vanzin commented Mar 23, 2017

Uh oh!

dongjoon-hyun commented Mar 23, 2017

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[SPARK-19970][SQL][BRANCH-2.1] Table owner should be USER instead of PRINCIPAL in kerberized clusters #17363

[SPARK-19970][SQL][BRANCH-2.1] Table owner should be USER instead of PRINCIPAL in kerberized clusters #17363

Uh oh!

Conversation

dongjoon-hyun commented Mar 20, 2017

What changes were proposed in this pull request?

How was this patch tested?

Uh oh!

dongjoon-hyun Mar 20, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

SparkQA commented Mar 20, 2017

Uh oh!

dongjoon-hyun commented Mar 20, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dongjoon-hyun commented Mar 22, 2017

Uh oh!

dongjoon-hyun commented Mar 22, 2017

Uh oh!

SparkQA commented Mar 22, 2017

Uh oh!

vanzin commented Mar 23, 2017

Uh oh!

vanzin commented Mar 23, 2017

Uh oh!

vanzin commented Mar 23, 2017

Uh oh!

dongjoon-hyun commented Mar 23, 2017

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

dongjoon-hyun Mar 20, 2017 •

edited

Loading

dongjoon-hyun commented Mar 20, 2017 •

edited

Loading