[MINOR][DOC] Add note regarding proper usage of QueryExecution.toRdd #23822

HeartSaVioR · 2019-02-18T08:37:52Z

What changes were proposed in this pull request?

This PR proposes adding a note to `QueryExecution.toRdd` about an internal Spark optimization that callers need to be aware of.

How was this patch tested?

This patch is a documentation change.

SparkQA · 2019-02-18T11:57:59Z

Test build #102462 has finished for PR 23822 at commit 493b2cb.

This patch fails Spark unit tests.
This patch merges cleanly.
This patch adds no public classes.

HeartSaVioR · 2019-02-18T12:00:46Z

Retest this, please

HeartSaVioR · 2019-02-18T12:02:02Z

cc. @cloud-fan

cloud-fan · 2019-02-18T12:09:34Z

LGTM

SparkQA · 2019-02-18T15:11:38Z

Test build #102469 has finished for PR 23822 at commit 493b2cb.

This patch fails Spark unit tests.
This patch merges cleanly.
This patch adds no public classes.

HyukjinKwon · 2019-02-18T16:06:08Z

retest this please

dilipbiswal · 2019-02-18T19:43:04Z

sql/core/src/main/scala/org/apache/spark/sql/execution/QueryExecution.scala

+   * accessing after iteration. (Calling `collect()` is one of known bad usage.)
+   * If you want to store these rows into collection, please apply some converter or copy row
+   * which produces new object per iteration.
+   */


@HeartSaVioR Should we point users to the `Dataset.rdd` method, where the conversion is already applied?

Yeah, that's a good suggestion for end users (not Spark developers). Will add.

BTW, I don't think it's an API, though, so technically we don't have to worry about end users.
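For readers following the thread, the pitfall described in the note can be sketched without Spark. The snippet below is a minimal illustration, assuming a hypothetical `MutableRow` class and `RowReuseDemo` object that stand in for a row object reused across iterations; neither is part of Spark. It shows why storing references from such an iterator goes wrong and why copying each row avoids the problem, which is the idea behind the "apply some converter or copy row" advice in the doc comment.

```scala
// Hypothetical stand-in for a row that gets reused; this is NOT Spark's InternalRow.
final class MutableRow(var value: Int) {
  // Produces an independent object holding the current value.
  def copy(): MutableRow = new MutableRow(value)
}

object RowReuseDemo {
  // Returns the SAME MutableRow instance for every element, overwriting its field each time.
  def reusingIterator(values: Seq[Int]): Iterator[MutableRow] = {
    val shared = new MutableRow(0)
    values.iterator.map { v =>
      shared.value = v
      shared
    }
  }

  // Problematic: the list holds N references to one object, so every entry shows the last value.
  def collectWithoutCopy(values: Seq[Int]): Seq[Int] =
    reusingIterator(values).toList.map(_.value)

  // Safe: each row is copied while it is still current, before the iterator advances.
  def collectWithCopy(values: Seq[Int]): Seq[Int] =
    reusingIterator(values).map(_.copy()).toList.map(_.value)

  def main(args: Array[String]): Unit = {
    println(collectWithoutCopy(Seq(1, 2, 3))) // List(3, 3, 3)
    println(collectWithCopy(Seq(1, 2, 3)))    // List(1, 2, 3)
  }
}
```

In Spark itself the analogous safe pattern would presumably be to copy each row before storing it, or to use `Dataset.rdd` as suggested above; the sketch only models the object-reuse behavior and does not reproduce Spark's actual implementation.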

SparkQA · 2019-02-18T20:16:50Z

Test build #102484 has finished for PR 23822 at commit 493b2cb.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

dilipbiswal · 2019-02-18T21:06:06Z

sql/core/src/main/scala/org/apache/spark/sql/execution/QueryExecution.scala

   * If you want to store these rows into collection, please apply some converter or copy row
   * which produces new object per iteration.
+   * Given QueryExecution is not a public class, end users are discouraged to use this: please
+   * user `Dataset.rdd` instead which conversion will be applied.


user -> use
which -> in which
or
which -> where ?

Nice finding. Applied.

dilipbiswal · 2019-02-18T21:06:43Z

LGTM with a very minor comment.

SparkQA · 2019-02-19T00:41:24Z

Test build #102490 has finished for PR 23822 at commit 2f224f2.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

HyukjinKwon · 2019-02-19T01:41:33Z

Merged to master.

[MINOR][DOC] Add note regarding proper usage of QueryExecution.toRdd

493b2cb

HyukjinKwon approved these changes Feb 18, 2019

dilipbiswal reviewed Feb 18, 2019

Add more guide for end users

2f224f2

dilipbiswal reviewed Feb 18, 2019

Minor correction on typo/grammar

d68dcf2

viirya approved these changes Feb 19, 2019

HyukjinKwon closed this in 865c88f Feb 19, 2019
