[MINOR][PySpark][DOCS] Fix examples in PySpark documentation #15242

HyukjinKwon · 2016-09-26T12:35:40Z

What changes were proposed in this pull request?

This PR proposes to fix wrongly indented examples in PySpark documentation

-        >>> json_sdf = spark.readStream.format("json")\
-                                       .schema(sdf_schema)\
-                                       .load(tempfile.mkdtemp())
+        >>> json_sdf = spark.readStream.format("json") \\
+        ...     .schema(sdf_schema) \\
+        ...     .load(tempfile.mkdtemp())

-        people.filter(people.age > 30).join(department, people.deptId == department.id)\
+        people.filter(people.age > 30).join(department, people.deptId == department.id) \\

-        >>> examples = [LabeledPoint(1.1, Vectors.sparse(3, [(0, 1.23), (2, 4.56)])), \
-                        LabeledPoint(0.0, Vectors.dense([1.01, 2.02, 3.03]))]
+        >>> examples = [LabeledPoint(1.1, Vectors.sparse(3, [(0, 1.23), (2, 4.56)])),
+        ...             LabeledPoint(0.0, Vectors.dense([1.01, 2.02, 3.03]))]

-        >>> examples = [LabeledPoint(1.1, Vectors.sparse(3, [(0, -1.23), (2, 4.56e-7)])), \
-                        LabeledPoint(0.0, Vectors.dense([1.01, 2.02, 3.03]))]
+        >>> examples = [LabeledPoint(1.1, Vectors.sparse(3, [(0, -1.23), (2, 4.56e-7)])),
+        ...             LabeledPoint(0.0, Vectors.dense([1.01, 2.02, 3.03]))]

-        ...      for x in iterator:
-        ...           print(x)
+        ...     for x in iterator:
+        ...          print(x)

How was this patch tested?

Manually tested.

Before

After

HyukjinKwon · 2016-09-26T12:36:58Z

Could you take a look @srowen ? I made this change before across PySpark documentation but it seems I missed one before and then another one is introduced somewhere after my previous PR.

srowen · 2016-09-26T12:37:14Z

python/pyspark/sql/streaming.py

-        >>> json_sdf = spark.readStream.format("json")\
-                                       .schema(sdf_schema)\
-                                       .load(tempfile.mkdtemp())
+        >>> json_sdf = spark.readStream.format("json") \\


I see, does the \ vs \ make the difference here? are there possibly other instances of this?

Yes, it seems it does. I took a look and a sweep before - #14063

I will take another look across the documentation again tomorrow.

SparkQA · 2016-09-26T13:00:20Z

Test build #65911 has finished for PR 15242 at commit 9ac3088.

This patch fails PySpark unit tests.
This patch merges cleanly.
This patch adds no public classes.

HyukjinKwon · 2016-09-26T13:01:56Z

retest this please

SparkQA · 2016-09-26T13:31:42Z

Test build #65915 has finished for PR 15242 at commit 9ac3088.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

HyukjinKwon · 2016-09-27T05:22:19Z

@srowen I just took a scan twice and I think they should be all.

HyukjinKwon · 2016-09-27T05:22:51Z

Let me update the PR description just in case.

SparkQA · 2016-09-27T05:52:30Z

Test build #65947 has finished for PR 15242 at commit 88c1069.

This patch fails PySpark unit tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2016-09-27T06:41:21Z

Test build #65949 has finished for PR 15242 at commit 85cda01.

This patch fails PySpark unit tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2016-09-27T07:43:24Z

Test build #65951 has finished for PR 15242 at commit 6de11ec.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

HyukjinKwon · 2016-09-27T08:11:23Z

(PR description is updated again)

srowen · 2016-09-28T10:19:31Z

Merged to master/2.0

## What changes were proposed in this pull request? This PR proposes to fix wrongly indented examples in PySpark documentation ``` - >>> json_sdf = spark.readStream.format("json")\ - .schema(sdf_schema)\ - .load(tempfile.mkdtemp()) + >>> json_sdf = spark.readStream.format("json") \\ + ... .schema(sdf_schema) \\ + ... .load(tempfile.mkdtemp()) ``` ``` - people.filter(people.age > 30).join(department, people.deptId == department.id)\ + people.filter(people.age > 30).join(department, people.deptId == department.id) \\ ``` ``` - >>> examples = [LabeledPoint(1.1, Vectors.sparse(3, [(0, 1.23), (2, 4.56)])), \ - LabeledPoint(0.0, Vectors.dense([1.01, 2.02, 3.03]))] + >>> examples = [LabeledPoint(1.1, Vectors.sparse(3, [(0, 1.23), (2, 4.56)])), + ... LabeledPoint(0.0, Vectors.dense([1.01, 2.02, 3.03]))] ``` ``` - >>> examples = [LabeledPoint(1.1, Vectors.sparse(3, [(0, -1.23), (2, 4.56e-7)])), \ - LabeledPoint(0.0, Vectors.dense([1.01, 2.02, 3.03]))] + >>> examples = [LabeledPoint(1.1, Vectors.sparse(3, [(0, -1.23), (2, 4.56e-7)])), + ... LabeledPoint(0.0, Vectors.dense([1.01, 2.02, 3.03]))] ``` ``` - ... for x in iterator: - ... print(x) + ... for x in iterator: + ... print(x) ``` ## How was this patch tested? Manually tested. **Before** ![2016-09-26 8 36 02](https://cloud.githubusercontent.com/assets/6477701/18834471/05c7a478-8431-11e6-94bb-09aa37b12ddb.png) ![2016-09-26 9 22 16](https://cloud.githubusercontent.com/assets/6477701/18834472/06c8735c-8431-11e6-8775-78631eab0411.png) <img width="601" alt="2016-09-27 2 29 27" src="https://cloud.githubusercontent.com/assets/6477701/18861294/29c0d5b4-84bf-11e6-99c5-3c9d913c125d.png"> <img width="1056" alt="2016-09-27 2 29 58" src="https://cloud.githubusercontent.com/assets/6477701/18861298/31694cd8-84bf-11e6-9e61-9888cb8c2089.png"> <img width="1079" alt="2016-09-27 2 30 05" src="https://cloud.githubusercontent.com/assets/6477701/18861301/359722da-84bf-11e6-97f9-5f5365582d14.png"> **After** ![2016-09-26 9 29 47](https://cloud.githubusercontent.com/assets/6477701/18834467/0367f9da-8431-11e6-86d9-a490d3297339.png) ![2016-09-26 9 30 24](https://cloud.githubusercontent.com/assets/6477701/18834463/f870fae0-8430-11e6-9482-01fc47898492.png) <img width="515" alt="2016-09-27 2 28 19" src="https://cloud.githubusercontent.com/assets/6477701/18861305/3ff88b88-84bf-11e6-902c-9f725e8a8b10.png"> <img width="652" alt="2016-09-27 3 50 59" src="https://cloud.githubusercontent.com/assets/6477701/18863053/592fbc74-84ca-11e6-8dbf-99cf57947de8.png"> <img width="709" alt="2016-09-27 3 51 03" src="https://cloud.githubusercontent.com/assets/6477701/18863060/601607be-84ca-11e6-80aa-a401df41c321.png"> Author: hyukjinkwon <[email protected]> Closes #15242 from HyukjinKwon/minor-example-pyspark. (cherry picked from commit 2190037) Signed-off-by: Sean Owen <[email protected]>

Fix examples in PyPpark documentation

9ac3088

srowen reviewed Sep 26, 2016

View reviewed changes

HyukjinKwon changed the title ~~[MINOR][PySpark] Fix examples in PySpark documentation~~ [MINOR][PySpark][DOC] Fix examples in PySpark documentation Sep 26, 2016

HyukjinKwon changed the title ~~[MINOR][PySpark][DOC] Fix examples in PySpark documentation~~ [MINOR][PySpark][DOCS] Fix examples in PySpark documentation Sep 26, 2016

HyukjinKwon added 2 commits September 27, 2016 14:17

Fix indentation in foreachPartition example

3a14788

Fix another example in mllib

88c1069

Remove \\

85cda01

Add missing ...

6de11ec

asfgit closed this in 2190037 Sep 28, 2016

HyukjinKwon deleted the minor-example-pyspark branch January 2, 2018 03:39

[MINOR][PySpark][DOCS] Fix examples in PySpark documentation #15242

[MINOR][PySpark][DOCS] Fix examples in PySpark documentation #15242

Uh oh!

Conversation

HyukjinKwon commented Sep 26, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What changes were proposed in this pull request?

How was this patch tested?

Uh oh!

HyukjinKwon commented Sep 26, 2016

Uh oh!

srowen Sep 26, 2016

Choose a reason for hiding this comment

Uh oh!

HyukjinKwon Sep 26, 2016

Choose a reason for hiding this comment

Uh oh!

SparkQA commented Sep 26, 2016

Uh oh!

HyukjinKwon commented Sep 26, 2016

Uh oh!

SparkQA commented Sep 26, 2016

Uh oh!

HyukjinKwon commented Sep 27, 2016

Uh oh!

HyukjinKwon commented Sep 27, 2016

Uh oh!

SparkQA commented Sep 27, 2016

Uh oh!

SparkQA commented Sep 27, 2016

Uh oh!

SparkQA commented Sep 27, 2016

Uh oh!

HyukjinKwon commented Sep 27, 2016

Uh oh!

srowen commented Sep 28, 2016

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

HyukjinKwon commented Sep 26, 2016 •

edited

Loading