[SPARK-46135][PYTHON][DOCS] Fix table format error in ipynb docs #44049

panbingkun · 2023-11-28T07:26:12Z

What changes were proposed in this pull request?

1. After PR #44012, some tables rendered as HTML in the `ipynb` docs are displayed with broken formatting. This PR fixes those table-format errors.

Before:
After:

2. Fix some minor errors.

Why are the changes needed?

Fix bug.

Does this PR introduce any user-facing change?

Yes, only for docs.

How was this patch tested?

Manually tested.
Passes GitHub Actions (GA).

Was this patch authored or co-authored using generative AI tooling?

No.

panbingkun · 2023-11-28T07:36:47Z

python/docs/source/getting_started/quickstart_connect.ipynb

   "outputs": [],
   "source": [
-    "!$HOME/sbin/start-connect-server.sh --packages org.apache.spark:spark-connect_2.12:$SPARK_VERSION"
+    "!$HOME/sbin/start-connect-server.sh --packages org.apache.spark:spark-connect_2.13:$SPARK_VERSION"


By the way, I made this change because our Scala version is now 2.13; 2.12 is no longer supported.
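For reference, the coordinate passed to `--packages` encodes the Scala binary version as a suffix of the artifact name, so that suffix is the only part that changes here. A minimal sketch of how the string is put together; `connect_package` is a hypothetical helper used only for illustration, and `$SPARK_VERSION` is left as the unexpanded shell variable from the notebook:

```python
def connect_package(scala_binary_version: str, spark_version: str) -> str:
    """Build the Maven coordinate for the Spark Connect package.

    Hypothetical helper for illustration only; it is not part of PySpark.
    """
    return f"org.apache.spark:spark-connect_{scala_binary_version}:{spark_version}"


# "$SPARK_VERSION" is kept as a literal placeholder, as in the notebook cell.
coordinate = connect_package("2.13", "$SPARK_VERSION")
print(coordinate)
```

With `"2.12"` as the first argument the same helper yields the old coordinate, which is the one the diff above removes.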

panbingkun · 2023-11-28T07:37:17Z

cc @itholic @HyukjinKwon

panbingkun · 2023-11-28T07:39:43Z

python/docs/source/getting_started/quickstart_df.ipynb

-     "output_type": "execute_result"
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [


Let's present the output results in text format instead of text/html format to avoid formatting errors.
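To make the diff above concrete: in the notebook JSON, an `execute_result` output carries its renderings under `data` (for example `text/html` and `text/plain`), while a `stream` output carries plain lines under `text`. The sketch below shows one way such a conversion could be done; `to_stream_output` and the sample output are hypothetical, and the notebooks in this PR may simply have been edited by hand:

```python
def to_stream_output(output: dict) -> dict:
    """Turn an execute_result output into a stdout stream output.

    Hypothetical helper: it reuses the text/plain rendering and drops text/html.
    Outputs of any other type are returned unchanged.
    """
    if output.get("output_type") != "execute_result":
        return output
    plain = output.get("data", {}).get("text/plain", [])
    return {"name": "stdout", "output_type": "stream", "text": plain}


# Made-up sample output, shaped like a notebook (nbformat v4) execute_result.
sample_output = {
    "output_type": "execute_result",
    "execution_count": 1,
    "metadata": {},
    "data": {
        "text/html": ["<table border=\"1\" class=\"dataframe\">\n", "</table>"],
        "text/plain": ["   a\n", "0  1"],
    },
}

converted = to_stream_output(sample_output)
print(converted)
```

The trade-off is that the HTML table is lost, so the cell no longer shows the rich rendering.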

Hm, this example should show the output nicely as spark.sql.repl.eagerEval.enabled is enabled. Wonder if we can fix the docs instead.

@HyukjinKwon Is the following presentation style appropriate for this special case?

Dark theme:

Light theme:

HyukjinKwon · 2023-11-29T00:33:42Z

python/docs/source/getting_started/quickstart_df.ipynb

   "source": [
-    "df.toPandas()"
+    "from tabulate import tabulate\n",
+    "print(tabulate(df.toPandas(), headers = 'keys', tablefmt = 'psql'))"


Hm, the output format looks fine but the whole point of using spark.sql.repl.eagerEval.enabled is to show a pretty table format without applying any other operations in the notebook.

Can you maybe just manually fix the output text/html to be compatible with both the sphinx dark theme and jupyter notebook?
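A manual fix of this kind could also be scripted. The sketch below walks a notebook's JSON and adds an inline `style` to each pandas-style `<table>` tag in the stored `text/html` outputs. The style string is the one from this PR's quickstart_ps.ipynb diff; `patch_notebook_tables` and the sample notebook are hypothetical, and whether this style is sufficient for every theme is not established here:

```python
TABLE_OPEN = '<table border="1" class="dataframe">'
TABLE_STYLED = (
    '<table border="1" class="dataframe" '
    'style="table-layout: auto;margin-right: auto;margin-left: 0;">'
)


def patch_notebook_tables(notebook: dict) -> int:
    """Add the inline style to table tags in text/html outputs, in place.

    Hypothetical helper; returns the number of lines that were patched.
    """
    patched = 0
    for cell in notebook.get("cells", []):
        for output in cell.get("outputs", []):
            html = output.get("data", {}).get("text/html")
            if not isinstance(html, list):
                continue  # no HTML rendering stored as a list of lines
            for i, line in enumerate(html):
                if TABLE_OPEN in line:
                    html[i] = line.replace(TABLE_OPEN, TABLE_STYLED)
                    patched += 1
    return patched


# Made-up, minimal notebook structure for illustration.
sample = {
    "cells": [
        {
            "cell_type": "code",
            "outputs": [
                {
                    "output_type": "execute_result",
                    "data": {"text/html": [TABLE_OPEN + "\n", "</table>"]},
                }
            ],
        }
    ]
}

patched_count = patch_notebook_tables(sample)
print(patched_count)
```

For a real file, the dictionary would come from `json.load` on the `.ipynb` and be written back with `json.dump`. Running the helper a second time changes nothing, because the styled tag no longer matches `TABLE_OPEN`.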

Dark theme:

Light theme:

Oh, I understand what you mean.
For this document, I have modified the style so that it still serves the purpose of spark.sql.repl.eagerEval.enabled: showing a pretty table format without applying any other operations in the notebook.

For python/docs/source/getting_started/quickstart_df.ipynb, are we going to do something similar? I ask because this example does not use spark.sql.repl.eagerEval.enabled.

The table format is also correct in Jupyter Notebook.

itholic · 2023-11-29T05:46:40Z

Nice fix! +1 for #44049 (comment), otherwise it looks good to me.

HyukjinKwon · 2023-11-30T02:23:27Z

python/docs/source/getting_started/quickstart_ps.ipynb

   "outputs": [
    {
-     "data": {
-      "text/html": [


Actually can you also manually fix the HTML here instead of using print(psdf)?

If that's possible, it would really be great :-).

Okay, I'll give it a try.

panbingkun · 2023-11-30T09:53:45Z

python/docs/source/getting_started/quickstart_ps.ipynb

       "    }\n",
       "</style>\n",
-       "<table border=\"1\" class=\"dataframe\">\n",
+       "<table border=\"1\" class=\"dataframe\" style=\"table-layout: auto;margin-right: auto;margin-left: 0;\">\n",


The example result here is incorrect; we need to correct it:
https://spark.apache.org/docs/latest/api/python/getting_started/quickstart_ps.html#Grouping

Before:

After:

HyukjinKwon

LGTM thanks for fixing this @panbingkun !!!

HyukjinKwon · 2023-12-01T00:42:40Z

Merged to master.

### What changes were proposed in this pull request?
1. After pr apache#44012, the output format of some 'ipynb' tables displayed in HTML format has been disrupted. The pr aims to fix table format error in ipynb docs.
- Before:
<img width="792" alt="image" src="https://github.com/apache/spark/assets/15246973/2095a2ac-f0b5-44bd-a3c2-ce742d041243">
- After:
<img width="739" alt="image" src="https://github.com/apache/spark/assets/15246973/ec0be72d-4dc0-44f4-ab75-d9668e32fc51">

2. Fix some minor errors.

### Why are the changes needed?
Fix bug.

### Does this PR introduce _any_ user-facing change?
Yes, only for docs.

### How was this patch tested?
Manually test.
Pass GA.

### Was this patch authored or co-authored using generative AI tooling?
No.

Closes apache#44049 from panbingkun/SPARK-46135.

Authored-by: panbingkun <[email protected]>
Signed-off-by: Hyukjin Kwon <[email protected]>

[SPARK-46135][PYTHON][DOCS] Fix table format error in ipynb docs

6157db9

github-actions bot added DOCS PYTHON labels Nov 28, 2023

[SPARK-46135][PYTHON][DOCS] Fix table format error in ipynb docs

899921e

panbingkun requested a review from HyukjinKwon November 28, 2023 11:35

HyukjinKwon reviewed Nov 29, 2023

panbingkun added 2 commits November 29, 2023 10:32

[SPARK-46135][PYTHON][DOCS] Fix table format error in ipynb docs

f52c1c8

Merge branch 'master' into SPARK-46135

b34798c

panbingkun requested a review from HyukjinKwon November 29, 2023 02:46

HyukjinKwon reviewed Nov 30, 2023

panbingkun added 2 commits November 30, 2023 16:23

Merge branch 'master' into SPARK-46135

11f6454

[SPARK-46135][PYTHON][DOCS] Fix table format error in ipynb docs

b6fb6db

HyukjinKwon approved these changes Dec 1, 2023

HyukjinKwon closed this in d145788 Dec 1, 2023
