-
Notifications
You must be signed in to change notification settings - Fork 29k
[SPARK-35025][SQL][PYTHON][DOCS] Move Parquet data source options from Python and Scala into a single page. #32161
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
…into a single page.
|
After confirming & finishing this PR, I'll also move another data source options into the single page. See SPARK-34494. |
|
Test build #137333 has finished for PR 32161 at commit
|
|
Kubernetes integration test starting |
|
Kubernetes integration test status failure |
|
Test build #750227134 for PR 32161 at commit |
|
Test build #137376 has finished for PR 32161 at commit
|
|
Kubernetes integration test starting |
|
Kubernetes integration test status failure |
|
cc @HyukjinKwon Could you please review this when you find some time? |
sql/core/src/main/scala/org/apache/spark/sql/DataFrameReader.scala
Outdated
Show resolved
Hide resolved
|
Python linter fails. Please take a look: |
|
Test build #754715131 for PR 32161 at commit |
|
Test build #137472 has finished for PR 32161 at commit
|
|
Kubernetes integration test unable to build dist. exiting with code: 1 |
|
@itholic can you take a look for the style failure and sync your branch to the latest master branch? |
|
Can you fix all instances like #32161 (comment)? Otherwise looks pretty good. @MaxGekk FYI |
|
Test build #138679 has finished for PR 32161 at commit
|
|
Kubernetes integration test starting |
|
Kubernetes integration test status success |
sql/core/src/main/scala/org/apache/spark/sql/streaming/DataStreamReader.scala
Outdated
Show resolved
Hide resolved
sql/core/src/main/scala/org/apache/spark/sql/DataFrameWriter.scala
Outdated
Show resolved
Hide resolved
|
Kubernetes integration test starting |
|
Test build #138739 has finished for PR 32161 at commit
|
|
Kubernetes integration test status success |
|
Test build #138754 has finished for PR 32161 at commit
|
sql/core/src/main/scala/org/apache/spark/sql/DataFrameWriter.scala
Outdated
Show resolved
Hide resolved
|
Looks pretty good otherwise. @MaxGekk it would be great if you could have a change to take a quick look. |
|
Test build #138779 has finished for PR 32161 at commit
|
|
Kubernetes integration test unable to build dist. exiting with code: 1 |
|
Test build #138787 has finished for PR 32161 at commit
|
| * `DataFrameReader` | ||
| * `DataFrameWriter` | ||
| * `DataStreamReader` | ||
| * `DataStreamWriter` |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
also mention:
* `OPTIONS` clause at [CREATE TABLE USING DATA_SOURCE](sql-ref-syntax-ddl-create-table-datasource.html)
What changes were proposed in this pull request?
This PR proposes move Parquet data source options from Python, Scala and Java into a single page.
Why are the changes needed?
So far, the documentation for Parquet data source options is separated into different pages for each language API documents. However, this makes managing many options inconvenient, so it is efficient to manage all options in a single page and provide a link to that page in the API of each language.
Does this PR introduce any user-facing change?
Yes, the documents will be shown below after this change:
"Parquet Files" page

Python

Scala

Java

How was this patch tested?
Manually build docs and confirm the page.