@dongjoon-hyun dongjoon-hyun commented Jan 7, 2021

What changes were proposed in this pull request?

This PR updates the SQL documentation for the ORC data source.

The new structure is as follows:

  • ORC Implementation
  • Vectorized Reader
  • Schema Merging
  • Zstandard
  • Bloom Filters
  • Columnar Encryption
  • Hive metastore ORC table conversion
  • Configuration

Why are the changes needed?

The document is out of date. Apache Spark 3.2.0 can take advantage of the improvements in Apache ORC 1.6.6.

Does this PR introduce any user-facing change?

No, this is a documentation-only change.

How was this patch tested?

Manually, by building the documentation:

SKIP_API=1 jekyll build

BEFORE
[1 screenshot, taken 2021-01-06 at 5:08 PM; image not reproduced]

AFTER
[5 screenshots, taken 2021-01-06 between 7:03 PM and 7:04 PM; images not reproduced]

@github-actions github-actions bot added the DOCS label Jan 7, 2021
@dongjoon-hyun
Member Author

Could you review this, please, @maropu, @viirya, @HyukjinKwon?

SparkQA commented Jan 7, 2021

Test build #133759 has finished for PR 31075 at commit b7baa0c.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

SparkQA commented Jan 7, 2021

Kubernetes integration test starting
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/38347/

SparkQA commented Jan 7, 2021

Kubernetes integration test status failure
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/38347/

@dongjoon-hyun
Member Author

The PR is updated. Thanks, @sunchao and @viirya.

SparkQA commented Jan 7, 2021

Test build #133762 has finished for PR 31075 at commit d9279dc.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

SparkQA commented Jan 7, 2021

Kubernetes integration test starting
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/38350/

SparkQA commented Jan 7, 2021

Kubernetes integration test status success
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/38350/

SparkQA commented Jan 7, 2021

Test build #133765 has finished for PR 31075 at commit 2024f01.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@dongjoon-hyun
Member Author

Thank you, @viirya. Merged to master.

@dongjoon-hyun dongjoon-hyun deleted the SPARK-34036 branch January 7, 2021 04:19

@HyukjinKwon HyukjinKwon left a comment

late LGTM2
