Support for reading multiple sorted files per bucket #742

Merged
rshkv merged 6 commits into master from rr/sorted-bucket
Mar 19, 2021

@rahij rahij commented Mar 16, 2021

Upstream SPARK-XXXXX ticket and PR link (if not applicable, explain)

Redoing #731 and #738 on the new branch. The upstream PR author has said that it is taking longer than expected because they plan to automatically detect whether a parent operator can take advantage of the sort order before creating the bucketed sorted RDD. We will revert this PR when either of the following happens:

  • The upstream PR is merged
  • This PR causes merge conflicts when trying to cherry pick any unrelated upstream change

@rahij rahij requested review from mattsills and rshkv March 16, 2021 15:58

rshkv commented Mar 19, 2021

For reference, this was the original PR: apache#29625

@rshkv rshkv merged commit 786fce2 into master Mar 19, 2021
@rshkv rshkv deleted the rr/sorted-bucket branch March 19, 2021 12:34