Skip to content

Use non-batch scans during sorted bucketed reads#738

Merged
rshkv merged 2 commits intomasterfrom
rr/fix-sorted-bucket
Mar 8, 2021
Merged

Use non-batch scans during sorted bucketed reads#738
rshkv merged 2 commits intomasterfrom
rr/fix-sorted-bucket

Conversation

@rahij
Copy link

@rahij rahij commented Mar 8, 2021

Follow up fix from #731. While adapting https://github.com/apache/spark/pull/29625/files to our branch, we missed a change to not do batch scans during sorted bucketed reads - specifically, this line was missed: https://github.com/apache/spark/pull/29625/files#diff-089285f1484c1598cb2839b86b6a9e65b98ab5b30462aedc210fe4bbf44cae78R177.

This results in reads failing when using codegen + vectorized reader. I have added a test that fails without this change.

@rahij rahij requested review from jdcasale, mattsills and rshkv March 8, 2021 12:49
@rshkv rshkv changed the title use non-batch scans during sorted bucketed reads Use non-batch scans during sorted bucketed reads Mar 8, 2021
@rshkv rshkv merged commit 352bdb7 into master Mar 8, 2021
@rshkv rshkv deleted the rr/fix-sorted-bucket branch March 8, 2021 13:56
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants