Proposed Updates to section 3.2#6

Merged
hhhizzz merged 1 commit into hhhizzz:lm-pipeline-blog from alamb:alamb/section3.2 on Dec 7, 2025

@alamb alamb commented Dec 5, 2025

Here are some proposed "wordsmithing" changes for

I'll comment inline with the rationale.

github-actions bot commented Dec 5, 2025

Preview URL: https://alamb.github.io/arrow-site

If the preview URL doesn't work, you may have forgotten to configure your fork repository for previews.
See https://github.com/apache/arrow-site/blob/main/README.md#forks for how to configure it.

@alamb alamb changed the base branch from main to lm-pipeline-blog December 5, 2025 13:42

[`RowSelection::scan_ranges`]: https://github.com/apache/arrow-rs/blob/ce4edd53203eb4bca96c10ebf3d2118299dad006/parquet/src/arrow/arrow_reader/selection.rs#L204

Page skipping is illustrated in the following code example:
Author

I think it helps to add some explanation / context for the code and diagram.

assert_eq!(ranges.len(), 1); // Only request page1
```
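For readers without the full blog post at hand, the page-skipping idea behind the snippet above can be sketched in plain Rust. This is a simplified model, not the arrow-rs implementation: the `Page` and `Run` types and this free-standing `scan_ranges` function are hypothetical stand-ins invented for illustration, and the real `RowSelection::scan_ranges` may differ in its types and details. The sketch only shows that a page's byte range is requested when at least one selected row falls inside that page.

```rust
use std::ops::Range;

/// Hypothetical, simplified stand-in for a page location entry.
#[derive(Debug, Clone, Copy)]
struct Page {
    offset: u64,      // byte offset of the page in the file
    size: u64,        // compressed page size in bytes
    first_row: usize, // index of the first row stored in the page
}

/// One run of an RLE-style selection: select or skip `rows` consecutive rows.
#[derive(Debug, Clone, Copy)]
struct Run {
    rows: usize,
    skip: bool,
}

/// Return the byte ranges of the pages that contain at least one selected row.
fn scan_ranges(selection: &[Run], pages: &[Page], total_rows: usize) -> Vec<Range<u64>> {
    // Expand the runs into intervals of selected row indices.
    let mut selected: Vec<Range<usize>> = Vec::new();
    let mut row = 0;
    for run in selection {
        if !run.skip && run.rows > 0 {
            selected.push(row..row + run.rows);
        }
        row += run.rows;
    }

    let mut ranges = Vec::new();
    for (i, page) in pages.iter().enumerate() {
        // A page ends where the next page starts (or at the last row).
        let end_row = pages.get(i + 1).map_or(total_rows, |next| next.first_row);
        let needed = selected
            .iter()
            .any(|s| s.start < end_row && page.first_row < s.end);
        if needed {
            ranges.push(page.offset..page.offset + page.size);
        }
    }
    ranges
}

fn main() {
    let pages = [
        Page { offset: 0, size: 100, first_row: 0 },     // page0: rows 0..100
        Page { offset: 100, size: 120, first_row: 100 }, // page1: rows 100..200
        Page { offset: 220, size: 80, first_row: 200 },  // page2: rows 200..300
    ];
    // Skip 120 rows, select 30, skip the remaining 150: only page1 is touched.
    let selection = [
        Run { rows: 120, skip: true },
        Run { rows: 30, skip: false },
        Run { rows: 150, skip: true },
    ];
    let ranges = scan_ranges(&selection, &pages, 300);
    assert_eq!(ranges.len(), 1); // Only request page1
    assert_eq!(ranges[0], 100..220);
    println!("{:?}", ranges);
}
```

Pages that contain no selected rows never appear in the result, so their bytes are neither fetched nor decompressed.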

The following figure illustrates page skipping with RLE selections. The
Author

I think this is accurate after reading the diagram, though it would be good if @hhhizzz could double-check me.

Owner

It's correct from my understanding.

@hhhizzz hhhizzz merged commit b847e48 into hhhizzz:lm-pipeline-blog Dec 7, 2025
1 check passed
@alamb alamb deleted the alamb/section3.2 branch December 8, 2025 15:12