Skip to content

Conversation

@stubz151
Copy link
Contributor

@stubz151 stubz151 commented Jun 26, 2025

What Am I doing:

#13254
This is to add in the vectored IO path to Iceberg

How am I doing this:

Adding the needed methods to Iceberg's Parquet IO implementation.
Defaulted OFF to the default Iceberg Stream and overrode it for AAL where we support it.
Added a mapping from parquet's object range to an iceberg specific one and then added a mapping from icebergs object range to AAL's

Depends on #13347

Note this includes commits from another: PR
this commit: 85afb89 is the one for read vector.

How have I tested this

  • Tested with and without AAL and could see the default non-read vectored methods being used
  • Ran a suite of benchmarks and confirmed the ranges passed through are as expected
  • Added integration tests for our read vectored approach

@stubz151 stubz151 changed the title Integration tests with analytics accelerator library AWS: Add Read Vector IO support to AAL Jun 26, 2025
@stubz151 stubz151 force-pushed the integration_tests_with_analytics_accelerator_library branch from ec4f06e to 0bb1209 Compare June 26, 2025 15:53
@github-actions github-actions bot added the build label Jul 1, 2025
@stubz151 stubz151 force-pushed the integration_tests_with_analytics_accelerator_library branch from a82a75f to e78a175 Compare July 1, 2025 15:35
@stubz151 stubz151 marked this pull request as ready for review July 2, 2025 10:22
@stubz151 stubz151 force-pushed the integration_tests_with_analytics_accelerator_library branch 5 times, most recently from 1696da6 to b47b5cb Compare July 8, 2025 14:16
@danielcweeks danielcweeks self-requested a review August 6, 2025 18:17
SanjayMarreddi and others added 6 commits August 29, 2025 14:03
- Updated S3FileIOIntegration test
- Enhanced AnalyticsAcceleratorUtil functionality
- Modified PrefixedS3Client implementation
- Updated build configuration and dependencies
- Enhanced Parquet integration
@stubz151 stubz151 force-pushed the integration_tests_with_analytics_accelerator_library branch from b47b5cb to 5317ced Compare August 29, 2025 13:05
@stubz151
Copy link
Contributor Author

Closing this PR to instead look at adding a default implementation first before doing the AAL work.

@stubz151 stubz151 closed this Aug 29, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants