Skip to content

Conversation

@parthchandra
Copy link
Contributor

Replaces the arrow native reader with DataFusion's ParquetExec (at the expense of several TODO's). This implementation passes more unit tests and is equal in speed to the full native reader.

@parthchandra
Copy link
Contributor Author

@andygrove @mbutrovich

@andygrove andygrove changed the title Use Datafusion based record batch reader for use in iceberg reads feat: Use Datafusion based record batch reader for use in iceberg reads Dec 17, 2024
@andygrove andygrove changed the title feat: Use Datafusion based record batch reader for use in iceberg reads feat: [comet-parquet-exec] Use Datafusion based record batch reader for use in iceberg reads Dec 17, 2024
Copy link
Contributor

@mbutrovich mbutrovich left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM. Once this merges I'll work on getting in sync with main.

@andygrove andygrove merged commit 2686a4b into apache:comet-parquet-exec Dec 17, 2024
29 of 75 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants