@ajantha-bhat
Member

Utility to write and read partition stats files in Parquet format.
TODO: Support ORC in a separate PR.

Depends on PRs #8500 and #8501

Derived from #8488

Fixes #8456

Address new comments

remove trailing whitespaces
- Since the core module needs to write stats in Parquet format, move all the files
from the iceberg-parquet module to iceberg-core to avoid a circular dependency.
- `TestParquetReadProjection` used to duplicate the test code of the iceberg-api module's `TestReadProjection`.
Remove the duplicate class and instead directly extend the original class from the iceberg-api module.
- Update `TestParquetReadProjection` to skip the empty-struct test cases, since only the Avro readers support them.
The test cases are now common to both the Avro and Parquet readers.
`PartitionsTable.Partition` will be shared between the Partitions metadata table
and the partition stats reader and writer.
Hence, move it to a separate class and have it implement Avro's
`IndexedRecord` (needed for writing partition stats).
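
To illustrate the idea of a record class that a writer can access by field position, here is a minimal sketch. It is not the code in this PR: `PositionalRecord` is a hypothetical stand-in for Avro's `IndexedRecord` (the real interface also exposes `getSchema()`), and the class name, field names, and field order are assumptions chosen for illustration only.

```java
// Hypothetical stand-in for org.apache.avro.generic.IndexedRecord so the
// sketch compiles without Avro on the classpath; getSchema() is omitted.
interface PositionalRecord {
  void put(int pos, Object value);

  Object get(int pos);
}

// Illustrative, simplified partition stats record. The fields and their
// positions are assumptions, not the layout used by the PR.
class PartitionStatsSketch implements PositionalRecord {
  private int specId;
  private long dataRecordCount;
  private int dataFileCount;

  @Override
  public void put(int pos, Object value) {
    switch (pos) {
      case 0:
        specId = (Integer) value;
        break;
      case 1:
        dataRecordCount = (Long) value;
        break;
      case 2:
        dataFileCount = (Integer) value;
        break;
      default:
        throw new IllegalArgumentException("Unknown field position: " + pos);
    }
  }

  @Override
  public Object get(int pos) {
    switch (pos) {
      case 0:
        return specId;
      case 1:
        return dataRecordCount;
      case 2:
        return dataFileCount;
      default:
        throw new IllegalArgumentException("Unknown field position: " + pos);
    }
  }
}
```

With positional `get` and `put`, the same object can back the metadata table rows and be handed to a generic file writer or reader, which only needs field positions rather than the concrete class.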
Development

Successfully merging this pull request may close these issues.

Build a util to read and write partition stats file for a table on a single node.
