feat(reader): null struct default values in create_column #1847

mbutrovich · 2025-11-12T16:35:01Z

Fixes TestSparkReaderDeletes.testPosDeletesOnParquetFileWithMultipleRowGroups in Iceberg Java 1.10 with DataFusion Comet.

Which issue does this PR close?

Partially addresses ArrowReader enhancements for Apache DataFusion Comet #1749.

What changes are included in this PR?

RecordBatchTransformer does not have exhaustive nested type support yet. This PR adds logic to create_column for one specific schema evolution scenario: a newly added struct column whose default value is NULL.
If the column defines a default value other than NULL, it still falls into the existing match arm and is reported as unsupported.
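
To illustrate the shape of that decision, here is a minimal sketch using only the standard library. The `DataType`, `Literal`, and `Column` enums and this `create_column` signature are hypothetical, simplified stand-ins invented for illustration; they are not the actual iceberg-rust or arrow-rs types, and the real function in this PR may differ in signature and behavior.

```rust
/// Hypothetical, simplified stand-in for an Arrow data type.
#[derive(Debug, Clone, PartialEq)]
enum DataType {
    Int32,
    Binary,
    Struct(Vec<(String, DataType)>),
}

/// Hypothetical stand-in for a non-NULL default value.
#[derive(Debug, Clone, PartialEq)]
enum Literal {
    Int(i32),
    Bytes(Vec<u8>),
}

/// Hypothetical stand-in for a materialized column.
#[derive(Debug, PartialEq)]
enum Column {
    Int32(Vec<Option<i32>>),
    Binary(Vec<Option<Vec<u8>>>),
    /// A struct column in which every row is NULL; children are all NULL too.
    NullStruct { fields: Vec<(String, Column)>, len: usize },
}

/// Build a column of `num_rows` rows for a field missing from the data file.
/// `default == None` models a NULL default value.
fn create_column(
    target: &DataType,
    default: Option<&Literal>,
    num_rows: usize,
) -> Result<Column, String> {
    match (target, default) {
        (DataType::Int32, Some(Literal::Int(v))) => Ok(Column::Int32(vec![Some(*v); num_rows])),
        (DataType::Int32, None) => Ok(Column::Int32(vec![None; num_rows])),
        (DataType::Binary, Some(Literal::Bytes(b))) => {
            Ok(Column::Binary(vec![Some(b.clone()); num_rows]))
        }
        (DataType::Binary, None) => Ok(Column::Binary(vec![None; num_rows])),
        // The case this sketch focuses on: a struct column with a NULL default.
        (DataType::Struct(fields), None) => {
            let children = fields
                .iter()
                .map(|(name, dt)| create_column(dt, None, num_rows).map(|col| (name.clone(), col)))
                .collect::<Result<Vec<_>, String>>()?;
            Ok(Column::NullStruct { fields: children, len: num_rows })
        }
        // Anything else, including a struct with a non-NULL default, is unsupported.
        (dt, _) => Err(format!("unsupported default value for {dt:?}")),
    }
}
```

In this model, calling `create_column` with a `DataType::Struct` and `None` yields an all-NULL struct column, while passing any `Some(..)` default for a struct falls through to the final arm and returns an error.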

Are these changes tested?

Adds a new test that reflects what happens with Iceberg Java 1.10's TestSparkReaderDeletes.testPosDeletesOnParquetFileWithMultipleRowGroups. The Java test's name is misleading: I figured testing positional deletes would only involve a delete vector and be schema agnostic, but it also includes a schema change with binary and struct types, so we need default NULL values.

liurenjie1024

Thanks @mbutrovich for this fix!

Null struct default values in create_column. Fixes TestSparkReaderDel…

66fed22

…etes.testPosDeletesOnParquetFileWithMultipleRowGroups in Iceberg Java 1.10 with DataFusion Comet.

mbutrovich mentioned this pull request Nov 12, 2025

ArrowReader enhancements for Apache DataFusion Comet #1749

Open

15 tasks

mbutrovich changed the title ~~fix(reader): null struct default values in create_column~~ feat(reader): null struct default values in create_column Nov 12, 2025

Fix clippy.

37b048a

mbutrovich mentioned this pull request Nov 12, 2025

Tracking issues of Iceberg Rust 0.8 Release #1850

Open

16 tasks

liurenjie1024 approved these changes Nov 13, 2025

liurenjie1024 merged commit 12c4c21 into apache:main Nov 13, 2025
18 checks passed
