Skip to content

Conversation

@mbutrovich
Copy link
Collaborator

Fixes TestSparkReaderDeletes.testPosDeletesOnParquetFileWithMultipleRowGroups in Iceberg Java 1.10 with DataFusion Comet.

Which issue does this PR close?

What changes are included in this PR?

  • While RecordBatchTransformer does not have exhaustive nested type support yet, this adds logic to create_column in the specific scenario for a schema evolution with a new struct column that uses the default NULL value.
  • If the column has a default value other than NULL defined, it will fall into the existing match arm and say it is unsupported.

Are these changes tested?

New test to reflect what happens with Iceberg Java 1.10's TestSparkReaderDeletes.testPosDeletesOnParquetFileWithMultipleRowGroups. The test is misleading, since I figured testing positional deletes would just be a delete vector and be schema agnostic, but it includes schema change with binary and struct types so we need default NULL values.

…etes.testPosDeletesOnParquetFileWithMultipleRowGroups in Iceberg Java 1.10 with DataFusion Comet.
@mbutrovich mbutrovich changed the title fix(reader): null struct default values in create_column feat(reader): null struct default values in create_column Nov 12, 2025
Copy link
Contributor

@liurenjie1024 liurenjie1024 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @mbutrovich for this fix!

@liurenjie1024 liurenjie1024 merged commit 12c4c21 into apache:main Nov 13, 2025
18 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants