Skip to content

Determine and support appropriate schema evolution semantics for Iceberg table with legacy Hive files #9843

@findepi

Description

@findepi

For a Hive table migrated to iceberg with https://iceberg.apache.org/spark-procedures/#migrate-table-procedure
if I drop a field and add a new field with same name, should i expect nulls, or data from legacy files that do not use field ID mappings?

Currently, the Trino implementation uses current names when reading a legacy file (a file without field ID information), so i will see data being read.
I would expect that legacy files are read with first Iceberg table schema, so i would expect nulls being read from legacy files.

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions