Core: Support first-row-id for manifests and manifest lists #12672

rdblue · 2025-03-28T02:43:34Z

This adds support for first-row-id in manifests and manifest lists.

Manifests are updated so that data files are assigned (inherit) a first-row-id when the field is null, based on the record counts of previous data files. Currently, the first-row-id for a data file is based on the manifest's first-row-id and the number of records in files without an assigned first-row-id. I think that this matches the expected behavior, which is based on the record_count of all ADDED data files. The spec states: "When reading, the first_row_id is assigned by replacing null with the manifest's first_row_id plus the sum of record_count for all added data files that preceded the file in the manifest."
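As a rough illustration of the reader-side assignment described above, here is a minimal, self-contained sketch. `RowIdInheritanceSketch`, `DataFileStub`, and `assignFirstRowIds` are hypothetical names, not Iceberg classes. The sketch assumes every file in the list is a data file read in manifest order, and it models only the rule stated in this description: a file with a null first-row-id takes the running counter and advances it by its record count.

```java
import java.util.ArrayList;
import java.util.List;

final class RowIdInheritanceSketch {
  private RowIdInheritanceSketch() {}

  /** Hypothetical stand-in for a data file entry; not an Iceberg class. */
  static final class DataFileStub {
    final long recordCount;
    final Long firstRowId; // null means "inherit from the manifest"

    DataFileStub(long recordCount, Long firstRowId) {
      this.recordCount = recordCount;
      this.firstRowId = firstRowId;
    }
  }

  /** Returns the effective first-row-id for each file, in manifest order. */
  static List<Long> assignFirstRowIds(long manifestFirstRowId, List<DataFileStub> files) {
    List<Long> assigned = new ArrayList<>();
    long nextRowId = manifestFirstRowId;
    for (DataFileStub file : files) {
      if (file.firstRowId != null) {
        // already assigned: keep the stored value and leave the counter alone
        assigned.add(file.firstRowId);
      } else {
        // unassigned: take the counter, then advance it by this file's rows
        assigned.add(nextRowId);
        nextRowId += file.recordCount;
      }
    }
    return assigned;
  }
}
```

For a manifest with first-row-id 100 and files with record counts 10, 5 (already assigned 7), and 20, this yields 100, 7, and 110.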

Manifest lists are updated so that first-row-id for a data manifest is always written, either because the manifest already has an assigned first-row-id or by assigning a new one. The number of added records in a manifest that is assigned a new first-row-id is used to update a next-row-id, which is either used for the next manifest or written as next-row-id in table metadata. This strategy advances the next-row-id by the number of added records in all new data manifests. This is not what the spec currently says, but I think it is what we meant at the time.

The v3 spec states:

"When adding a new data manifest file, its first_row_id field is assigned the value of the snapshot's first_row_id plus the sum of added_rows_count for all data manifests that preceded the manifest in the manifest list."

I think this language would require allocating the number of rows in added data files in the whole table for every commit, not just the added rows in the new manifests. What I've implemented allocates the number of added rows in each manifest that is assigned a new first-row-id, that is, in each new manifest.
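The manifest-list strategy described above can be sketched as follows. This is a simplified model under stated assumptions, not the code in this PR: `ManifestListRowIdSketch`, `ManifestStub`, and `assignFirstRowIds` are hypothetical names, and the counter is advanced by the added-rows count only, as the description says.

```java
import java.util.List;

final class ManifestListRowIdSketch {
  private ManifestListRowIdSketch() {}

  /** Hypothetical stand-in for a data manifest entry; not an Iceberg class. */
  static final class ManifestStub {
    Long firstRowId; // null until a first-row-id is assigned
    final long addedRowsCount;

    ManifestStub(Long firstRowId, long addedRowsCount) {
      this.firstRowId = firstRowId;
      this.addedRowsCount = addedRowsCount;
    }
  }

  /**
   * Assigns a first-row-id to every manifest that lacks one and returns the
   * next-row-id that would be carried into table metadata.
   */
  static long assignFirstRowIds(long startingNextRowId, List<ManifestStub> dataManifests) {
    long nextRowId = startingNextRowId;
    for (ManifestStub manifest : dataManifests) {
      if (manifest.firstRowId == null) {
        // new manifest: assign the counter and reserve IDs for its added rows
        manifest.firstRowId = nextRowId;
        nextRowId += manifest.addedRowsCount;
      }
      // manifests with an existing first-row-id do not consume new IDs
    }
    return nextRowId;
  }
}
```

Starting from next-row-id 200, a list containing one manifest that already has first-row-id 50 followed by two new manifests with 30 and 12 added rows assigns 200 and 230 to the new manifests and returns 242. The existing manifest's added rows are not allocated again, which is the difference from a literal reading of the spec text.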

rdblue · 2025-03-28T02:44:53Z

core/src/test/java/org/apache/iceberg/TestManifestEncryption.java

-import org.junit.jupiter.api.Test;
-import org.junit.jupiter.api.io.TempDir;
-
-public class TestManifestEncryption {


This duplicated the manifest tests in TestManifestWriterVersions, so I updated that suite to allow this one to override the plaintext EncryptionManager with the test EM.

rdblue · 2025-03-28T02:46:05Z

core/src/main/java/org/apache/iceberg/GenericManifestFile.java

+   *
+   * @deprecated will be removed in 1.10.0; use {@link ManifestWriter#toManifestFile()} instead.
+   */
+  @Deprecated


I still need to remove a few more references to this older constructor. I'm not sure why there was a public constructor and I think we should avoid having one. That's why it is now deprecated.

This touches more files, so I'll take care of it in a follow-up.

core/src/main/java/org/apache/iceberg/BaseFile.java

RussellSpitzer · 2025-03-28T20:34:24Z

core/src/main/java/org/apache/iceberg/InheritableMetadataFactory.java

    private final long snapshotId;
    private final long sequenceNumber;
    private final String manifestLocation;
+    private Long nextRowId;


I haven't thought through all the usages, but should we make this atomic? I think we may need to handle multiple threads using this class at the same time.

No, we create a separate InheritableMetadata for each manifest reader and the readers make no guarantees about thread safety.

core/src/main/java/org/apache/iceberg/SnapshotProducer.java

core/src/main/java/org/apache/iceberg/TableMetadata.java

core/src/main/java/org/apache/iceberg/TableMetadataParser.java

RussellSpitzer · 2025-03-28T21:03:59Z

core/src/main/java/org/apache/iceberg/V3Metadata.java

          return wrapped.keyMetadata();
+        case 15:
+          if (wrappedFirstRowId != null) {
+            // if first-row-id is assigned, ensure that it is valid


So this is basically making sure that if we inherited a firstRowID it is not also written in the underlying entry?

first-row-id isn't an entry field, is it?

Sorry, maybe I didn't understand. Why are we checking that wrapped.firstRowID() is null?

The purpose is to make sure that this isn't used to replace an already assigned first-row-id. In order to assign one by calling wrap(file, firstRowId), the file must be a data file and the row id can't already be assigned.
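The precondition described in this reply can be sketched as a small check. This is an illustration only: `FirstRowIdCheckSketch` and `resolveFirstRowId` are hypothetical names, and throwing `IllegalStateException` is an assumption about how a violation might be reported, not a statement about what `V3Metadata` does.

```java
final class FirstRowIdCheckSketch {
  private FirstRowIdCheckSketch() {}

  /**
   * Returns the first-row-id to write for a file.
   *
   * @param assignedFirstRowId the value passed when wrapping, or null if none is being assigned
   * @param isDataFile whether the wrapped file is a data file
   * @param existingFirstRowId the first-row-id already stored on the file, or null
   */
  static Long resolveFirstRowId(
      Long assignedFirstRowId, boolean isDataFile, Long existingFirstRowId) {
    if (assignedFirstRowId == null) {
      // nothing is being assigned: pass through whatever the file already has
      return existingFirstRowId;
    }
    if (!isDataFile) {
      throw new IllegalStateException("Cannot assign first-row-id to a non-data file");
    }
    if (existingFirstRowId != null) {
      // never replace an ID that was already assigned
      throw new IllegalStateException("first-row-id is already assigned: " + existingFirstRowId);
    }
    return assignedFirstRowId;
  }
}
```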

core/src/test/java/org/apache/iceberg/TestManifestListVersions.java

core/src/test/java/org/apache/iceberg/TestManifestWriterVersions.java

RussellSpitzer · 2025-03-28T21:55:34Z

core/src/test/java/org/apache/iceberg/TestManifestWriterVersions.java

+  }
+
+  @Test
+  public void testV3WriteWithInheritance() throws IOException {


These tests are pretty hard for me to follow. I probably just haven't been in this code in a while, but I can't tell what each one is trying to do.

Yeah, it's a bit weird because the write methods do more than one thing (write and also assert some things). I was trying to avoid refactoring the entire suite for this though.

I'll spend some more time reading through them to see if I can follow then.

core/src/test/java/org/apache/iceberg/rest/responses/TestLoadTableResponseParser.java

core/src/main/java/org/apache/iceberg/ManifestFileParser.java

rdblue · 2025-04-11T22:22:56Z

...4/spark-extensions/src/test/java/org/apache/iceberg/spark/extensions/TestMetadataTables.java

+    Schema entriesTableSchema =
+        TypeUtil.selectNot(
+            Spark3Util.loadIcebergTable(spark, tableName + ".entries").schema(),
+            Set.of(DataFile.FIRST_ROW_ID.fieldId()));


The assertions in this suite were previously reading the manifest directly as Avro and matching against the rows returned by Spark. That always produces null for first_row_id but the manifest reader sets the field using inheritance so the tests were failing.

The solution is to remove first_row_id from both the entries table schema (used to read manifests) and from the Spark datasets (via selectNonDerived).

May be worth adding a

    private Schema entriesTableSchema() {
      return TypeUtil.selectNot(
          Spark3Util.loadIcebergTable(spark, tableName + ".entries").schema(),
          Set.of(DataFile.FIRST_ROW_ID.fieldId()));
    }

to reduce the redundancy.

rdblue · 2025-04-11T22:24:45Z

core/src/test/java/org/apache/iceberg/TestManifestReader.java

        assertThat(file.pos()).as("Position should match").isEqualTo(expectedPos);
-        assertThat(((BaseFile) file).get(20))
-            .as("Position from field index should match")
-            .isEqualTo(expectedPos);


I removed this because the line above tests that file.pos() is correct. This assertion is incorrect because it expects the field order not to change.

rdblue · 2025-04-16T15:31:35Z

core/src/main/java/org/apache/iceberg/InheritableMetadataFactory.java

        manifest.path());
  }

+  /** Returns {@link InheritableMetadata} for rewriting a manifest before it is committed. */


I'm no longer changing this class, but I think it's reasonable to keep the additional javadoc.

rdblue · 2025-04-16T15:33:49Z

core/src/main/java/org/apache/iceberg/ManifestReader.java

    }
  }
+
+  private static <F extends ContentFile<F>> Function<ManifestEntry<F>, ManifestEntry<F>> idAssigner(


Keeping state in InheritableMetadata doesn't work because the same instance is reused each time the reader produces an iterator. That causes incorrect row ID assignment (caught by the tests I'm working on). Instead, I've introduced this assigner function, which is called wherever InheritableMetadata is used. InheritableMetadata is used for constants; this function is used for state-based ID assignment.
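The reuse problem described above can be reproduced with a small, self-contained sketch. The names here (`IdAssignerSketch`, `newRowIdAssigner`, `scan`) are hypothetical, and the function maps record counts to row IDs rather than manifest entries to manifest entries, so it is a simplification of the idea and not the `idAssigner` in `ManifestReader`.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Function;

final class IdAssignerSketch {
  private IdAssignerSketch() {}

  /** Returns a new stateful function that maps a file's record count to its first-row-id. */
  static Function<Long, Long> newRowIdAssigner(long manifestFirstRowId) {
    long[] nextRowId = {manifestFirstRowId}; // state is private to this function instance
    return recordCount -> {
      long assigned = nextRowId[0];
      nextRowId[0] += recordCount;
      return assigned;
    };
  }

  /** Simulates one pass of an iterator over the files in a manifest. */
  static List<Long> scan(List<Long> recordCounts, Function<Long, Long> assigner) {
    List<Long> ids = new ArrayList<>();
    for (Long recordCount : recordCounts) {
      ids.add(assigner.apply(recordCount));
    }
    return ids;
  }
}
```

With record counts 10 and 20 and a manifest first-row-id of 0, creating a fresh assigner for every scan gives 0 and 10 each time. Sharing one assigner across two scans gives 0 and 10 on the first pass but 30 and 40 on the second, which is the kind of incorrect assignment that shared state produces.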

core/src/main/java/org/apache/iceberg/ManifestReader.java

Record count is always projected.

core/src/test/java/org/apache/iceberg/TestManifestWriterVersions.java

core/src/test/java/org/apache/iceberg/TestRowLineageAssignment.java

RussellSpitzer · 2025-04-16T21:05:52Z

core/src/test/java/org/apache/iceberg/TestRowLineageMetadata.java

    table.newRewrite().deleteFile(filePart1).deleteFile(filePart2).addFile(fileCompacted).commit();

-    // Rewrites are currently just treated as appends. In the future we could treat these as no-ops
+    // rewrites produce new manifests without first-row-id or any information about how many rows


Suggested change

// rewrites produce new manifests without first-row-id or any information about how many rows

// Rewrites produce new manifests without first-row-id or any information about how many rows

RussellSpitzer · 2025-04-16T21:06:19Z

core/src/test/java/org/apache/iceberg/TestRowLineageMetadata.java


-    // Rewrites are currently just treated as appends. In the future we could treat these as no-ops
+    // rewrites produce new manifests without first-row-id or any information about how many rows
+    // are new. without tracking a new metric for a manifest (e.g., assigned-rows) or assuming that


Suggested change

// are new. without tracking a new metric for a manifest (e.g., assigned-rows) or assuming that

// are new. Without tracking a new metric for a manifest (e.g., assigned-rows) or assuming that

core/src/test/java/org/apache/iceberg/TestRowLineageMetadata.java

RussellSpitzer · 2025-04-16T21:16:54Z

spark/v3.4/spark/src/test/java/org/apache/iceberg/spark/data/TestHelpers.java

    file.put(3, 0); // specId
  }

+  // suppress the readable metrics and first-row-id that are not in manifest files


Suggested change

// suppress the readable metrics and first-row-id that are not in manifest files

// Suppress the readable metrics and first-row-id fields that are not in manifest files

I thought we didn't use sentence case in non-javadoc comments because you get weird capitalized fragments and unnecessary PRs/commits to "fix" the case. Are these strong nits?

I have no idea what our standard on this is really

// [A-Z]
Has 5422 Hits

// [a-z]
Has 10896 Hits

So we definitely are leaning towards no case

RussellSpitzer

Looks good to me. I have a few nits on tests, but I'm on board with the implementation.

core/src/main/java/org/apache/iceberg/ContentFileParser.java

core/src/main/java/org/apache/iceberg/GenericManifestFile.java

core/src/test/java/org/apache/iceberg/TestManifestWriterVersions.java

danielcweeks

+1 pending checks. (I think the failures were unrelated, so I restarted them).

rdblue · 2025-04-18T17:50:09Z

Thanks for the reviews, @RussellSpitzer and @danielcweeks! I'm going to merge this so that we can get working on the next set of changes, including #12836.

github-actions bot added API core labels Mar 28, 2025
