[WIP] V4 Manifest Read Support #14533

anoopj · 2025-11-08T00:07:47Z

WIP PR for s.apache.org/iceberg-single-file-commit

Implemented so far:

Foundational types such as TrackedFile interface as unified representation for all V4 entry types
Reader and basic root manifest expansion

Introduces the foundational types for V4 manifest format support: - TrackedFile interface as unified representation for all V4 entry types - DeletionVector and ManifestStats interfaces - GenericTrackedFile implementation and test

rdblue · 2025-11-13T00:45:13Z

api/src/main/java/org/apache/iceberg/FileContent.java

+   * Manifest deletion vector entry (V4+ only) - marks entries in a manifest as deleted without
+   * rewriting the manifest.
+   */
+  MANIFEST_DV(5);


I prefer the option of having the DV located in a field of the data or delete manifest record. That way we don't have to wait to find the DV before processing a manifest file. Not sure what others think here, but since the DV metadata/content is likely going to be different between the Metadata DV (inline) and Data DV (stored in Puffin), I don't see much value in trying to reuse metadata fields for it.

That was my preference too, and I advocated for it in our community discussion but this is what we settled on. Our current v4 proposal specifically uses MANIFEST_DV as a separate content type that references manifests via the referenced_file field. We can certainly change it, but want to hear from others.

I just check with Amogh and I don't think that this has been firmly decided yet. From the implementation here, I think we should embed the DV in manifest-specific metadata.

rdblue · 2025-11-13T00:46:24Z

api/src/main/java/org/apache/iceberg/DeletionVector.java

+   * <p>When present, the deletion vector is stored inline in the manifest rather than in a separate
+   * Puffin file.
+   */
+  ByteBuffer inlineContent();


I mentioned this in my comment below, but I don't think there's much value in combining the inline MDV metadata and fields to track data DVs stored in Puffin. These aren't overlapping, so I'd keep them separate.

Data and metadata DVs seemed like similar concepts to me. But I don't mind changing it.

rdblue · 2025-11-13T00:47:12Z

api/src/main/java/org/apache/iceberg/TrackedFile.java

+          100,
+          "location",
+          Types.StringType.get(),
+          "Location of the file. Optional if content_type is 5 and deletion_vector.inline_content is not null");


Not using a separate entry for inline would make this required, right?

rdblue · 2025-11-13T00:48:09Z

api/src/main/java/org/apache/iceberg/TrackedFile.java

+          104,
+          "file_size_in_bytes",
+          Types.LongType.get(),
+          "Total file size in bytes. Must be defined if location is defined");


rdblue · 2025-11-13T00:50:01Z

api/src/main/java/org/apache/iceberg/TrackedFile.java

+   * <p>Contains status, snapshot ID, sequence numbers, and first-row-id. Optional - may be null if
+   * tracking info is inherited.
+   */
+  TrackingInfo trackingInfo();


I think that this also requires a field to be defined, so we have a record of the ID used for it.

Yep. Will add it as soon as we define it. Right now it's TBD in the proposal.

Let's just assign it now and update the proposal. I also have example assignments in my exploration if you want to be consistent:

fn tracked_file_write_schema( table_schema: &Schema, stats_modes: Option<&HashMap<FieldId, StatsMode>>, ) -> Schema { let default = HashMap::from_iter((0..=32).into_iter().map(|id| (id, DEFAULT_STATS_MODE))); let modes: &HashMap<FieldId, StatsMode> = stats_modes.unwrap_or_else(|| &default); StructType::new([ required("tracking_info", tracking_info_schema(), 147), // TODO: assigned ID required("content", DataType::INTEGER, 134), // now required! required("location", DataType::STRING, 100), required("file_format", DataType::STRING, 101), required("record_count", DataType::LONG, 103), required("file_size_in_bytes", DataType::LONG, 104), required( "content_stats", content_stats_schema(table_schema, modes), 146, ), // TODO: ID is from stats proposal optional("key_metadata", DataType::BINARY, 131), optional("split_offsets", ArrayType::new(DataType::LONG, false), 132), // TODO: missing element ID (133) optional("content_slice", slice_schema(), 148), // TODO: assigned ID optional("referenced_file", DataType::STRING, 143), optional("manifest_stats", manifest_stats_schema(), 149), // TODO: assigned ID optional("min_sequence_number", DataType::LONG, 516), ]) }

What is content_slice?

rdblue · 2025-11-13T00:51:26Z

api/src/main/java/org/apache/iceberg/TrackedFile.java

+   * @throws IllegalStateException if content_type is not DATA
+   * @throws UnsupportedOperationException if ContentStats not yet implemented
+   */
+  DataFile asDataFile(PartitionSpec spec);


I'm not sure that we want to pass in the spec, since the record contains an ID. Wouldn't it be better to pass in a map of specs by ID when reading manifests so that this is already known when adapting to DataFile?

Yes, I thought about it, but was not sure which would be cleaner.

For this API, I think it is cleaner to call asDataFile() without any arguments.

rdblue · 2025-11-13T00:52:48Z

api/src/main/java/org/apache/iceberg/TrackedFile.java

+  DeleteFile asDeleteFile(PartitionSpec spec);
+
+  /** Set the status for this tracked file entry. */
+  void setStatus(TrackingInfo.Status status);


This API should not expose any setter methods. The implementation used can, if needed, for things like inherited metadata. But the API interface itself should not force implementations to be mutable. In general, we want to think of the API interfaces as immutable.

I had it as immutable initially, and added mutability while implementing inheritable metadata. That means that in places which need mutability, we will need to downcast it, which is fine I think.

I see. I didn't realize that we had added the setter methods to the ManifestEntry interface. Looks like that was probably allowed because ManifestEntry is in core and not exposed in the public API. Instead, DataFile and DeleteFile are in the API.

Here, I think we need to be more careful. This is very likely going to be in API along side DataFile, so it should be an immutable interface. To avoid downcasting, the manifest reader should configure its Parquet reader to produce the concrete class, GenericTrackedFile. That class can be mutable so the reader can pass instances to InheritableTrackedMetadata and then the reader should return those instances as TrackedFile (because it is CloseableIterable<TrackedFile>). That shouldn't require casting.

rdblue · 2025-11-13T00:53:23Z

api/src/main/java/org/apache/iceberg/TrackingInfo.java

+ */
+public interface TrackingInfo {
+  /** Status of an entry in a tracked file */
+  enum Status {


Isn't this enum already defined somewhere?

It is in ManifestEntry.Status, but it is in core. For v4, we need it in the API.

Okay, this is fine for now. We may want to move it later, but we'll see what makes sense.

rdblue · 2025-11-13T00:54:46Z

core/src/main/java/org/apache/iceberg/DataTableScan.java

  @Override
  public CloseableIterable<FileScanTask> doPlanFiles() {
    Snapshot snapshot = snapshot();
-


Nit: Avoid unnecessary whitespace changes. They cause conflicts.

rdblue · 2025-11-13T00:55:51Z

core/src/main/java/org/apache/iceberg/DataTableScan.java

+            : 2;
+
+    if (formatVersion >= 4) {
+      return planV4Files(snapshot, io);


Rather than modifying data table scan right now, let's leave this out. We don't need to plug anything into table scans at this point, since that is just a configuration API.

pvary · 2025-11-13T15:27:57Z

api/src/main/java/org/apache/iceberg/TrackedFile.java

+   *
+   * <p>Use this method to copy data without stats when collecting files.
+   */
+  F copyWithoutStats();


Is it intentional that we removed copyWithStats(Set<Integer> requestedColumnIds) from ContentFile?

It will be supported. I left it out for now because the stats was still being defined by @nastra.

rdblue · 2025-11-18T23:37:30Z

core/src/main/java/org/apache/iceberg/GenericTrackedFile.java

+    implements TrackedFile<GenericTrackedFile>,
+        IndexedRecord,
+        StructLike,
+        SpecificData.SchemaConstructable,


There's no need for Avro interfaces since we no longer use Avro generics to read metadata. You can remove IndexedRecord because it is replaced by StructLike and remove SpecificData.SchemaConstructable because that's handled by reflection in the readers.

rdblue · 2025-11-18T23:38:56Z

core/src/main/java/org/apache/iceberg/GenericTrackedFile.java

+import org.apache.iceberg.util.ByteBuffers;
+
+/** Generic implementation of {@link TrackedFile} for V4 manifests. */
+public class GenericTrackedFile extends SupportsIndexProjection


The prefix Generic was used to identify Avro generic classes that are compatible with IndexedRecord. Now that we have replaced IndexedRecord with StructLike, I'd recommend renaming this and other classes to TrackedFileStruct.

Also, I don't think that this class should be public. Let's keep everything package-private right now.

rdblue · 2025-11-18T23:40:58Z

core/src/main/java/org/apache/iceberg/GenericTrackedFile.java

+          ManifestStats.ADDED_ROWS_COUNT.asOptional(),
+          ManifestStats.EXISTING_ROWS_COUNT.asOptional(),
+          ManifestStats.DELETED_ROWS_COUNT.asOptional(),
+          ManifestStats.MIN_SEQUENCE_NUMBER.asOptional());


Each of the sub-structs should be its own struct type (like ManifestStatsStruct and TrackingInfoStruct) that is read and written the same way that this class is. The TrackedFile schema should embed each sub struct as a field.

rdblue · 2025-11-18T23:41:25Z

core/src/main/java/org/apache/iceberg/GenericTrackedFile.java

+  private Object contentStats = null;
+
+  // Cached schema for Avro
+  private transient Schema avroSchema = null;


No need for this.

rdblue · 2025-11-18T23:44:05Z

core/src/main/java/org/apache/iceberg/GenericTrackedFile.java

+          ManifestStats.MIN_SEQUENCE_NUMBER.asOptional());
+
+  /** Used by Avro reflection to instantiate this class when reading manifest files. */
+  public GenericTrackedFile(Schema avroSchema) {


You can remove this along with the other Avro artifacts.

rdblue · 2025-11-18T23:47:20Z

core/src/main/java/org/apache/iceberg/GenericTrackedFile.java

+   * @param partitionSpecId the partition spec ID
+   * @param recordCount the number of records
+   */
+  public GenericTrackedFile(


I don't think that this is a "full constructor". I'm also not sure that we need this constructor at all. It depends on whether we create TrackedFile instances directly and I doubt that we will. I think it is more likely that we will use a wrapper on the write path to write DataFile as a TrackedFile (similar to the V3Metadata wrapper classes). In that case, this class only needs to be used in the read path.

Let's remove this constructor for now, unless you need a complete one for test purposes (in which case, it should be package-private).

rdblue · 2025-11-18T23:48:40Z

core/src/main/java/org/apache/iceberg/GenericTrackedFile.java

+   * @param toCopy a tracked file to copy
+   * @param copyStats whether to copy stats
+   */
+  private GenericTrackedFile(GenericTrackedFile toCopy, boolean copyStats) {


For v4, we can remove the paths that use a copyStats boolean. Instead, we should use a set of field IDs to identify the fields to copy, much like the newer copy API.

rdblue · 2025-11-18T23:50:57Z

core/src/main/java/org/apache/iceberg/GenericTrackedFile.java

+  private String referencedFile = null;
+
+  // Manifest stats (for manifest entries)
+  private Integer addedFilesCount = null;


These fields are required in v4, so they should be primitives in ManifestStatsStruct.

rdblue · 2025-11-18T23:51:39Z

core/src/main/java/org/apache/iceberg/GenericTrackedFile.java

+
+  // Deletion vector fields
+  private Long deletionVectorOffset = null;
+  private Long deletionVectorSizeInBytes = null;


These should be required in the DV struct, since the struct itself is optional.

rdblue · 2025-11-18T23:52:32Z

core/src/main/java/org/apache/iceberg/GenericTrackedFile.java

+  }
+
+  @Override
+  public TrackingInfo trackingInfo() {


Structs contained within TrackedFile should be actual structs.

rdblue · 2025-11-18T23:54:12Z

core/src/main/java/org/apache/iceberg/GenericTrackedFile.java

+  }
+
+  @Override
+  public void put(int i, Object v) {


Not needed, same as getSchema.

rdblue · 2025-11-19T00:12:21Z