API: Fix timestamp(9) with identity partitioning. #13746

rdblue · 2025-08-05T21:44:14Z

This fixes the bug described in #11775 (comment), which was caused by incorrectly constructing a new predicate using a literal value instead of the literal itself.

Here's a test that reproduces the problem:

  @Test
  public void testStringToTimestampNanosLiteral() {
    Schema schema = new Schema(
        Types.NestedField.required(1, "id", Types.LongType.get()),
        Types.NestedField.optional(2, "ts", Types.TimestampNanoType.withoutZone()));

    PartitionSpec spec = PartitionSpec.builderFor(schema).identity("ts").build();

    Expression expr = Expressions.equal("ts", "2022-07-26T12:13:14.123456789");
    Expression projected = Projections.inclusive(spec).project(expr);

    Binder.bind(schema.asStruct(), projected);
  }

What's happening is the projection will first bind the original predicate because it needs a bound ID reference rather than a name reference. That ID is used to find the partition fields that can project the predicate. The binding process produces the expected TimestampNanoLiteral(1658837594123456789L). The bug is in the Identity transform's projection code, which needs to produce a new predicate that is unbound and uses a reference for the partition field's name (the partition name does not have to match). When it constructs the new unbound predicate, it passes the underlying value rather than the unchanged literal.

Updating that line to pass the literal instead of the value fixes the problem because it doesn't lose the context that the value was already a nanosecond timestamp value.

rdblue · 2025-08-05T21:45:18Z

@ebyhr, I think this fixes your timestamp nanos issue.

amogh-jahagirdar

Great find, and thanks for the minimal repro test. It makes sense that the unbound predicate that's produced needs to be based off of the literal to preserve the fact that it's a timestamp nano; with extracting the value, the previous logic would surface a predicate based on a long which would then be interpreted as micros and incorrectly undergo a conversion to nanos.

ebyhr

@rdblue Thank you for opening this PR! I internally verified that this change fixes our issue.

ebyhr · 2025-08-06T00:00:04Z

api/src/test/java/org/apache/iceberg/transforms/TestProjection.java

+    org.apache.iceberg.Schema schema =
+        new org.apache.iceberg.Schema(


nit: The package org.apache.iceberg looks redundant as this class already imports Schema.

API: Fix timestamp(9) with identity partitioning.

bd692a3

rdblue added this to the Iceberg 1.10.0 milestone Aug 5, 2025

github-actions bot added the API label Aug 5, 2025

rdblue requested a review from stevenzwu August 5, 2025 21:44

rdblue mentioned this pull request Aug 5, 2025

Core: Fix numeric overflow of timestamp nano literal #11775

Closed

Apply spotless.

5524323

amogh-jahagirdar approved these changes Aug 5, 2025

View reviewed changes

ebyhr approved these changes Aug 6, 2025

View reviewed changes

stevenzwu approved these changes Aug 6, 2025

View reviewed changes

singhpk234 approved these changes Aug 6, 2025

View reviewed changes

nastra approved these changes Aug 6, 2025

View reviewed changes

Fokko approved these changes Aug 6, 2025

View reviewed changes

danielcweeks approved these changes Aug 6, 2025

View reviewed changes

danielcweeks merged commit 64a7ca5 into apache:main Aug 6, 2025
43 checks passed

rdblue deleted the fix-timestamp-nano-with-identity-partition branch August 6, 2025 21:05

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

API: Fix timestamp(9) with identity partitioning. #13746

API: Fix timestamp(9) with identity partitioning. #13746

rdblue commented Aug 5, 2025

Uh oh!

rdblue commented Aug 5, 2025

Uh oh!

amogh-jahagirdar left a comment

Uh oh!

ebyhr left a comment

Uh oh!

ebyhr Aug 6, 2025 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

8 participants

		org.apache.iceberg.Schema schema =
		new org.apache.iceberg.Schema(

API: Fix timestamp(9) with identity partitioning. #13746

API: Fix timestamp(9) with identity partitioning. #13746

Conversation

rdblue commented Aug 5, 2025

Uh oh!

rdblue commented Aug 5, 2025

Uh oh!

amogh-jahagirdar left a comment

Choose a reason for hiding this comment

Uh oh!

ebyhr left a comment

Choose a reason for hiding this comment

Uh oh!

ebyhr Aug 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

8 participants

ebyhr Aug 6, 2025 •

edited

Loading