Skip to content

position delete manifest lower_bounds/upper_bounds not correct #6694

@kingwind94

Description

@kingwind94

Query engine

flink 1.12
iceberg 0.13.2

Question

As we know, position delete files should keep the file path and position. In rewriting v2 tables, iceberg would validateNoNewDeletesForDataFiles and would use the new position delete files' lower_bounds/upper_bounds for paths, to check whether the new position deletes should apply to rewritten data files.

However, I found that the lower_bounds/upper_bounds of my position deletes are incompelete as 'hdfs://ns2/user/', which would make iceberg validation method thinks all of the new added position deletes should apply to rewritten data files, and it will fail the rewrite process. Does any one know why the lower_bounds/upper_bounds of position deletes are not correct?

"lower_bounds" : {
  "array" : [ {
    "key" : 2147483546,
    "value" : "hdfs://ns2/user/"
  }, {
    "key" : 2147483545,
    "value" : "\n\u0000\u0000\u0000\u0000\u0000\u0000\u0000"
  } ]
}, 
"upper_bounds" : {
  "array" : [ {
    "key" : 2147483546,
    "value" : "hdfs://ns2/user0"
  }, {
    "key" : 2147483545,
    "value" : "ãå\u0005\u0000\u0000\u0000\u0000\u0000"
  } ]
},

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions