Handle cases that definitely cannot match for notStartsWith() Predica… #13766

manirajv06 · 2025-08-08T11:06:40Z

…te in StrictMetricsEvaluator

Fixes #13765

manirajv06 · 2025-08-08T11:09:12Z

@nastra @rdblue @kbendick Can you take a look?

nastra · 2025-08-08T11:27:56Z

@manirajv06 can you fix CI please?

manirajv06 · 2025-08-08T12:32:10Z

> @manirajv06 can you fix CI please?

Fixed. Please check.

manirajv06 · 2025-08-30T17:11:02Z

@nastra @rdblue Had a chance to review the changes? Thanks.

smaheshwar-pltr

Thanks for the contribution, @manirajv06!

I'm not sure the logic is quite right as is - it seems to claim that if both the lower and upper bounds don't start with the prefix, then all rows don't start with the prefix.

However, consider notStartsWith("b") with bound ["a", "z"] - the logic above would report that all rows don't start with "b", but "b" could very well be a row lying between "a" and "z".

I think that the TODO here before was referring to the case where the prefix lies outside the bounds, in which case no row can match. There are similar semantics in the inclusive metric evaluation of the startsWith predicate, which I think is symmetric to this case:

iceberg/api/src/main/java/org/apache/iceberg/expressions/InclusiveMetricsEvaluator.java

Line 388 in 7b9ea75

public <T> Boolean startsWith(Bound<T> term, Literal<T> lit) {
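
The "prefix lies outside the bounds" condition described above can be sketched as follows. This is only an illustration, not the code in this PR or in Iceberg: the class and method names are hypothetical, and it assumes non-null string bounds, no null values in the column, and plain `String.compareTo` ordering.

```java
// Hypothetical sketch; names and signatures are illustrative, not Iceberg's API.
final class NotStartsWithBoundsSketch {
  private NotStartsWithBoundsSketch() {}

  // Cut a value down to at most `length` chars so it can be compared
  // against a prefix of that length.
  static String truncate(String value, int length) {
    return value.length() <= length ? value : value.substring(0, length);
  }

  // Returns true only when every value in [lower, upper] is guaranteed NOT
  // to start with `prefix`, i.e. the prefix lies entirely outside the bounds.
  // Assumes non-null bounds, no null values, and String.compareTo ordering.
  static boolean allRowsNotStartWith(String lower, String upper, String prefix) {
    int len = prefix.length();
    // All values sort after anything that starts with the prefix.
    boolean boundsAbovePrefix = truncate(lower, len).compareTo(prefix) > 0;
    // All values sort before anything that starts with the prefix.
    boolean boundsBelowPrefix = truncate(upper, len).compareTo(prefix) < 0;
    return boundsAbovePrefix || boundsBelowPrefix;
  }
}
```

With this sketch, `allRowsNotStartWith("a", "z", "b")` is false, because a value such as "b" may lie between the bounds, while `allRowsNotStartWith("a", "b", "x")` is true, because "x" sorts after the upper bound.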

manirajv06 · 2025-09-15T16:46:33Z

@smaheshwar-pltr Thanks for taking a look. Sorry for the delay.

My understanding is that, unlike Inclusive, Strict answers only when it can determine the result with certainty. That's the reason for returning ROWS_MIGHT_NOT_MATCH in the initial cut: it is the more accurate choice from a defensive standpoint, with the comment noting that the cases that cannot match for sure should be caught later. I have made the changes to catch those cases using the prefix.

> However, consider notStartsWith("b") with bound ["a", "z"] - the logic above would report that all rows don't start with "b", but "b" could very well be a row lying between "a" and "z".

Yes, this should return true in Inclusive, but should not in Strict.

@rdblue Can you please check? Is my understanding correct?
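
To make the two semantics discussed in this thread concrete, here is a minimal sketch that answers both questions for the same bounds. It is an assumption-laden illustration rather than the evaluators' actual code: the class and method names are hypothetical, booleans stand in for the ROWS_MIGHT_MATCH / ROWS_MIGHT_NOT_MATCH style constants, and nulls and UTF-8 ordering are ignored.

```java
// Hypothetical sketch contrasting the two questions for notStartsWith(prefix).
final class NotStartsWithSemanticsSketch {
  private NotStartsWithSemanticsSketch() {}

  // Inclusive question: might at least one row satisfy notStartsWith(prefix)?
  // The answer is a definite "no" only when both bounds start with the
  // prefix, because then every value between them starts with it too.
  static boolean inclusiveRowsMightMatch(String lower, String upper, String prefix) {
    return !(lower.startsWith(prefix) && upper.startsWith(prefix));
  }

  // Strict question: do ALL rows satisfy notStartsWith(prefix)?
  // That is guaranteed only when the prefix lies outside [lower, upper].
  static boolean strictAllRowsMatch(String lower, String upper, String prefix) {
    int len = prefix.length();
    String lowerCut = lower.substring(0, Math.min(len, lower.length()));
    String upperCut = upper.substring(0, Math.min(len, upper.length()));
    return lowerCut.compareTo(prefix) > 0 || upperCut.compareTo(prefix) < 0;
  }
}
```

For bounds ["a", "z"] and prefix "b", `inclusiveRowsMightMatch` returns true and `strictAllRowsMatch` returns false in this sketch, which matches the distinction drawn in the comment above.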

github-actions · 2025-10-16T00:17:14Z

This pull request has been marked as stale due to 30 days of inactivity. It will be closed in 1 week if no further activity occurs. If you think that’s incorrect or this pull request requires a review, please simply write any comment. If closed, you can revive the PR at any time and @mention a reviewer or discuss it on the dev@iceberg.apache.org list. Thank you for your contributions.

manirajv06 · 2025-10-16T15:31:47Z

recheck

github-actions · 2025-11-16T00:19:15Z

This pull request has been marked as stale due to 30 days of inactivity. It will be closed in 1 week if no further activity occurs. If you think that’s incorrect or this pull request requires a review, please simply write any comment. If closed, you can revive the PR at any time and @mention a reviewer or discuss it on the dev@iceberg.apache.org list. Thank you for your contributions.

github-actions · 2025-11-23T00:20:53Z

This pull request has been closed due to lack of activity. This is not a judgement on the merit of the PR in any way. It is just a way of keeping the PR queue manageable. If you think that is incorrect, or the pull request requires review, you can revive the PR at any time.

Handle cases that definitely cannot match for notStartsWith() Predica…

0c0aca3

…te in StrictMetricsEvaluator

github-actions bot added the API label Aug 8, 2025

manirajv06 added 2 commits August 8, 2025 17:15

Fixed checkstyle warnings

a880263

Fixed checkstyle warnings

f85f977

smaheshwar-pltr suggested changes Sep 7, 2025

github-actions bot added the stale label Oct 16, 2025

github-actions bot removed the stale label Oct 17, 2025

github-actions bot added the stale label Nov 16, 2025

github-actions bot closed this Nov 23, 2025
