Feat cache residual evaluator #2695

ForeverAngry · 2025-11-04T02:19:39Z

Implement ResidualEvaluatorCache with LRU eviction and thread safety
Cache evaluators by partition spec, expression, case sensitivity, and schema

Rationale for this change

For queries, the same combinations of (partition spec, expression, case sensitivity, schema) are evaluated repeatedly, causing unnecessary computational overhead and increased query latency.

Are these changes tested?

Yes, using the existing test cases as they seemed sufficient for the changes.

Are there any user-facing changes?

No.

Fixes apache#2147 - Implement ResidualEvaluatorCache with LRU eviction and thread safety - Cache evaluators by partition spec, expression, case sensitivity, and schema - Fix mypy type annotations and add type ignore for cachetools decorator

…nting

Fokko · 2025-11-14T23:53:08Z

pyiceberg/expressions/visitors.py

+    cached = _residual_evaluator_cache.get(spec, expr, case_sensitive, schema)
+    if cached is not None:
+        return cached


Why not use the build in LRU cache? https://docs.python.org/3/library/functools.html#functools.lru_cache

Hi @Fokko 👋 ! We certainly could try to simplify this — my main motivation for the current approach was to build something that felt more extensible, more readable (so contributors don’t have to dig too deep to understand what’s going on), and that gives users a bit more control.

That said, when I considered switching to functools.lru_cache for residual evaluator creation, it looked like it would actually require more work to avoid a custom cache. Specifically, making the cache keys hashable while still passing the spec, expr, and schema objects — which aren’t inherently hashable — seemed tricky. I would have needed helper functions to produce stable key representations, and potentially to check the hashability of PartitionSpec, just to ensure caching didn’t break evaluator construction.

I’m definitely open to simplifying this if you think that’s the right direction. Let me know your thoughts!

could we do something similar to this? https://github.com/apache/iceberg-python/blob/main/pyiceberg/manifest.py#L877 and use a small helper function to produce the key?

@jayceslesar thanks for the the review!! Glad your back in the saddle! @Fokko what do you think?

Yes, that's more or less what I suggested as well. I don't think we should implement this ourselves.

@Fokko are we good to merge this then?

ForeverAngry added 4 commits November 7, 2025 09:41

feat: Cache ResidualEvaluator

7f525e7

Fixes apache#2147 - Implement ResidualEvaluatorCache with LRU eviction and thread safety - Cache evaluators by partition spec, expression, case sensitivity, and schema - Fix mypy type annotations and add type ignore for cachetools decorator

fix: Update type ignore for cache_clear in test_manifest.py

7ead87b

fix: Remove type ignore for cache_clear in test_manifest.py to fix li…

bbb504b

…nting

chore: Remove leftover comment in visitors.py

0ec381e

ForeverAngry force-pushed the feat-cache-residual-evaluator branch from 55e53d0 to 0ec381e Compare November 7, 2025 14:41

fix: Use modern type hint syntax (PEP 604)

c0dfdaf

Fokko reviewed Nov 14, 2025

View reviewed changes

ForeverAngry requested a review from Fokko November 23, 2025 16:58

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Feat cache residual evaluator #2695

Feat cache residual evaluator #2695

Uh oh!

ForeverAngry commented Nov 4, 2025

Uh oh!

Fokko Nov 14, 2025

Uh oh!

ForeverAngry Nov 15, 2025 •

edited

Loading

Uh oh!

jayceslesar Dec 3, 2025 •

edited

Loading

Uh oh!

ForeverAngry Dec 5, 2025

Uh oh!

Fokko Dec 8, 2025

Uh oh!

ForeverAngry Dec 11, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Feat cache residual evaluator #2695

Are you sure you want to change the base?

Feat cache residual evaluator #2695

Uh oh!

Conversation

ForeverAngry commented Nov 4, 2025

Rationale for this change

Are these changes tested?

Are there any user-facing changes?

Uh oh!

Fokko Nov 14, 2025

Choose a reason for hiding this comment

Uh oh!

ForeverAngry Nov 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jayceslesar Dec 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ForeverAngry Dec 5, 2025

Choose a reason for hiding this comment

Uh oh!

Fokko Dec 8, 2025

Choose a reason for hiding this comment

Uh oh!

ForeverAngry Dec 11, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

ForeverAngry Nov 15, 2025 •

edited

Loading

jayceslesar Dec 3, 2025 •

edited

Loading