Skip to content

Conversation

@tdcmeehan
Copy link
Owner

Description

Motivation and Context

Impact

Test Plan

Contributor checklist

  • Please make sure your submission complies with our development, formatting, commit message, and attribution guidelines.
  • PR description addresses the issue accurately and concisely. If the change is non-trivial, a GitHub Issue is referenced.
  • Documented new properties (with its default value), SQL syntax, functions, or other functionality.
  • If release notes are required, they follow the release notes guidelines.
  • Adequate tests were added if applicable.
  • CI passed.

Release Notes

Please follow release notes guidelines and fill in the release notes below.

== RELEASE NOTES ==

General Changes
* ...
* ...

Hive Changes
* ...
* ...

If release note is NOT required, use:

== NO RELEASE NOTE ==

@github-actions
Copy link

github-actions bot commented Aug 4, 2023

Codenotify: Notifying subscribers in CODENOTIFY files for diff 0f69916...65adec2.

Notify File(s)
@steveburnett presto-docs/src/main/sphinx/overview.rst

@tdcmeehan tdcmeehan closed this Aug 8, 2023
tdcmeehan pushed a commit that referenced this pull request Nov 6, 2025
…b#26529)

Summary:
1. Fix an error of refresh MV after base table insertion
2. Use ExpressionUtil in MV refresh query composing
3. Fix an issue in MV caused by copying catalog properties during
session creation

For #3, we need a follow up to understand why the check of setting
transaction id and catalog properties during session constructor is
needed. I don't see a clear reason by checking the previous relavant
commits. I'll check with some additional people. If no clear objection,
I'll remove that check at least from the Session constructor in
following PRs, and restoring the copy of catalog properties in
buildOwnerSession().

Differential Revision: D86223888

## Release Notes

```
== NO RELEASE NOTE ==
```
tdcmeehan pushed a commit that referenced this pull request Feb 8, 2026
…ies and complex subqueries

Issue #1 (cyclic join graphs): RPT's LargestRoot algorithm identifies back edges that
create cycles. Materializing those joins as CTEs decomposes the cyclic graph into acyclic
sections, each of which RPT can fully reduce independently.

Issue #3 (complex subquery join inputs): Materializing aggregation/nested-join subqueries
as CTEs converts them to TableScanNodes on temporary tables. RPT can then push BFs into
these scans with column pruning. CteProjectionAndPredicatePushDown can further push
TupleDomain ranges back into CTE producers.

Both leverage existing CTE infrastructure: LogicalCteOptimizer, PhysicalCteOptimizer,
SequenceNode, CTEMaterializationTracker.

https://claude.ai/code/session_01SAXk4AS8yQyQkhRVi6RQie
tdcmeehan pushed a commit that referenced this pull request Feb 8, 2026
New issues:
- #13: Cost model assumes local NVMe, invalid for S3/HDFS (0.3s becomes 5-30s)
- prestodb#14: Parquet BF section is 115-line dead end, should be condensed
- prestodb#15: Duplicate section numbering (two "Section 6" headers)
- prestodb#16: Superlinear join efficiency claims lack citations
- prestodb#17: CTE materialization cost missing from quantitative cost model
- prestodb#18: Reduced hash table may still exceed VRAM — fallback composition unclear

Updated existing issues: #1,#3 remaining gaps noted, #2 now worse (more
occurrences), #7 partially addressed. 12 open issues total.

https://claude.ai/code/session_01SAXk4AS8yQyQkhRVi6RQie
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant