Skip to content

Improve CBO estimates for correlated columns#11324

Merged
sopel39 merged 5 commits intotrinodb:masterfrom
raunaqmorarka:cbo-correlation
Mar 11, 2022
Merged

Improve CBO estimates for correlated columns#11324
sopel39 merged 5 commits intotrinodb:masterfrom
raunaqmorarka:cbo-correlation

Conversation

@raunaqmorarka
Copy link
Copy Markdown
Member

Description

Overall goal of the PR is to work towards enabling optimizer.default-filter-factor-enabled by default.
If default-filter-factor is enabled with existing implementation, it improves q18 and q21 on tpch significantly.
However, it also results in regressions on certain benchmark queries (tpcds partitioned q64, tpcds unpartitioned q78).
These changes update the estimation logic of filters and joins to address the problems
with underestimation of filter conjunctions and overestimation of multi-clause joins observed
when default-filter-factor is enabled with existing implementation.

Is this change a fix, improvement, new feature, refactoring, or other?

Improvement

Is this a change to the core query engine, a connector, client library, or the SPI interfaces? (be specific)

Query optimizer

How would you describe this change to a non-technical end user or system administrator?

Improves CBO estimates in the presence of hard to estimate terms.

Related issues, pull requests, and links

Picks first (n-1) commits from #11066

Documentation

( ) No documentation is needed.
(x) Sufficient documentation is included in this PR.
( ) Documentation PR is available with #prnumber.
( ) Documentation issue #issuenumber is filed, and can be handled later.

Release notes

( ) No release notes entries required.
(x) Release notes entries required with the following suggested text:

# Section
* Improve CBO estimates in the presence of correlated columns.

Loading
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Development

Successfully merging this pull request may close these issues.

3 participants