Avoid including page loading time in project expression profiler#13137

Closed
raunaqmorarka wants to merge 1 commit intotrinodb:masterfrom
raunaqmorarka:exp-pp

@raunaqmorarka
Member

Description

Is this change a fix, improvement, new feature, refactoring, or other?

Is this a change to the core query engine, a connector, client library, or the SPI interfaces? (be specific)

How would you describe this change to a non-technical end user or system administrator?

Related issues, pull requests, and links

Documentation

( ) No documentation is needed.
( ) Sufficient documentation is included in this PR.
( ) Documentation PR is available with #prnumber.
( ) Documentation issue #issuenumber is filed, and can be handled later.

Release notes

( ) No release notes entries required.
( ) Release notes entries required with the following suggested text:

# Section
* Fix some things. ({issue}`issuenumber`)

@cla-bot cla-bot bot added the cla-signed label Jul 10, 2022
Member

Where will it be accounted for?

Member Author

@raunaqmorarka raunaqmorarka Sep 13, 2022

I'm not yet sure that this needs to be accounted for anywhere.
My thought was that if ExpressionProfiler is about detecting time-consuming expression evaluation, then including page loading time, which covers reading ORC/Parquet data from the filesystem and decoding it, is probably not right.
But I wasn't certain about it.
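
To make the distinction concrete, here is a minimal sketch of the two ways of timing a projection. It is not Trino's code: `LoadThenProfile`, both method names, and the `Supplier`-based "lazy page" are hypothetical stand-ins, and the clock is injected only so the behavior is easy to check. The only point is where the timer starts relative to the page load.

```java
import java.util.function.LongSupplier;
import java.util.function.Supplier;
import java.util.function.ToLongFunction;

// Hypothetical illustration; these are not Trino classes or APIs.
final class LoadThenProfile {
    private LoadThenProfile() {}

    // Materializes the lazy page BEFORE starting the timer, so the
    // returned nanoseconds cover expression evaluation only.
    static long expressionNanosExcludingLoad(
            Supplier<long[]> lazyPage,
            ToLongFunction<long[]> expression,
            LongSupplier nanoClock) {
        long[] page = lazyPage.get(); // file read + decode happens here, untimed
        long start = nanoClock.getAsLong();
        expression.applyAsLong(page);
        return nanoClock.getAsLong() - start;
    }

    // Starts the timer first, so page loading is charged to the expression.
    static long expressionNanosIncludingLoad(
            Supplier<long[]> lazyPage,
            ToLongFunction<long[]> expression,
            LongSupplier nanoClock) {
        long start = nanoClock.getAsLong();
        long[] page = lazyPage.get(); // load time is inside the timed region
        expression.applyAsLong(page);
        return nanoClock.getAsLong() - start;
    }
}
```

With a slow page source and a cheap expression, the second variant reports a large time even though the expression itself did very little work.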

Member

> I'm not yet sure that this needs to be accounted for anywhere.

I would bet that it needs to.

> My thought was that if ExpressionProfiler is about detecting time consuming expression evaluation

Sounds so.

Member Author

I did some benchmarking and found no change in the TPC benchmarks and a small improvement in the scan operator benchmarks.
[Screenshot 2022-09-15 at 12.32.31 AM]
[Screenshot 2022-09-15 at 12.32.17 AM]

The job of ExpressionProfiler appears to be to reduce projection batch sizes for expensive expressions so that they don't hog the CPU for too long. A change in projection batch size due to this heuristic does not affect the size of reads or the pages produced by the ORC/Parquet readers (those are based on different criteria, such as locality of reads and memory consumption). So it seems that ExpressionProfiler should ignore the time taken to load a page from the page source.
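
A rough sketch of the kind of heuristic described above, to show why the measured time matters. This is an assumption-laden illustration, not Trino's implementation: the class name `AdaptiveBatchSize`, the halve/double policy, and the per-batch threshold are all hypothetical.

```java
// Hypothetical illustration of a batch-size heuristic; not Trino's code.
final class AdaptiveBatchSize {
    private final int minBatchSize;
    private final int maxBatchSize;
    private final long expensiveNanosPerBatch; // assumed threshold for "expensive"
    private int batchSize;

    AdaptiveBatchSize(int minBatchSize, int maxBatchSize, long expensiveNanosPerBatch) {
        this.minBatchSize = minBatchSize;
        this.maxBatchSize = maxBatchSize;
        this.expensiveNanosPerBatch = expensiveNanosPerBatch;
        this.batchSize = maxBatchSize; // start optimistic with the largest batch
    }

    // Feed in the time attributed to evaluating the expression over one batch.
    // Expensive batches halve the next batch size; cheap ones let it grow back.
    void record(long expressionNanos) {
        if (expressionNanos > expensiveNanosPerBatch) {
            batchSize = Math.max(minBatchSize, batchSize / 2);
        }
        else {
            batchSize = Math.min(maxBatchSize, batchSize * 2);
        }
    }

    int batchSize() {
        return batchSize;
    }
}
```

Under this sketch, if page loading time were added to `expressionNanos`, a cheap expression over a slow page source would cross the threshold and shrink the projection batch, even though a smaller batch does nothing to make the read itself faster.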

@raunaqmorarka
Member Author

Moved to #14135 as a prep commit

@raunaqmorarka raunaqmorarka deleted the exp-pp branch September 15, 2022 05:34

2 participants