Add adaptive execution policy by tdcmeehan · Pull Request #17378 · prestodb/presto

tdcmeehan · 2022-03-01T03:55:29Z

Add a simple adaptive execution policy that delegates to all-at-once scheduling for queries with few stages and to phased scheduling for queries with many stages. The aim is to keep latency reasonable for low-stage queries while making high-stage queries more reliable, at the potential cost of higher latency. Reducing the maximum number of concurrently running tasks can improve reliability in heavily loaded clusters that run very high-stage queries, such as those generated by BI tools, ETL jobs, or complex analytical queries.
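The delegation described above can be sketched roughly as follows. This is a minimal illustration, not the code in `AdaptivePhasedExecutionPolicy.java`: the class `AdaptivePolicySketch`, the enum `Strategy`, and the method `choose` are hypothetical names, the limit of 10 is an arbitrary example value, and reading "exceeds" as strictly greater than the limit is an assumption about the boundary.

```java
// Hypothetical sketch; names and the example limit are assumptions, not Presto's actual code.
class AdaptivePolicySketch {
    /** The two scheduling strategies the description mentions. */
    enum Strategy { ALL_AT_ONCE, PHASED }

    /**
     * Picks a strategy from the query's stage count.
     * Assumption: the limit is inclusive for eager (all-at-once) scheduling,
     * so only a stage count strictly above it switches to phased scheduling.
     */
    static Strategy choose(int stageCount, int maxStageCountForEagerScheduling) {
        if (stageCount < 0 || maxStageCountForEagerScheduling < 0) {
            throw new IllegalArgumentException("counts must be non-negative");
        }
        return stageCount > maxStageCountForEagerScheduling
                ? Strategy.PHASED
                : Strategy.ALL_AT_ONCE;
    }

    public static void main(String[] args) {
        int limit = 10; // illustrative value only, not the real default
        for (int stages : new int[] {3, 10, 11, 40}) {
            System.out.println(stages + " stages -> " + choose(stages, limit));
        }
    }
}
```

Keeping the decision in a single pure function makes the threshold behavior easy to unit test independently of the scheduler.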

Test plan - Ran this configuration with a workload skewed toward high-stage queries and observed a lower running task count. This workload may not reflect typical production traffic, but reliability issues are typically seen when a burst of high-stage queries runs concurrently in a cluster. In other words, this change should keep typical task counts constant while lowering the running task count in the worst case, where reliability issues are encountered.

== RELEASE NOTES ==

General Changes


* Add an adaptive stage scheduling policy that switches to phased execution mode once a query's stage count exceeds a configurable upper bound. This can be enabled by setting the session property ``execution_policy`` to ``phased`` and the stage count limit can be configured by the session property ``max_stage_count_for_eager_scheduling``.
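As a rough illustration of the release note above, the two session properties could be set with `SET SESSION` statements. The helper below only builds those statements as strings. The class `SessionPropertySketch` and its methods are hypothetical, and the stage limit of 10 is an arbitrary example value rather than a recommended setting.

```java
// Hypothetical helper; only the two session property names come from the release note.
class SessionPropertySketch {
    /** Formats one SET SESSION statement; the caller supplies a ready-made SQL literal. */
    static String setSession(String name, String sqlLiteral) {
        return "SET SESSION " + name + " = " + sqlLiteral;
    }

    /** Statements that enable the policy and set the stage count limit. */
    static String[] adaptiveSchedulingStatements(int maxStageCount) {
        return new String[] {
            setSession("execution_policy", "'phased'"),
            setSession("max_stage_count_for_eager_scheduling", Integer.toString(maxStageCount))
        };
    }

    public static void main(String[] args) {
        // 10 is an example limit chosen for illustration.
        for (String statement : adaptiveSchedulingStatements(10)) {
            System.out.println(statement + ";");
        }
    }
}
```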

...ain/src/main/java/com/facebook/presto/execution/scheduler/AdaptivePhasedExecutionPolicy.java

mayankgarg1990 · 2022-03-03T18:58:58Z

This looks good to me. While query latency is an acceptable impact to protect the cluster, do we expect some sort of memory impact as well? I am not completely familiar with how phased execution is implemented, so I may be wrong here, but since some stages might not be scheduled, we might have data staying in memory for longer, and that can increase the memory-second value of a query?

mayankgarg1990 · 2022-03-03T19:13:32Z

Add an adaptive stage scheduling policy that switches to phased execution mode once a query's stage count exceeds a configurable upper bound. This can be enabled by setting the session property ``execution_policy`` to ``phased`` and the stage count limit can be configured by the session property ``max_stage_count_for_eager_scheduling``.

swapsmagic · 2022-03-03T19:14:40Z

presto-main/src/main/java/com/facebook/presto/SystemSessionProperties.java

It makes more sense to call this config adaptive scheduling rather than eager scheduling. WDYT?

Hmm, I'm putting "max for eager scheduling", meaning the maximum level before which we use eager scheduling, and after which we use phased scheduling.

swapsmagic · 2022-03-03T19:15:32Z

...ain/src/main/java/com/facebook/presto/execution/scheduler/AdaptivePhasedExecutionPolicy.java

Can we add some unit tests to verify this works as expected?

tdcmeehan · 2022-03-03T19:41:55Z

This looks good to me. While query latency is an acceptable impact to protect the cluster, do we expect some sort of memory impact as well? I am not completely familiar with how phased execution is implemented, so I may be wrong here, but since some stages might not be scheduled, we might have data staying in memory for longer, and that can increase the memory-second value of a query?

I actually think this will likely improve memory usage, since we won't create tasks for the entire plan for queries with many stages, and task-related overhead dominates memory usage in heavily loaded clusters.

...ain/src/main/java/com/facebook/presto/execution/scheduler/AdaptivePhasedExecutionPolicy.java

tdcmeehan force-pushed the sch branch 6 times, most recently from 24282eb to 032e093 March 1, 2022 15:57

ajaygeorge reviewed Mar 2, 2022

tdcmeehan requested review from a team and ajaygeorge March 3, 2022 14:05

tdcmeehan changed the title ~~[WIP] Add adaptive execution policy~~ Add adaptive execution policy Mar 3, 2022

ajaygeorge approved these changes Mar 3, 2022

swapsmagic reviewed Mar 3, 2022

tdcmeehan requested a review from rschlussel March 3, 2022 20:05

Add adaptive execution policy

6d0a647

tdcmeehan force-pushed the sch branch from 032e093 to 6d0a647 March 3, 2022 22:07

swapsmagic approved these changes Mar 3, 2022

tdcmeehan requested a review from a team March 4, 2022 19:57

rschlussel approved these changes Mar 4, 2022

tdcmeehan merged commit 5eae5e5 into prestodb:master Mar 5, 2022

varungajjala mentioned this pull request Mar 22, 2022

Add release notes for 0.272 #17499

Closed

9 tasks

asjadsyed mentioned this pull request Mar 23, 2022

Add release notes for 0.272 #17510

Closed

9 tasks

asjadsyed mentioned this pull request Apr 1, 2022

Add release notes for 0.272 #17564

Merged

8 tasks

tdcmeehan merged 1 commit into prestodb:master from tdcmeehan:sch
