Support limiting the number of unacknowledged source splits per task #15761
Conversation
Force-pushed ad79842 to 5ec261c
Force-pushed 5ec261c to 88a5b11
shixuan-fan left a comment
"Refactor NodeAssignmentStats to simplify the tracked representation" LGTM
It feels a bit weird to have getQueuedSplitCount return queuedSplitCount + assignedSplits, but I don't have a better name in mind.
Yeah, the structure here is a little quirky (but fundamentally the same as before, just with one fewer HashMap). Any split assigned within the current batch is "effectively queued" even though it hasn't made its way to the task yet. I couldn't think of a name that did a better job of making that clear.
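The "effectively queued" idea discussed above can be sketched roughly as follows. This is an illustrative, simplified class (the names and shape are assumptions, not Presto's actual `NodeAssignmentStats` API): the reported queued count folds in splits assigned during the current scheduling batch, since they will be queued on the task as soon as the batch is flushed.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical sketch, not Presto's real implementation: splits assigned within
// the current scheduling batch count as "effectively queued" even though they
// have not yet reached the task.
public class AssignmentStatsSketch
{
    private final Map<String, Integer> queuedByNode = new HashMap<>();   // from last task status
    private final Map<String, Integer> assignedByNode = new HashMap<>(); // current scheduling batch

    public void setQueuedSplitCount(String node, int count)
    {
        queuedByNode.put(node, count);
    }

    public void addAssignedSplit(String node)
    {
        assignedByNode.merge(node, 1, Integer::sum);
    }

    // The reported "queued" count includes splits assigned in the current batch.
    public int getQueuedSplitCount(String node)
    {
        return queuedByNode.getOrDefault(node, 0) + assignedByNode.getOrDefault(node, 0);
    }

    public static void main(String[] args)
    {
        AssignmentStatsSketch stats = new AssignmentStatsSketch();
        stats.setQueuedSplitCount("node-1", 10);
        stats.addAssignedSplit("node-1");
        stats.addAssignedSplit("node-1");
        System.out.println(stats.getQueuedSplitCount("node-1")); // 10 queued + 2 assigned = 12
    }
}
```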
shixuan-fan left a comment
"Avoid unnecessary node list sorting in SimpleNodeSelector"
.../src/main/java/com/facebook/presto/execution/scheduler/nodeSelection/SimpleNodeSelector.java
shixuan-fan left a comment
"Support limiting the number of unacknowledged splits per task"
LGTM, mostly questions for my own understanding
presto-main/src/main/java/com/facebook/presto/execution/scheduler/NodeAssignmentStats.java
presto-main/src/main/java/com/facebook/presto/execution/scheduler/NodeScheduler.java
Just curious, what is the rationale behind using 500 as the default value? 🤔
In an experimental setup running with different (very large) split queue depth configurations and small files (i.e., cheap splits that are processed very quickly once delivered), 500 seemed to be about the point of diminishing returns for setting this value higher.
In the original iteration, I had this default to Integer.MAX_VALUE since I don't aim to affect any of the default scheduling behavior with this change, but rather just want to add a safety net against deeper split queues blowing up task update sizes. Dain convinced me to pick a more "reasonable" value that was still above the ceiling of having any effect on the current default configs.
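For reference, the property discussed above can be overridden per session; the property name comes from this PR's description, and 500 is the default mentioned in the discussion:

```sql
-- Raise the cap on unacknowledged splits per task for the current session
-- (the default discussed above is 500).
SET SESSION max_unacknowledged_splits_per_task = 2000;
```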
It might be a dumb question, but I'm curious: why do we need to check this before listeners are registered?
It's possible for a task's initial splits to already exceed the pending split limit inside the constructor, so setting the "no space" flag appropriately at initialization time had some effect, but... now that I'm looking I can't see what the effect was. It might have had some relationship to the MockRemoteTask implementation in tests and maybe isn't strictly required here.
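A minimal sketch of why computing the flag in the constructor can matter (this is an illustrative toy, not Presto's `RemoteTask` API; the class and method names are assumptions): if the initial splits already exceed the limit, the "has space" state must be false before any listener registers, otherwise a listener registered later would be told there is free space when there isn't.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch: a split queue whose "has space" flag is computed at
// construction time, before any listener can be registered.
public class SplitQueueSketch
{
    private final int limit;
    private int pendingSplits;
    private boolean hasSpace;
    private final List<Runnable> spaceListeners = new ArrayList<>();

    public SplitQueueSketch(int limit, int initialSplits)
    {
        this.limit = limit;
        this.pendingSplits = initialSplits;
        // Initial splits may already exceed the limit, so set the flag here,
        // before listeners are registered.
        this.hasSpace = initialSplits < limit;
    }

    public void whenHasSpace(Runnable listener)
    {
        if (hasSpace) {
            listener.run(); // fire immediately if there is already space
        }
        else {
            spaceListeners.add(listener);
        }
    }

    public void acknowledgeSplits(int count)
    {
        pendingSplits -= count;
        if (!hasSpace && pendingSplits < limit) {
            hasSpace = true;
            spaceListeners.forEach(Runnable::run);
            spaceListeners.clear();
        }
    }

    public static void main(String[] args)
    {
        // Over the limit at construction time: the listener must NOT fire yet.
        SplitQueueSketch queue = new SplitQueueSketch(500, 600);
        queue.whenHasSpace(() -> System.out.println("space available"));
        queue.acknowledgeSplits(200); // pending drops to 400, below the limit
    }
}
```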
Force-pushed 88a5b11 to 545d8b9
Adds support for tracking (and optionally limiting) the number of splits that are not yet acknowledged for a given task. Previously, the only two scheduler-supported limits were the number of total splits across all tasks on the worker node and the number of splits queued for a given task, which included:

- splits received by the worker but not yet running as of the last task status update
- splits queued in the coordinator-local RemoteTask but not yet sent to the task
- splits assigned in the current scheduling run but not yet added to the coordinator-local RemoteTask

The new max_unacknowledged_splits_per_task session property enables setting a limit on the number of splits that are either:

- queued on the coordinator-local RemoteTask but not yet sent, or not yet confirmed to have been received by the worker
- assigned to the task in the current scheduling batch but not yet added to the coordinator-local RemoteTask

This limit enforcement takes precedence over both of the other existing split limit configurations and is designed to prevent large task update requests that might cause a query to fail.
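The precedence described in the commit message can be sketched as a simple cap computation. This is a hypothetical helper, not the actual scheduler code; the limit names and signature are illustrative. The unacknowledged-split limit bounds the assignment regardless of how much room the other two limits would allow.

```java
// Hypothetical sketch of limit precedence: the number of splits assignable to a
// task is capped by all three limits, and the unacknowledged limit can shut off
// assignment entirely even when the other limits have headroom.
public class SplitLimitSketch
{
    static int splitsToAssign(int nodeTotalLimit, int queuedLimit, int unacknowledgedLimit,
            int nodeTotal, int queued, int unacknowledged)
    {
        if (unacknowledged >= unacknowledgedLimit) {
            return 0; // unacknowledged limit takes precedence over the others
        }
        int byNode = Math.max(0, nodeTotalLimit - nodeTotal);
        int byQueue = Math.max(0, queuedLimit - queued);
        int byUnacked = unacknowledgedLimit - unacknowledged;
        return Math.min(byUnacked, Math.min(byNode, byQueue));
    }

    public static void main(String[] args)
    {
        // Node limit would allow 50 more splits and the queue limit 30 more,
        // but only 10 unacknowledged slots remain, so the answer is 10.
        System.out.println(splitsToAssign(100, 40, 500, 50, 10, 490));
    }
}
```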
Force-pushed 545d8b9 to 88049f2
cc @NikhilCollooru This might touch some of the code that you are currently working on. I don't think there is a conflict, but I want to give you a heads-up.