[native] Randomize retry time with jitter after worker announcement failure. by amitkdutta · Pull Request #19906 · prestodb/presto

amitkdutta · 2023-06-17T01:43:22Z

This PR randomizes announcement scheduled time after announcement fails. Announcement failure can happen when coordinator is not available. For example, if coordinator goes to a old gc cycle, it can be non responsive for certain amount of time. After its back, all workers tries to register at the same time making its recover harder. Some randomization with jitter helps the coordinator to come back to a healthy status quickly as the workers will register with a variable speed with the upper bound of regular announcement time. Note that, if there is no failure, announcement scheduling time is unchanged.

Test plan

Tested when coordinator and worker are both healthy. Observed announcement timing does not change.
Tested with dead coordinator, observed announcement time varies. Also recovered the coordinator and observed announcement come back to previous timing.

== NO RELEASE NOTE ==

facebook-github-bot · 2023-06-17T16:49:46Z

@amitkdutta has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

pranjalssh · 2023-06-18T15:55:38Z

presto-native-execution/presto_cpp/main/PrestoServer.cpp

static constexpr

pranjalssh · 2023-06-18T16:22:00Z

presto-native-execution/presto_cpp/main/Announcer.cpp

There should also be a jitter If announcement is always successful. In that case, we can do something like timeToSleepMs + rand(2000) - 1000

In high level, when a worker makes a successful announcement, it should not schedule the next announcement very quickly, but after some specific period of time which should ideally include some jitter (as you suggested). However, when the announcement fails, worker should try the next attempt quickly with exponential back-off. And also, this wait can't be infinitely long, so capped by max delay (which is today's hard-coded frequency ms). Since worker does not know when coordinator will come back, it will need to talk indefinitely. The issue is indefinite talking is - if there are too many failed attempts then all workers will get synchronized towards the frequencyMsMax (somewhat today's behavior). On the other side, if frequencyMsMax is too big, then if coordinator comes back after a number of failed attempts, worker registration will be late due to exponential back off. That will hurt routing and caching etc.

Hence a reasonable trade off is:

Add some jitter in success

Back-off the retry exponentially where initial failed attempts try to talk quickly hopoing coordinator is back soon and registration can be done quickly (this is improvement from today's hard coded timing, because today worker need to wait a lot regardless of the failed attempts)

We need to keep the announcement indefinitely (for both success and failure) to keep giving heartbead to coordiantor.

I think it might be good to add a small amount of jitter as @pranjalssh suggested (not exponentially backoff as the failure case does but a simple a small random value in tens of milliseconds with cap to the max or on top of max) for successful case to avoid any possible synchronized announcements from all the workers especially in batch cluster with a lot of workers. thanks!

pranjalssh · 2023-06-18T16:26:13Z

presto-native-execution/presto_cpp/main/Announcer.cpp

Retrying infinitely doesn't look clean. We can cap number of failedAttempts - so there is only one codepath which does the inifinite retries.

We can also skip jitter here and use the manually added jitter in below comment. I'll leave this to your judgment

xiaoxmeng

@amitkdutta thanks for the back-off retry improvement % some minors.

xiaoxmeng · 2023-06-18T19:40:07Z

presto-native-execution/presto_cpp/main/PrestoServer.cpp

Do we want to make this a pair of node level configs? Also name constexpr variable with prefix k -> s/frequencyMsMin/kFrequencyMsMin/

xiaoxmeng · 2023-06-18T19:40:23Z

presto-native-execution/presto_cpp/main/PrestoServer.cpp

s/// In ms/// 35s/

xiaoxmeng · 2023-06-18T19:40:47Z

presto-native-execution/presto_cpp/main/PrestoServer.cpp

nit: s/// In ms/// 100 ms/

xiaoxmeng · 2023-06-18T19:41:23Z

presto-native-execution/presto_cpp/main/PrestoServer.cpp

nit: minFrequenceMs, maxFrequenceMs

xiaoxmeng · 2023-06-18T19:41:53Z

presto-native-execution/presto_cpp/main/Announcer.h

Can you comment the pair of parameters in ctor?

xiaoxmeng · 2023-06-18T19:42:21Z

presto-native-execution/presto_cpp/main/Announcer.h

NYC: could rearrange the members by putting the const members first? Thanks!

Can you comment on jitterParam_?
s/jitterParam/jitterParam_/

xiaoxmeng · 2023-06-18T19:46:41Z

presto-native-execution/presto_cpp/main/Announcer.cpp

How about we put this logic into a function

Announcer::nextScheduleDelay() { if (failedAttempts_ == 0) { return ... } ... }

xiaoxmeng · 2023-06-18T19:48:59Z

presto-native-execution/presto_cpp/main/Announcer.cpp

I think it might be good to add a small amount of jitter as @pranjalssh suggested (not exponentially backoff as the failure case does but a simple a small random value in tens of milliseconds with cap to the max or on top of max) for successful case to avoid any possible synchronized announcements from all the workers especially in batch cluster with a lot of workers. thanks!

aditi-pandit · 2023-06-20T21:48:49Z

presto-native-execution/presto_cpp/main/PrestoServer.cpp

Nit : Rename variables kMinFrequencyMs

aditi-pandit · 2023-06-20T21:49:24Z

presto-native-execution/presto_cpp/main/Announcer.h

Nit : Rename min(max)FrequencyMs_

xiaoxmeng

@amitkdutta thanks for the update!

xiaoxmeng · 2023-06-21T03:36:06Z

presto-native-execution/presto_cpp/main/Announcer.cpp

coordinator with max back off time cap at 'maxFrequencyMs_'.

xiaoxmeng · 2023-06-21T03:38:51Z

presto-native-execution/presto_cpp/main/Announcer.cpp

nit: s/Add/Adds/

…ailure.

facebook-github-bot · 2023-06-21T18:18:58Z

@amitkdutta has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

amitkdutta requested a review from a team as a code owner June 17, 2023 01:43

amitkdutta requested a review from pranjalssh June 17, 2023 01:43

amitkdutta force-pushed the announcement_jitter branch 3 times, most recently from 7565433 to 7e87029 Compare June 17, 2023 02:25

pranjalssh reviewed Jun 18, 2023

View reviewed changes

xiaoxmeng reviewed Jun 18, 2023

View reviewed changes

aditi-pandit reviewed Jun 20, 2023

View reviewed changes

amitkdutta force-pushed the announcement_jitter branch from 7e87029 to 0543476 Compare June 21, 2023 01:05

xiaoxmeng approved these changes Jun 21, 2023

View reviewed changes

[native] Randomize retry time with jitter after worker announcement f…

00506f1

…ailure.

amitkdutta force-pushed the announcement_jitter branch from 0543476 to 00506f1 Compare June 21, 2023 06:46

amitkdutta merged commit b0cde2e into prestodb:master Jun 21, 2023

mbasmanova mentioned this pull request Jun 22, 2023

[native] Enable [-Werror,-Wreorder-ctor] compiler flags #19940

Closed

amitkdutta mentioned this pull request Jun 29, 2023

[native] Random delay in announcement. #20024

Merged

Conversation

amitkdutta commented Jun 17, 2023

Uh oh!

facebook-github-bot commented Jun 17, 2023

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

xiaoxmeng Jun 18, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

xiaoxmeng left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

xiaoxmeng Jun 18, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

xiaoxmeng left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

facebook-github-bot commented Jun 21, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

xiaoxmeng Jun 18, 2023 •

edited

Loading

xiaoxmeng Jun 18, 2023 •

edited

Loading