Use system time for scheduling. by oschaaf · Pull Request #549 · envoyproxy/nighthawk

oschaaf · 2020-09-22T21:08:29Z

Small refactor that makes the client rely on system time instead of monotonic time
for scheduling the start of worker executions. System clocks can be synchronized
across machines, and this may come in handy when we start facilitating horizontal
scaling.

Note: SequencerImpl is modified to re-use the execution duration that its RateLimiter
already tracks, instead of tracking it separately. This is a small clean-up.
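A minimal sketch of that clean-up, under stated assumptions: `SketchRateLimiter` and `SketchSequencer` are hypothetical stand-ins, not the actual Nighthawk classes, and the use of a monotonic clock for the elapsed time is an assumption. The point is only that the sequencer keeps no start timestamp of its own and delegates to the limiter.

```cpp
#include <cassert>
#include <chrono>

// Hypothetical stand-in for a rate limiter that already tracks elapsed time.
class SketchRateLimiter {
public:
  using Clock = std::chrono::steady_clock; // Assumption: monotonic clock.

  explicit SketchRateLimiter(Clock::time_point start) : start_(start) {}

  // Elapsed time since the limiter started. 'now' is injected so the sketch
  // can be exercised deterministically.
  Clock::duration elapsed(Clock::time_point now) const {
    assert(now >= start_);
    return now - start_;
  }

private:
  const Clock::time_point start_;
};

// Hypothetical sequencer: holds no start time itself, it asks the limiter.
class SketchSequencer {
public:
  explicit SketchSequencer(const SketchRateLimiter& rate_limiter)
      : rate_limiter_(rate_limiter) {}

  SketchRateLimiter::Clock::duration
  executionDuration(SketchRateLimiter::Clock::time_point now) const {
    return rate_limiter_.elapsed(now);
  }

private:
  const SketchRateLimiter& rate_limiter_;
};
```

With a single source of truth for elapsed time, the two components cannot disagree about how long an execution has been running.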

Apart from the actual switching from monotonic time to wall clock time, this should be a
mechanical change.

This change will make things easier if we would like to add an option to schedule the time at
which an execution will start.
This in turn could be useful when directing clients running on multiple machines to start, as a
means to have them start at approximately the same time (how close they get would mostly depend
on how well wall clock time is synchronised across the machines involved).
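To illustrate the idea of starting at a shared wall clock time, here is a rough sketch using only `std::chrono`. The helper names `remainingDelay` and `waitForScheduledStart` are hypothetical and not part of Nighthawk; the polling approach is an assumption made for the example.

```cpp
#include <algorithm>
#include <cassert>
#include <chrono>
#include <thread>

using SystemClock = std::chrono::system_clock;

// Hypothetical helper: time left until the scheduled wall clock start, or
// zero when that moment has already passed.
SystemClock::duration remainingDelay(SystemClock::time_point scheduled_start,
                                     SystemClock::time_point now) {
  return scheduled_start > now ? scheduled_start - now : SystemClock::duration::zero();
}

// Hypothetical worker entry point: poll the system clock until the scheduled
// start is reached. Returns false if the schedule was already missed on entry.
bool waitForScheduledStart(
    SystemClock::time_point scheduled_start,
    std::chrono::milliseconds poll_interval = std::chrono::milliseconds(1)) {
  assert(poll_interval > std::chrono::milliseconds::zero());
  if (SystemClock::now() > scheduled_start) {
    return false;
  }
  auto left = remainingDelay(scheduled_start, SystemClock::now());
  while (left > SystemClock::duration::zero()) {
    // Sleep in short slices so a clock adjustment is noticed on the next poll.
    std::this_thread::sleep_for(std::min<SystemClock::duration>(left, poll_interval));
    left = remainingDelay(scheduled_start, SystemClock::now());
  }
  return true;
}
```

Clients on different machines that are handed the same `scheduled_start` would begin within roughly the clock skew between those machines, plus the polling interval.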

Signed-off-by: Otto van der Schaaf <oschaaf@we-amp.com>

A refactor that makes the client rely on system time instead of monotonic time for scheduling the start of an execution. Prelude to horizontal scalability: system clocks can be synchronized across machines. Signed-off-by: Otto van der Schaaf <oschaaf@we-amp.com>

oschaaf · 2020-09-22T22:06:58Z

/retest

repokitteh-read-only · 2020-09-22T22:07:02Z

🔨 rebuilding ci/circleci: clang_tidy (failed build)

🐱

Caused by: a #549 (comment) was created by @oschaaf.



dubious90

Looks good. Just one question.

dubious90 · 2020-09-28T22:56:45Z

source/common/termination_predicate_impl.cc

 TerminationPredicate::Status DurationTerminationPredicateImpl::evaluate() {
-  return time_source_.monotonicTime() - start_ > duration_ ? TerminationPredicate::Status::TERMINATE
-                                                           : TerminationPredicate::Status::PROCEED;
+  return time_source_.systemTime() - start_ > duration_ ? TerminationPredicate::Status::TERMINATE


This change looks good. In fact, I think it's possible this is more accurate now. Is there any chance this represents a change in behavior that we should document in the description?

Well, SystemTime, unlike MonotonicTime, isn't guaranteed to always move forward between successive reads, and it may be adjusted while we are polling it. But there's only a very small window in which this can affect operation here: the duration that the main thread requests workers to wait before starting execution. That delay is computed here:

nighthawk/source/client/process_impl.cc

Line 167 in 6aa0331

const std::chrono::milliseconds kMinimalWorkerDelay = 500ms + (concurrency * 50ms);
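As a worked example of that expression, a small sketch follows. The wrapper function `minimalWorkerDelay` is hypothetical and the `uint32_t` parameter type is an assumption; only the arithmetic is taken from the quoted line.

```cpp
#include <cassert>
#include <chrono>
#include <cstdint>

using namespace std::chrono_literals;

// Hypothetical wrapper around the quoted expression: a 500ms base plus 50ms
// for every worker that has to get ready.
std::chrono::milliseconds minimalWorkerDelay(uint32_t concurrency) {
  assert(concurrency > 0);
  return 500ms + (concurrency * 50ms);
}
```

For instance, `minimalWorkerDelay(4)` yields 700ms, so with four workers the window in which a clock adjustment could interfere is well under a second.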

Reasoning through clock updates that get applied right between our scheduling and starting of operations:

1. With small updates, load generation may start a little earlier or later, which is no problem. Any durations that get measured for latency or execution are based on monotonic time and will not be affected.

2. When the clock jumps forward a lot, in the worst case workers won't have sufficient time to get ready to start because the clock moved forward in time significantly, but they will observe that and complain in the logs about it. (Execution results may be noisy because of workers having missed their scheduled start.)

3. When the clock jumps backwards a lot, workers will wait longer before starting execution. This isn't a problem, unless it's a huge leap backwards in time, in which case the wait might take a long time as well.
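The three cases above can be modelled with a few lines of arithmetic. This is a simplified sketch, not Nighthawk code: `StartDecision` and `decideStart` are hypothetical names, and it assumes the wall clock is stepped once, immediately after the start time has been scheduled.

```cpp
#include <cassert>
#include <chrono>

using Ms = std::chrono::milliseconds;

struct StartDecision {
  bool missed_schedule; // True when the start time is already in the past.
  Ms wait;              // Real time the worker still waits before starting.
};

// planned_delay: delay the main thread asked for when scheduling the start.
// clock_step: wall clock adjustment applied right after scheduling
//             (positive means forward, negative means backwards).
StartDecision decideStart(Ms planned_delay, Ms clock_step) {
  assert(planned_delay >= Ms::zero());
  const Ms remaining = planned_delay - clock_step;
  if (remaining < Ms::zero()) {
    // Forward jump larger than the delay: the schedule was missed, which a
    // worker would be expected to log.
    return {true, Ms::zero()};
  }
  // Small adjustment, or a backwards jump that lengthens the wait.
  return {false, remaining};
}
```

With a planned delay of 700ms, a forward step of 5ms leaves a 695ms wait, a forward step of 2s means the schedule is missed, and a backwards step of 10s stretches the wait to 10.7s.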

Also, suspend/sleep might work a little differently; I suspect that MonotonicTime may not track time spent suspended/sleeping. Point 2 above more or less applies here as well.

All in all, I think chances are pretty small of anyone running into trouble because of this?

oschaaf added 3 commits September 23, 2020 22:43

Simplify, fix edge case, test getting sequencer elapsed time.

5845df5

Signed-off-by: Otto van der Schaaf <oschaaf@we-amp.com>

Merge remote-tracking branch 'upstream/master' into systemtime-for-sc…

afc460a

…heduling-start Signed-off-by: Otto van der Schaaf <oschaaf@we-amp.com>

Merge remote-tracking branch 'upstream/master' into systemtime-for-sc…

bdc7289

…heduling-start Signed-off-by: Otto van der Schaaf <oschaaf@we-amp.com>

oschaaf added the waiting-for-review A PR waiting for a review. label Sep 25, 2020

oschaaf marked this pull request as ready for review September 25, 2020 22:31

dubious90 reviewed Sep 28, 2020


dubious90 approved these changes Sep 30, 2020


dubious90 merged commit 3102a8c into envoyproxy:master Sep 30, 2020

adisuissa mentioned this pull request Nov 9, 2020

Building with clang's libc++ fails #569

Closed

dubious90 merged 4 commits into envoyproxy:master from oschaaf:systemtime-for-scheduling-start
