Allocator startup can perform many raft writes #1286

Closed
aaronlehmann opened this issue Aug 1, 2016 · 0 comments
@aaronlehmann (Collaborator)

In doNetworkInit, allocations of networks, nodes, and services aren't batched. Loading a swarm state with thousands of nodes appears to result in many raft writes from doNetworkInit's calls to allocateNode. This could block for a long time in a multi-manager setup where writes need to be acknowledged by a quorum of managers.

cc @mrjana
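A natural fix for this is to wrap the startup allocations in a store batch, so that many object updates are grouped into a few transactions (and therefore a few raft proposals) rather than one write per object. Below is a minimal sketch of that shape using swarmkit's store.Batch / batch.Update API; the function name allocateNodesBatched and the mutation inside the closure are illustrative, not the allocator's actual code:

```go
package allocator

import (
	"github.com/docker/swarmkit/api"
	"github.com/docker/swarmkit/manager/state/store"
)

// allocateNodesBatched (hypothetical name) coalesces per-node allocation
// writes into a single store.Batch, so thousands of node updates are
// committed through a handful of raft proposals instead of one per node.
func allocateNodesBatched(s *store.MemoryStore, nodes []*api.Node) error {
	return s.Batch(func(batch *store.Batch) error {
		for _, node := range nodes {
			node := node // capture the loop variable for the closure
			err := batch.Update(func(tx store.Tx) error {
				// ...perform the per-node allocation mutation here
				// (illustrative; the real allocator does more work)...
				return store.UpdateNode(tx, node)
			})
			if err != nil {
				return err
			}
		}
		return nil
	})
}
```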

aaronlehmann added a commit to aaronlehmann/swarmkit that referenced this issue Aug 1, 2016
When loading a state that contained large numbers of nodes and tasks,
but no ready nodes that could accept the tasks, swarmd used large
amounts of CPU repeatedly trying to schedule the full set of tasks. The
allocator caused many commits on startup (see moby#1286), and this produced
a large backlog of commit events, each one of which caused a full
scheduling pass.

To avoid this pathological behavior, debounce the commit events,
similarly to how the dispatcher's Tasks loop debounces events. When a
commit event is received, it starts a 50 ms countdown; the scheduling
pass runs only if no further commit event arrives before the countdown
expires. If commit events keep arriving and resetting this timer, the
scheduler runs the scheduling pass anyway after a second.

Signed-off-by: Aaron Lehmann <[email protected]>
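For illustration, here is a rough Go sketch of the debouncing described in this commit message: each commit event re-arms a 50 ms quiet-period timer, and a one-second deadline caps how long a burst of events can defer the pass. The channel and function names are hypothetical; this is not swarmkit's actual scheduler code:

```go
package scheduler

import "time"

const (
	commitDebounceGap = 50 * time.Millisecond // quiet period after the most recent commit
	maxLatency        = time.Second           // hard cap on how long a pass can be deferred
)

// debounceCommits runs runPass once commits have been quiet for 50 ms, or
// once a full second has elapsed since the first deferred commit event,
// whichever comes first.
func debounceCommits(commits <-chan struct{}, runPass func(), stop <-chan struct{}) {
	var (
		quiet    <-chan time.Time // re-armed on every commit event
		deadline <-chan time.Time // armed on the first commit of a burst
	)
	for {
		select {
		case <-commits:
			quiet = time.After(commitDebounceGap)
			if deadline == nil {
				deadline = time.After(maxLatency)
			}
		case <-quiet: // a nil channel never fires, so this is inert between bursts
			runPass()
			quiet, deadline = nil, nil
		case <-deadline:
			runPass()
			quiet, deadline = nil, nil
		case <-stop:
			return
		}
	}
}
```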
aaronlehmann added a commit that referenced this issue Aug 1, 2016
When loading a state that contained large numbers of nodes and tasks,
but no ready nodes that could accept the tasks, swarmd used large
amounts of CPU repeatedly trying to schedule the full set of tasks. The
allocator caused many commits on startup (see #1286), and this produced
a large backlog of commit events, each one of which caused a full
scheduling pass.

To avoid this pathological behavior, debounce the commit events,
similarly to how the dispatcher's Tasks loop debounces events. When a
commit event is received, it starts a 50 ms countdown; the scheduling
pass runs only if no further commit event arrives before the countdown
expires. If commit events keep arriving and resetting this timer, the
scheduler runs the scheduling pass anyway after a second.

Signed-off-by: Aaron Lehmann <[email protected]>
(cherry picked from commit 77c62db)