constraintenforcer: Trigger task restarts when appropriate #1958

aaronlehmann · 2017-02-15T20:09:33Z

The constraint enforcer currently sets task desired state to "shutdown"
directly, which means the orchestrator will consider these tasks already
processed, and won't trigger restarts. While this is appropriate for
global services, replicated services should keep the desired number of
replicas running, so each task shut down by the constraint enforcer
should be restarted somewhere else.

This changes the constraint enforcer to trigger a task shutdown by
updating the actual state rather than desired state. This will cause the
orchestrator to restart the task when necessary. It's not a perfect
solution, because it bends rules about field ownership, and may cause
the replacement task to start before the old one stops. However, it's a
good compromise solution for this problem that doesn't require absorbing
the constraint enforcer into each orchestrator (which wouldn't fit the
model well), or adding a third type of state to every task.

Addresses moby/moby#31014

cc @dongluochen @aluzzardi

codecov-io · 2017-02-15T20:24:06Z

Codecov Report

Merging #1958 into master will increase coverage by 0.08%.
The diff coverage is 50%.

@@            Coverage Diff             @@
##           master    #1958      +/-   ##
==========================================
+ Coverage    54.5%   54.58%   +0.08%     
==========================================
  Files         108      108              
  Lines       18577    18586       +9     
==========================================
+ Hits        10125    10146      +21     
+ Misses       7217     7205      -12     
  Partials     1235     1235

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 9ef1e42...8c896af. Read the comment docs.

dongluochen · 2017-02-15T21:22:01Z

manager/orchestrator/constraintenforcer/constraint_enforcer.go

+					// will bypass actions such as
+					// restarting the task on another node
+					// (if applicable).
+					t.Status.State = api.TaskStateRejected


Change function name from shutdownNoncompliantTasks to rejectNoncompliantTasks?

aluzzardi · 2017-02-15T21:24:48Z

LGTM

The constraint enforcer currently sets task desired state to "shutdown" directly, which means the orchestrator will consider these tasks already processed, and won't trigger restarts. While this is appropriate for global services, replicated services should keep the desired number of replicas running, so each task shut down by the constraint enforcer should be restarted somewhere else. This changes the constraint enforcer to trigger a task shutdown by updating the actual state rather than desired state. This will cause the orchestrator to restart the task when necessary. It's not a perfect solution, because it bends rules about field ownership, and may cause the replacement task to start before the old one stops. However, it's a good compromise solution for this problem that doesn't require absorbing the constraint enforcer into each orchestrator (which wouldn't fit the model well), or adding a third type of state to every task. Also, update the global orchestrator to only restart a task when the node still meets the constraints. Signed-off-by: Aaron Lehmann <[email protected]>

dongluochen · 2017-02-15T21:51:19Z

LGTM

aaronlehmann added the process/cherry-pick label Feb 15, 2017

aaronlehmann added this to the 1.13.2 milestone Feb 15, 2017

aaronlehmann mentioned this pull request Feb 15, 2017

Swarm stops scheduling replicas after node constraint is met again moby/moby#31014

Open

dongluochen reviewed Feb 15, 2017

View reviewed changes

aaronlehmann force-pushed the constraint-enforcer-task-replacement branch from 8c896af to 69fcc19 Compare February 15, 2017 21:38

dongluochen merged commit 569defc into moby:master Feb 15, 2017

aaronlehmann deleted the constraint-enforcer-task-replacement branch February 15, 2017 22:19

aaronlehmann added process/cherry-picked and removed process/cherry-pick labels Feb 18, 2017

This was referenced Feb 18, 2017

[1.13] Vendor swarmkit 30a4278 moby/moby#31143

Merged

update CHANGELOG for 17.03.0-ce moby/moby#31205

Merged

aaronlehmann mentioned this pull request Mar 24, 2017

Tasks should be removed when a node doesn't matches service constraint #2053

Closed

le-ortega mentioned this pull request Mar 27, 2017

Wrong task scheduling in swarm moby/moby#31377

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

constraintenforcer: Trigger task restarts when appropriate #1958

constraintenforcer: Trigger task restarts when appropriate #1958

aaronlehmann commented Feb 15, 2017

codecov-io commented Feb 15, 2017

dongluochen Feb 15, 2017

aaronlehmann Feb 15, 2017

aluzzardi commented Feb 15, 2017

dongluochen commented Feb 15, 2017

constraintenforcer: Trigger task restarts when appropriate #1958

constraintenforcer: Trigger task restarts when appropriate #1958

Conversation

aaronlehmann commented Feb 15, 2017

codecov-io commented Feb 15, 2017

Codecov Report

dongluochen Feb 15, 2017

Choose a reason for hiding this comment

aaronlehmann Feb 15, 2017

Choose a reason for hiding this comment

aluzzardi commented Feb 15, 2017

dongluochen commented Feb 15, 2017