Bug 1765756: UPSTREAM: 80004: Prefer to delete doubled-up pods of a ReplicaSet #24057
Conversation
When scaling down a ReplicaSet, delete doubled-up replicas first, where a "doubled-up replica" is defined as one that is on the same node as an active replica belonging to a related ReplicaSet. ReplicaSets are considered "related" if they have a common controller (typically a Deployment).

The intention of this change is to make a rolling update of a Deployment scale down the old ReplicaSet as it scales up the new ReplicaSet by deleting pods from the old ReplicaSet that are colocated with ready pods of the new ReplicaSet. This change in the behavior of rolling updates can be combined with pod affinity rules to preserve the locality of a Deployment's pods over rollout.

A specific scenario that benefits from this change is when a Deployment's pods are exposed by a Service that has type "LoadBalancer" and external traffic policy "Local". In this scenario, the load balancer uses health checks to determine whether it should forward traffic for the Service to a particular node. If the node has no local endpoints for the Service, the health check will fail for that node. Eventually, the load balancer will stop forwarding traffic to that node. In the meantime, the service proxy drops traffic for that Service. Thus, in order to reduce the risk of dropping traffic during a rolling update, it is desirable to preserve node locality of endpoints.

* vendor/k8s.io/kubernetes/pkg/controller/controller_utils.go (`ActivePodsWithRanks`): New type to sort pods using a given ranking.
* vendor/k8s.io/kubernetes/pkg/controller/controller_utils_test.go (`TestSortingActivePodsWithRanks`): New test for `ActivePodsWithRanks`.
* vendor/k8s.io/kubernetes/pkg/controller/replicaset/replica_set.go (`getReplicaSetsWithSameController`): New method. Given a ReplicaSet, return all ReplicaSets that have the same owner. (`manageReplicas`): Call `getIndirectlyRelatedPods`, and pass its result to `getPodsToDelete`. (`getIndirectlyRelatedPods`): New method. Given a ReplicaSet, return all pods that are owned by any ReplicaSet with the same owner. (`getPodsToDelete`): Add an argument for related pods. Use related pods and the new `getPodsRankedByRelatedPodsOnSameNode` function to take into account whether the pod is doubled up when sorting pods for deletion. (`getPodsRankedByRelatedPodsOnSameNode`): New function. Return an `ActivePodsWithRanks` value that wraps the given slice of pods and computes ranks where each pod's rank is equal to the number of active related pods that are colocated on the same node.
* vendor/k8s.io/kubernetes/pkg/controller/replicaset/replica_set_test.go (`newReplicaSet`): Set `OwnerReferences` on the ReplicaSet. (`newPod`): Set a unique UID on the pod. (`byName`): New type to sort pods by name. (`TestRelatedPodsLookup`): New test for `getIndirectlyRelatedPods`. (`TestGetPodsToDelete`): Augment the "various pod phases and conditions, diff = len(pods)" test case to ensure that scale-down still selects doubled-up pods if there are not enough other pods to scale down. Add a "various pod phases and conditions, diff = len(pods), relatedPods empty" test case to verify that `getPodsToDelete` works even if related pods could not be determined. Add a "ready and colocated with another ready pod vs not colocated, diff < len(pods)" test case to verify that a doubled-up pod gets preferred for deletion. Augment the "various pod phases and conditions, diff < len(pods)" test case to ensure that not-ready pods are preferred over ready but doubled-up pods.
* vendor/k8s.io/kubernetes/pkg/controller/replicaset/BUILD: Regenerate.
* vendor/k8s.io/kubernetes/test/e2e/apps/deployment.go (`testRollingUpdateDeploymentWithLocalTrafficLoadBalancer`): New end-to-end test. Create a deployment with a rolling update strategy and affinity rules and a load balancer with "Local" external traffic policy, and verify that the set of nodes with local endpoints for the service remains unchanged during rollouts. (`setAffinity`): New helper, used by `testRollingUpdateDeploymentWithLocalTrafficLoadBalancer`.
* vendor/k8s.io/kubernetes/test/e2e/apps/types.go (`AgnhostImageName`, `AgnhostImage`): New constants for the agnhost image.
* vendor/k8s.io/kubernetes/test/e2e/framework/service/jig.go (`GetEndpointNodes`): Factor building the set of node names out... (`GetEndpointNodeNames`): ...into this new method.
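The ranking described above (each pod's rank is the number of active related pods on the same node) can be sketched in isolation. This is a simplified illustration with invented types and names, not the controller's actual code, which operates on `*v1.Pod` objects from informer caches:

```go
package main

import "fmt"

// pod is a simplified stand-in for *v1.Pod; "active" stands in for the
// controller's notion of an active (non-terminated) pod.
type pod struct {
	name     string
	nodeName string
	active   bool
}

// rankPodsByRelatedOnSameNode mirrors the idea behind
// getPodsRankedByRelatedPodsOnSameNode: each candidate pod's rank is the
// number of active related pods scheduled on the same node.
func rankPodsByRelatedOnSameNode(candidates, related []pod) []int {
	// Count active related pods per node.
	activeOnNode := map[string]int{}
	for _, p := range related {
		if p.active {
			activeOnNode[p.nodeName]++
		}
	}
	// A candidate's rank is the count for its node.
	ranks := make([]int, len(candidates))
	for i, p := range candidates {
		ranks[i] = activeOnNode[p.nodeName]
	}
	return ranks
}

func main() {
	oldPods := []pod{{"old-a", "node1", true}, {"old-b", "node2", true}}
	newPods := []pod{{"new-a", "node1", true}}
	// old-a shares node1 with an active pod of the new ReplicaSet,
	// so it is "doubled up" and gets a nonzero rank.
	fmt.Println(rankPodsByRelatedOnSameNode(oldPods, newPods)) // [1 0]
}
```

A pod with a rank greater than zero is doubled up and, all else being equal, is preferred for deletion during scale-down.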
Skip the "Deployment should not disrupt a cloud load-balancer's connectivity during rollout" test if the number of nodes is less than 2; otherwise, set the deployment's replicas equal to the lesser of 5 and the number of nodes. The test would fail if there were fewer nodes than replicas, but the test needs at least 2 nodes, and the likelihood of failure absent the feature under test increases with the number of replicas, so it is desirable to set replicas to a higher value, within reason.

Follow-up to commit 980b640.

* vendor/k8s.io/kubernetes/test/e2e/apps/deployment.go: Skip the load-balancer connectivity test unless there are at least 2 nodes. (`testRollingUpdateDeploymentWithLocalTrafficLoadBalancer`): Set `replicas` to the min of 5 and the number of nodes.
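The skip-or-scale rule above is small enough to sketch directly. `replicasForTest` is an invented name for this illustration, not a function in the test code:

```go
package main

import "fmt"

// replicasForTest mirrors the rule described above: skip the test when
// there are fewer than 2 nodes; otherwise use the lesser of 5 and the
// node count as the deployment's replica count.
func replicasForTest(nodeCount int) (replicas int, skip bool) {
	if nodeCount < 2 {
		return 0, true // too few nodes to exercise the feature
	}
	if nodeCount < 5 {
		return nodeCount, false
	}
	return 5, false
}

func main() {
	for _, n := range []int{1, 3, 10} {
		r, skip := replicasForTest(n)
		fmt.Printf("nodes=%d replicas=%d skip=%v\n", n, r, skip)
	}
}
```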
/lgtm
@Miciah: This pull request references Bugzilla bug 1765756, which is valid. The bug has been moved to the POST state. The bug has been updated to refer to the pull request using the external bug tracker.

In response to this:

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.
@Miciah: This pull request references Bugzilla bug 1765756, which is valid.

In response to this:

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.
I wonder whether I can do this:

/test e2e-aws-scaleup-rhel7
[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: knobunc, Miciah, soltysh

The full list of commands accepted by this bot can be found here. The pull request process is described here.

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing
/retest

Please review the full test history for this PR and help us cut down flakes.
@Miciah: All pull requests linked via external trackers have merged. Bugzilla bug 1765756 has been moved to the MODIFIED state.

In response to this:

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.
This commit brings back #23806, which was dropped during a rebase scuffle (see #24014 (comment)), and adds a fix for BZ#1765756.
UPSTREAM: 80004: Prefer to delete doubled-up pods of a ReplicaSet
When scaling down a ReplicaSet, delete doubled up replicas first, where a "doubled up replica" is defined as one that is on the same node as an active replica belonging to a related ReplicaSet. ReplicaSets are considered "related" if they have a common controller (typically a Deployment).
The intention of this change is to make a rolling update of a Deployment scale down the old ReplicaSet as it scales up the new ReplicaSet by deleting pods from the old ReplicaSet that are colocated with ready pods of the new ReplicaSet. This change in the behavior of rolling updates can be combined with pod affinity rules to preserve the locality of a Deployment's pods over rollout.
A specific scenario that benefits from this change is when a Deployment's pods are exposed by a Service that has type "LoadBalancer" and external traffic policy "Local". In this scenario, the load balancer uses health checks to determine whether it should forward traffic for the Service to a particular node. If the node has no local endpoints for the Service, the health check will fail for that node. Eventually, the load balancer will stop forwarding traffic to that node. In the meantime, the service proxy drops traffic for that Service. Thus, in order to reduce the risk of dropping traffic during a rolling update, it is desirable to preserve node locality of endpoints.
* vendor/k8s.io/kubernetes/pkg/controller/controller_utils.go (`ActivePodsWithRanks`): New type to sort pods using a given ranking.
* vendor/k8s.io/kubernetes/pkg/controller/controller_utils_test.go (`TestSortingActivePodsWithRanks`): New test for `ActivePodsWithRanks`.
* vendor/k8s.io/kubernetes/pkg/controller/replicaset/replica_set.go (`getReplicaSetsWithSameController`): New method. Given a ReplicaSet, return all ReplicaSets that have the same owner. (`manageReplicas`): Call `getIndirectlyRelatedPods`, and pass its result to `getPodsToDelete`. (`getIndirectlyRelatedPods`): New method. Given a ReplicaSet, return all pods that are owned by any ReplicaSet with the same owner. (`getPodsToDelete`): Add an argument for related pods. Use related pods and the new `getPodsRankedByRelatedPodsOnSameNode` function to take into account whether the pod is doubled up when sorting pods for deletion. (`getPodsRankedByRelatedPodsOnSameNode`): New function. Return an `ActivePodsWithRanks` value that wraps the given slice of pods and computes ranks where each pod's rank is equal to the number of active related pods that are colocated on the same node.
* vendor/k8s.io/kubernetes/pkg/controller/replicaset/replica_set_test.go (`newReplicaSet`): Set `OwnerReferences` on the ReplicaSet. (`newPod`): Set a unique UID on the pod. (`byName`): New type to sort pods by name. (`TestRelatedPodsLookup`): New test for `getIndirectlyRelatedPods`. (`TestGetPodsToDelete`): Augment the "various pod phases and conditions, diff = len(pods)" test case to ensure that scale-down still selects doubled-up pods if there are not enough other pods to scale down. Add a "various pod phases and conditions, diff = len(pods), relatedPods empty" test case to verify that `getPodsToDelete` works even if related pods could not be determined. Add a "ready and colocated with another ready pod vs not colocated, diff < len(pods)" test case to verify that a doubled-up pod gets preferred for deletion. Augment the "various pod phases and conditions, diff < len(pods)" test case to ensure that not-ready pods are preferred over ready but doubled-up pods.
* vendor/k8s.io/kubernetes/pkg/controller/replicaset/BUILD: Regenerate.
* vendor/k8s.io/kubernetes/test/e2e/apps/deployment.go (`testRollingUpdateDeploymentWithLocalTrafficLoadBalancer`): New end-to-end test. Create a deployment with a rolling update strategy and affinity rules and a load balancer with "Local" external traffic policy, and verify that the set of nodes with local endpoints for the service remains unchanged during rollouts. (`setAffinity`): New helper, used by `testRollingUpdateDeploymentWithLocalTrafficLoadBalancer`.
* vendor/k8s.io/kubernetes/test/e2e/apps/types.go (`AgnhostImageName`, `AgnhostImage`): New constants for the agnhost image.
* vendor/k8s.io/kubernetes/test/e2e/framework/service/jig.go (`GetEndpointNodes`): Factor building the set of node names out... (`GetEndpointNodeNames`): ...into this new method.

UPSTREAM: 84339: Fix deployment e2e test at scale
UPSTREAM: 84568: test/e2e/apps: Skip or scale LB test per node count
Skip the "Deployment should not disrupt a cloud load-balancer's connectivity during rollout" test if the number of nodes is less than 2; otherwise, set the deployment's replicas equal to the lesser of 5 and the number of nodes.
The test would fail if there were fewer nodes than replicas, but the test needs at least 2 nodes, and the likelihood of failure absent the feature under test increases with the number of replicas, so it is desirable to set replicas to a higher value, within reason.
* vendor/k8s.io/kubernetes/test/e2e/apps/deployment.go: Skip the load-balancer connectivity test unless there are at least 2 nodes. (`testRollingUpdateDeploymentWithLocalTrafficLoadBalancer`): Set `replicas` to the min of 5 and the number of nodes.

This PR supersedes #24047.
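The related-pods lookup described in the commit notes (`getReplicaSetsWithSameController` followed by `getIndirectlyRelatedPods`) can be sketched with flattened stand-in records. The types and field names here are invented for illustration; the real code compares `metav1.OwnerReference` controller UIDs via informer listers:

```go
package main

import "fmt"

// replicaSet is a flattened stand-in: ownerUID identifies the
// controlling object (typically a Deployment).
type replicaSet struct {
	name     string
	ownerUID string
}

// podRec is a flattened stand-in: ownerRS names the owning ReplicaSet.
type podRec struct {
	name    string
	ownerRS string
}

// indirectlyRelatedPods sketches getIndirectlyRelatedPods: collect the
// pods owned by any ReplicaSet that shares a controller with rs.
func indirectlyRelatedPods(rs replicaSet, allRS []replicaSet, pods []podRec) []string {
	// First, find all ReplicaSets with the same owner (the
	// getReplicaSetsWithSameController step).
	related := map[string]bool{}
	for _, other := range allRS {
		if other.ownerUID == rs.ownerUID {
			related[other.name] = true
		}
	}
	// Then collect the pods owned by any of those ReplicaSets.
	var out []string
	for _, p := range pods {
		if related[p.ownerRS] {
			out = append(out, p.name)
		}
	}
	return out
}

func main() {
	rsOld := replicaSet{"web-old", "deploy-1"}
	all := []replicaSet{rsOld, {"web-new", "deploy-1"}, {"other", "deploy-2"}}
	pods := []podRec{{"p1", "web-old"}, {"p2", "web-new"}, {"p3", "other"}}
	fmt.Println(indirectlyRelatedPods(rsOld, all, pods)) // [p1 p2]
}
```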