Bug 1984683: use new default leader election values to handle apiserver rollout on SNO #328

Merged
openshift-ci[bot] merged 1 commit into openshift:master from creydr:use-new-leader-election-values-to-handle-60-sec-apiserver-rollout-in-sno
Aug 4, 2021

Conversation

@creydr
Member

@creydr creydr commented Jul 30, 2021

When SDN is deployed on SNO, it can run into leader election timeouts during an API server rollout, because only one API server is available and it does not yet restart fast enough.
Therefore, the default leader election values have been updated to the recommended values (openshift/library-go#1104).

This PR bumps the openshift/library-go dependency to pick up these new values, so that SDN can tolerate 60 seconds of API server downtime on SNO.

How to reproduce / validate:

  1. Start a SNO cluster with this patch
  2. Trigger an apiserver rollout (e.g. via k patch kubeapiserver/cluster --type merge -p '{"spec":{"forceRedeploymentReason":"something"}}')
  3. Check the logs of the sdncontroller pod during the rollout
  4. Validate that the pod was not restarted because it lost the lease (e.g. check that no previous logs are available, or that they do not contain leaderelection lost)

Special notes for reviewers:

  • Had to update the Go version for the build root image, as openshift/build-machinery required at least Go 1.15.2:

    sdn/.ci-operator.yaml

    Lines 1 to 4 in d979606

    build_root_image:
      name: release
      namespace: openshift
      tag: golang-1.16

@openshift-ci openshift-ci bot added the do-not-merge/work-in-progress (indicates that a PR should not merge because it is a work in progress), bugzilla/severity-high (referenced Bugzilla bug's severity is high for the branch this PR is targeting), and bugzilla/valid-bug (indicates that a referenced Bugzilla bug is valid for the branch this PR is targeting) labels Jul 30, 2021
@openshift-ci
Contributor

openshift-ci bot commented Jul 30, 2021

@creydr: This pull request references Bugzilla bug 1984683, which is valid. The bug has been moved to the POST state. The bug has been updated to refer to the pull request using the external bug tracker.

3 validation(s) were run on this bug
  • bug is open, matching expected state (open)
  • bug target release (4.9.0) matches configured target release for branch (4.9.0)
  • bug is in the state ASSIGNED, which is one of the valid states (NEW, ASSIGNED, ON_DEV, POST, POST)

Requesting review from QA contact:
/cc @zhaozhanqi

Details

In response to this:

Bug 1984683: use new default leader election values to handle apiserver rollout on SNO

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@openshift-ci openshift-ci bot requested a review from zhaozhanqi July 30, 2021 06:46
@openshift-ci
Contributor

openshift-ci bot commented Jul 30, 2021

Skipping CI for Draft Pull Request.
If you want CI signal for your change, please convert it to an actual PR.
You can still manually trigger a test run with /test all

@creydr creydr marked this pull request as ready for review July 30, 2021 07:48
@openshift-ci openshift-ci bot removed the do-not-merge/work-in-progress label (indicates that a PR should not merge because it is a work in progress) Jul 30, 2021
@creydr creydr marked this pull request as draft July 30, 2021 07:57
@openshift-ci openshift-ci bot added the do-not-merge/work-in-progress label (indicates that a PR should not merge because it is a work in progress) Jul 30, 2021
@creydr creydr force-pushed the use-new-leader-election-values-to-handle-60-sec-apiserver-rollout-in-sno branch from e3b0dbe to 164424c Compare July 30, 2021 09:55
@creydr
Member Author

creydr commented Jul 30, 2021

/retest

@creydr creydr force-pushed the use-new-leader-election-values-to-handle-60-sec-apiserver-rollout-in-sno branch 2 times, most recently from 995c706 to 6402e7f Compare July 30, 2021 12:08
Signed-off-by: Christoph Stäbler <cstabler@redhat.com>
@creydr creydr force-pushed the use-new-leader-election-values-to-handle-60-sec-apiserver-rollout-in-sno branch from 6402e7f to d979606 Compare July 30, 2021 12:22
@creydr
Member Author

creydr commented Jul 30, 2021

/retest

@openshift-ci
Contributor

openshift-ci bot commented Jul 30, 2021

@creydr: This pull request references Bugzilla bug 1984683, which is valid.

3 validation(s) were run on this bug
  • bug is open, matching expected state (open)
  • bug target release (4.9.0) matches configured target release for branch (4.9.0)
  • bug is in the state POST, which is one of the valid states (NEW, ASSIGNED, ON_DEV, POST, POST)

Requesting review from QA contact:
/cc @zhaozhanqi

@creydr creydr marked this pull request as ready for review July 30, 2021 12:24
@openshift-ci openshift-ci bot removed the do-not-merge/work-in-progress label (indicates that a PR should not merge because it is a work in progress) Jul 30, 2021
@creydr
Member Author

creydr commented Jul 30, 2021

/retest-required

@creydr
Member Author

creydr commented Aug 2, 2021

@zhaozhanqi, @abhat, @alexanderConstantinescu could you have a look at this PR please?
Thanks

@zhaozhanqi

> @zhaozhanqi, @abhat, @alexanderConstantinescu could you have a look at this PR please?
> Thanks

I added a comment in https://bugzilla.redhat.com/show_bug.cgi?id=1984683#c8, please take a look. Thanks.

@creydr
Member Author

creydr commented Aug 3, 2021

@abhat could you have a look at this PR?

@abhat
Contributor

abhat commented Aug 3, 2021

/lgtm

@abhat
Contributor

abhat commented Aug 3, 2021

/approve

@openshift-ci openshift-ci bot added the lgtm label (indicates that a PR is ready to be merged) Aug 3, 2021
@openshift-ci
Contributor

openshift-ci bot commented Aug 3, 2021

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: abhat, creydr

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Details Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@openshift-ci openshift-ci bot added the approved label (indicates a PR has been approved by an approver from all required OWNERS files) Aug 3, 2021
@openshift-bot
Contributor

/retest-required

Please review the full test history for this PR and help us cut down flakes.

@openshift-ci openshift-ci bot merged commit a7a08ff into openshift:master Aug 4, 2021
@openshift-ci
Contributor

openshift-ci bot commented Aug 4, 2021

@creydr: All pull requests linked via external trackers have merged:

Bugzilla bug 1984683 has been moved to the MODIFIED state.
