Skip to content

ovn-kube: add pod disruption budget, readiness check#414

Merged
openshift-merge-robot merged 1 commit intoopenshift:masterfrom
squeed:ovn-improvements
Dec 5, 2019
Merged

ovn-kube: add pod disruption budget, readiness check#414
openshift-merge-robot merged 1 commit intoopenshift:masterfrom
squeed:ovn-improvements

Conversation

@squeed
Copy link
Contributor

@squeed squeed commented Dec 4, 2019

Add a PodDisruptionBudget to protect the raft quorum.

Configure a readines probe for the DBs: ovsdb raft only opens its port once it has a raft consensus. Utilize that.

Also, add a TerminationMessagePolicy.

(Closes SDN-662)

/cc @alexanderConstantinescu

Add a PodDisruptionBudget to protect the raft quorum.

Configure a readines probe for the DBs: ovsdb raft only opens its port
once it has a raft consensus. Utilize that.

Also, add a TerminationMessagePolicy.
@openshift-ci-robot openshift-ci-robot added the size/M Denotes a PR that changes 30-99 lines, ignoring generated files. label Dec 4, 2019
@squeed
Copy link
Contributor Author

squeed commented Dec 4, 2019

/test e2e-aws-ovn

@squeed
Copy link
Contributor Author

squeed commented Dec 4, 2019

/cc @dcbw

@openshift-ci-robot openshift-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Dec 4, 2019
done
fi
readinessProbe:
initialDelaySeconds: 30
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Question, I am not sure how readinessProbes impact hostNetwork pods. But will ovnkube-node be able to communicate with its depedencies in ovnkube-master during the first 30 seconds while ovnkube-master is not ready? Otherwise I suspect we might have a lot of errors logged in those pods (if they don't even CrashLoop?)

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Very good question. In this case, that won't be an issue, because we're not using a Service / Endpoints. Basically, only Ready pods can show up in a Service. However, we talk directly to the pods, so that's not an issue here.

@squeed
Copy link
Contributor Author

squeed commented Dec 5, 2019

Looks like the test failure is due to a pod not being created in time, but the logs indicate it only waits about 5 seconds. That seems... wrong. Looking further.

@squeed
Copy link
Contributor Author

squeed commented Dec 5, 2019

Aha, seems to be a bit of a flake
/test e2e-aws-ovn

@squeed
Copy link
Contributor Author

squeed commented Dec 5, 2019

Filed https://bugzilla.redhat.com/show_bug.cgi?id=1780143 for test flake.

@alexanderConstantinescu
Copy link
Contributor

/lgtm

@openshift-ci-robot openshift-ci-robot added the lgtm Indicates that a PR is ready to be merged. label Dec 5, 2019
@openshift-ci-robot
Copy link
Contributor

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: alexanderConstantinescu, squeed

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Details Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@openshift-bot
Copy link
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

5 similar comments
@openshift-bot
Copy link
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

@openshift-bot
Copy link
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

@openshift-bot
Copy link
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

@openshift-bot
Copy link
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

@openshift-bot
Copy link
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

@openshift-merge-robot openshift-merge-robot merged commit 25ed090 into openshift:master Dec 5, 2019
@openshift-ci-robot
Copy link
Contributor

@squeed: The following test failed, say /retest to rerun them all:

Test name Commit Details Rerun command
ci/prow/e2e-aws-ovn b2f02b3 link /test e2e-aws-ovn

Full PR test history. Your PR dashboard. Please help us cut down on flakes by linking to an open issue when you hit one in your PR.

Details

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

approved Indicates a PR has been approved by an approver from all required OWNERS files. lgtm Indicates that a PR is ready to be merged. size/M Denotes a PR that changes 30-99 lines, ignoring generated files.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants