Bug 1803697: Changing OVN DB readinessProbe to check membership status instead of DB status #466

Merged
openshift-merge-robot merged 1 commit into openshift:master from alexanderConstantinescu:bugfix/fix-readinessprobes
Feb 24, 2020

@alexanderConstantinescu
Contributor

This PR fixes the recently changed readinessProbe for nbdb and sbdb to check each pod's membership status instead of the general cluster status.

Some background on this change, to make the implemented check easier to understand:

We should check the Raft cluster status using ovs-appctl -t /var/run/openvswitch/ovnnb_db.ctl cluster/status OVN_Northbound, which outputs:

sh-4.2#  ovs-appctl -t /var/run/openvswitch/ovnnb_db.ctl  cluster/status OVN_Northbound       
f782
Name: OVN_Northbound
Cluster ID: f72d (f72d04b2-f90e-4ce0-98a4-3636e668d1e7)
Server ID: f782 (f7826705-a5a9-4e4c-b2a0-935da3d72743)
Address: ssl:10.0.146.224:9643
Status: cluster member
Role: follower
Term: 3
Leader: 1b1b
Vote: 1b1b

Election timer: 1000
Log: [2, 1699]
Entries not yet committed: 0
Entries not yet applied: 0
Connections: ->0368 ->1b1b <-1b1b <-0368
Servers:
    0368 (0368 at ssl:10.0.137.205:9643)
    f782 (f782 at ssl:10.0.146.224:9643) (self)
    1b1b (1b1b at ssl:10.0.134.97:9643)

If the pod is not a member of the cluster (because it has left, or for any other reason), the same command returns the following instead:

sh-4.2#  ovs-appctl -t /var/run/openvswitch/ovnnb_db.ctl  cluster/status OVN_Northbound
unknown cluster
ovs-appctl: /var/run/openvswitch/ovnnb_db.ctl: server returned an error
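
To make the difference between the two outputs above concrete, here is a minimal sketch of a membership check. It is not the probe from this PR's diff: the function names is_cluster_member and check_db_membership are hypothetical, and the sketch assumes that a healthy member always prints a "Status: cluster member" line and that ovs-appctl exits non-zero when the server returns an error, as the second output suggests. The control socket path and database name default to the nbdb values shown above; the sbdb equivalents would have to be passed in by the caller.

```shell
#!/bin/sh
# Hypothetical helper (not taken from the PR diff): succeeds only when the
# cluster/status text on stdin reports this server as a cluster member.
is_cluster_member() {
    grep -q '^Status: cluster member'
}

# Hypothetical probe entry point. The defaults are the control socket and
# database name shown in the PR description; pass other values for sbdb.
check_db_membership() {
    ctl="${1:-/var/run/openvswitch/ovnnb_db.ctl}"
    db="${2:-OVN_Northbound}"
    # Assumption: a non-member makes ovs-appctl fail ("unknown cluster"),
    # so a failed command or a missing Status line both mean "not ready".
    status="$(ovs-appctl -t "$ctl" cluster/status "$db" 2>/dev/null)" || return 1
    printf '%s\n' "$status" | is_cluster_member
}
```

A readinessProbe exec command could then call check_db_membership and rely on its exit status: 0 marks the pod ready, anything else marks it unready.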

/assign @dcbw

@openshift-ci-robot added the size/XS label (Denotes a PR that changes 0-9 lines, ignoring generated files.) on Feb 4, 2020

@pecameron
Contributor

/test e2e-aws-ovn
/test e2e-gcp-ovn

@alexanderConstantinescu
Contributor Author

/retest

@alexanderConstantinescu
Contributor Author

/test e2e-aws-ovn

@alexanderConstantinescu
Contributor Author

/test e2e-gcp-ovn

@alexanderConstantinescu
Contributor Author

/test e2e-gcp
/test e2e-gcp-ovn

@alexanderConstantinescu
Contributor Author

/test e2e-gcp-ovn

@dcbw
Contributor

dcbw commented Feb 12, 2020

Terraform GCP failures....

/test e2e-gcp-ovn

@dcbw
Contributor

dcbw commented Feb 12, 2020

/lgtm

@openshift-ci-robot added the lgtm label (Indicates that a PR is ready to be merged.) on Feb 12, 2020

@dcbw
Contributor

dcbw commented Feb 12, 2020

@alexanderConstantinescu this needs a Bugzilla bug with a 4.4.0 target...

@openshift-ci-robot
Contributor

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: alexanderConstantinescu, dcbw

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Details Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@openshift-ci-robot added the approved label (Indicates a PR has been approved by an approver from all required OWNERS files.) on Feb 12, 2020

@openshift-bot
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

@danwinship
Contributor

/hold
to stop the bot while CI is broken; this needs a Bugzilla bug before it can merge anyway

@openshift-ci-robot added the do-not-merge/hold label (Indicates that a PR should not merge because someone has issued a /hold command.) on Feb 13, 2020

@alexanderConstantinescu
Contributor Author

/retitle Bug 1803697: Changing OVN DB readinessProbe to check membership status instead of DB status

/bugzilla refresh

@openshift-ci-robot
Contributor

@alexanderConstantinescu: No Bugzilla bug is referenced in the title of this pull request.
To reference a bug, add 'Bug XXX:' to the title of this pull request and request another bug refresh with /bugzilla refresh.

In response to this:

/retitle Bug 1803697: Changing OVN DB readinessProbe to check membership status instead of DB status

/bugzilla refresh

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@openshift-ci-robot changed the title from "Changing OVN DB readinessProbe to check membership status instead of DB status" to "Bug 1803697: Changing OVN DB readinessProbe to check membership status instead of DB status" on Feb 17, 2020
@openshift-ci-robot added the bugzilla/valid-bug label (Indicates that a referenced Bugzilla bug is valid for the branch this PR is targeting.) on Feb 17, 2020

@openshift-ci-robot
Contributor

@alexanderConstantinescu: This pull request references Bugzilla bug 1803697, which is valid. The bug has been moved to the POST state. The bug has been updated to refer to the pull request using the external bug tracker.

In response to this:

Bug 1803697: Changing OVN DB readinessProbe to check membership status instead of DB status

@alexanderConstantinescu
Contributor Author

/test e2e-gcp-ovn

@alexanderConstantinescu
Contributor Author

And what if this works?

/test e2e-aws-ovn

@danwinship
Contributor

/hold cancel

@openshift-ci-robot removed the do-not-merge/hold label (Indicates that a PR should not merge because someone has issued a /hold command.) on Feb 24, 2020

@openshift-bot
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

@openshift-ci-robot
Contributor

openshift-ci-robot commented Feb 24, 2020

@alexanderConstantinescu: The following test failed, say /retest to rerun all failed tests:

Test name | Commit | Details | Rerun command
ci/prow/e2e-gcp-ovn | 31476cc | link | /test e2e-gcp-ovn

Full PR test history. Your PR dashboard. Please help us cut down on flakes by linking to an open issue when you hit one in your PR.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here.

@openshift-merge-robot merged commit 3bda376 into openshift:master on Feb 24, 2020

@openshift-ci-robot
Contributor

@alexanderConstantinescu: All pull requests linked via external trackers have merged. Bugzilla bug 1803697 has been moved to the MODIFIED state.

In response to this:

Bug 1803697: Changing OVN DB readinessProbe to check membership status instead of DB status

Labels

approved: Indicates a PR has been approved by an approver from all required OWNERS files.
bugzilla/valid-bug: Indicates that a referenced Bugzilla bug is valid for the branch this PR is targeting.
lgtm: Indicates that a PR is ready to be merged.
size/XS: Denotes a PR that changes 0-9 lines, ignoring generated files.

7 participants