@vrutkovs
Contributor

@vrutkovs vrutkovs commented Mar 10, 2020

- What I did

  • Updated the etcd-quorum-guard deployment to generate the readiness script once and to use the host IP and host name to find certificates in defined locations.

  • Removed hostNetwork: true from this deployment.
    With hostNetwork: true set, all network traffic was counted as container traffic, so some pods showed > 4 MBps network in/out in the console.

- How to verify it

Run a setup or an upgrade.
Check the network in/out data for the etcd-quorum-guard pods.

- Description for the changelog

Prepare the script once so that the readinessProbe command does not look up the certificate path on every run. This also uses the downward API to avoid searching for certificates and to fetch them from defined locations instead.
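
For readers unfamiliar with the downward API mentioned above, here is a minimal sketch of how a container can receive its host IP and node name as environment variables. It is not the actual diff from this PR: the container name and the variable names NODE_IP and NODE_NAME are assumptions chosen for illustration. The fieldPath values status.hostIP and spec.nodeName are standard Kubernetes downward API fields.

```yaml
# Illustrative fragment of a pod template, not the manifest from this PR.
spec:
  containers:
    - name: guard            # hypothetical container name
      env:
        # Assumed variable name; filled in with the IP of the node running the pod.
        - name: NODE_IP
          valueFrom:
            fieldRef:
              fieldPath: status.hostIP
        # Assumed variable name; filled in with the name of the node running the pod.
        - name: NODE_NAME
          valueFrom:
            fieldRef:
              fieldPath: spec.nodeName
```

With values like these available in the container, a readiness script could build certificate paths directly instead of searching for them on each probe.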
@openshift-ci-robot openshift-ci-robot added the size/M Denotes a PR that changes 30-99 lines, ignoring generated files. label Mar 10, 2020
@runcom
Member

runcom commented Mar 10, 2020

@hexfusion ptal

@hexfusion
Contributor

In general, if this works I am fine with it.

Avoid setting hostNetwork for etcd-quorum-guard
@kikisdeliveryservice
Contributor

PTAL: @alaypatel07 @retroflexer

@hexfusion
Contributor

Prow is down; waiting to retest.

@hexfusion
Contributor

/retest

@retroflexer

/retest

@vrutkovs
Contributor Author

Throttling

/retest

@vrutkovs
Contributor Author

/test e2e-vsphere
/test e2e-openstack
/test e2e-ovirt

@hexfusion
Contributor

Thanks @vrutkovs, great find. As per the Slack conversation, we are going to let this soak for a bit in 4.5 before we backport, just in case.

/lgtm

@openshift-ci-robot openshift-ci-robot added the lgtm Indicates that a PR is ready to be merged. label Mar 11, 2020
@openshift-ci-robot openshift-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Mar 11, 2020
@retroflexer

/lgtm

@openshift-ci-robot
Contributor

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: hexfusion, retroflexer, vrutkovs, yuqi-zhang

@openshift-bot
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

@vrutkovs
Copy link
Contributor Author

/hold

GCP upgrades are not passing anymore. Let's see if removing the namespace and rebuilding helps.

@openshift-ci-robot openshift-ci-robot added the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Mar 12, 2020
@vrutkovs
Contributor Author

/hold cancel
/retest

@openshift-ci-robot openshift-ci-robot removed the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Mar 12, 2020
@openshift-bot
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

@openshift-ci-robot
Contributor

@vrutkovs: The following test failed, say /retest to rerun all failed tests:

| Test name | Commit | Details | Rerun command |
| --- | --- | --- | --- |
| ci/prow/e2e-vsphere | 0c666f6 | link | /test e2e-vsphere |

Full PR test history. Your PR dashboard. Please help us cut down on flakes by linking to an open issue when you hit one in your PR.

@kikisdeliveryservice
Contributor

kikisdeliveryservice commented Mar 12, 2020

This vSphere test has passed once in 87 runs overall, so I really don't think we should keep running it. I will file a Bugzilla for it:

https://prow.svc.ci.openshift.org/job-history/origin-ci-test/pr-logs/directory/pull-ci-openshift-machine-config-operator-master-e2e-vsphere

/skip

@vrutkovs
Contributor Author

@jcpowermac PTAL, machine-api is unhappy in vsphere tests

@kikisdeliveryservice
Contributor

kikisdeliveryservice commented Mar 12, 2020

I opened BZ 1813026 to track vsphere e2e failures.

@hexfusion
Contributor

/retitle Bug 1825967: etcd quorum guard: don't set hostNetwork

@openshift-ci-robot openshift-ci-robot changed the title etcd quorum guard: don't set hostNetwork Bug 1825967: etcd quorum guard: don't set hostNetwork Apr 20, 2020
@openshift-ci-robot
Contributor

@vrutkovs: All pull requests linked via external trackers have merged: . Bugzilla bug 1825967 has been moved to the MODIFIED state.

In response to this:

Bug 1825967: etcd quorum guard: don't set hostNetwork

@vrutkovs vrutkovs deleted the etcd-quorum-no-hostnetwork branch September 16, 2020 22:29