Skip to content

Conversation

@hexfusion
Copy link
Contributor

@hexfusion hexfusion commented Oct 29, 2019

This PR adds a flag to bootstrap allowing the cluster-etcd-operator image to be be used in the etcd-member spec. We are also using the value of the flag as conditional logic for the spec. Because we need to balance that with what is put in etcd toggling CEO will involve unsetting the flag and the value of the configmap.

We also add a readinessProbe to etcd-member container. etcd should not show ready in service until it can serve traffic.

@openshift-ci-robot openshift-ci-robot added do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. size/XL Denotes a PR that changes 500-999 lines, ignoring generated files. labels Oct 29, 2019
Copy link
Contributor

@kikisdeliveryservice kikisdeliveryservice left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

A few comments

@openshift-ci-robot openshift-ci-robot added size/L Denotes a PR that changes 100-499 lines, ignoring generated files. and removed size/XL Denotes a PR that changes 500-999 lines, ignoring generated files. labels Oct 29, 2019
@hexfusion hexfusion force-pushed the add_ceo_flag branch 2 times, most recently from 1bf123b to bd9fc6f Compare October 30, 2019 00:32
@hexfusion
Copy link
Contributor Author

using for test release for debug

/test e2e-aws

@hexfusion
Copy link
Contributor Author

FTR the failure has been isolated and will be resolved shortly

@kikisdeliveryservice
Copy link
Contributor

fyi: that aws-scaleup-rhel7 is basically broken. i filed a bz last night. so you can just:

/skip

@hexfusion hexfusion force-pushed the add_ceo_flag branch 7 times, most recently from 1feecc2 to bb2ca60 Compare November 1, 2019 02:10
@hexfusion
Copy link
Contributor Author

level=error msg="Error: Error creating DNS RecordSet: googleapi: Error 503: Backend Error, backendError"
level=error
level=error msg="  on ../tmp/openshift-install-025962353/dns/base.tf line 39, in resource \"google_dns_record_set\" \"etcd_a_nodes\":"
level=error msg="  39: resource \"google_dns_record_set\" \"etcd_a_nodes\" {"

/test e2e-gcp-upgrade

@hexfusion hexfusion changed the title WIP cmd/machine-config-operator: add cluster-etcd-operator flag to bootstrap cmd. cmd/machine-config-operator/bootstrap: add cluster-etcd-operator flag Nov 1, 2019
@openshift-ci-robot openshift-ci-robot removed the do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. label Nov 1, 2019
@hexfusion hexfusion force-pushed the add_ceo_flag branch 5 times, most recently from e0f6f00 to 5ade6a6 Compare November 1, 2019 12:42
An empty image field will mean CEO is not enabled and cluster is
bootstrapping 4.2 style. A lot left to do for making etcd-member
manifest file boot without 4.2 but this is a start.
@hexfusion
Copy link
Contributor Author

/cc @runcom @kikisdeliveryservice PTAL

@openshift-ci-robot
Copy link
Contributor

@hexfusion: GitHub didn't allow me to request PR reviews from the following users: PTAL.

Note that only openshift members and repo collaborators can review this PR, and authors cannot review their own PRs.

Details

In response to this:

/cc @runcom @kikisdeliveryservice PTAL

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

"etcd": "registry.svc.ci.openshift.org/openshift:etcd",
"infraImage": "registry.svc.ci.openshift.org/openshift:pod",
"kubeClientAgentImage": "registry.svc.ci.openshift.org/openshift:kube-client-agent",
"clusterEtcdOperatorImage": "",
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

to be clear this is intentional and will be added in a separate PR when we enable the operator.

@hexfusion
Copy link
Contributor Author

hexfusion commented Nov 1, 2019

mcd_test.go:555: pool worker didn't report updated to rendered-worker-a4935ca22d7431a704c741eba9e4d563: timed out waiting for the condition

Doesn't feel related as the error is reporting from worker which we should not of changed.

/retest

@alaypatel07
Copy link
Contributor

/lgtm

@openshift-ci-robot openshift-ci-robot added the lgtm Indicates that a PR is ready to be merged. label Nov 1, 2019
@hexfusion
Copy link
Contributor Author

level=info msg="Cluster operator authentication Progressing is Unknown with NoData: "

/test e2e-gcp-op

Copy link
Contributor

@kikisdeliveryservice kikisdeliveryservice left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

tests are kind of broken looking into that now..

but this looks good!

@openshift-ci-robot openshift-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Nov 1, 2019
@kikisdeliveryservice
Copy link
Contributor

/skip

@hexfusion
Copy link
Contributor Author

#1232 (comment)

FIPS is broken I guess?

@hexfusion
Copy link
Contributor Author

cc @cgwalters

@kikisdeliveryservice
Copy link
Contributor

hitting same error as here: #1232 (comment)

we are trying to get a fix

@kikisdeliveryservice
Copy link
Contributor

@cgwalters do you think we can do an override for this as it's blocking other PRs in the bunch?

@cgwalters
Copy link
Member

Yeah,
/approve
And I'm OK overriding, you want to pull the trigger?

@openshift-ci-robot
Copy link
Contributor

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: alaypatel07, cgwalters, hexfusion, kikisdeliveryservice

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Details Needs approval from an approver in each of these files:
  • OWNERS [cgwalters,kikisdeliveryservice]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@kikisdeliveryservice
Copy link
Contributor

/override ci/prow/e2e-gcp-op

@openshift-ci-robot
Copy link
Contributor

@kikisdeliveryservice: Overrode contexts on behalf of kikisdeliveryservice: ci/prow/e2e-gcp-op

Details

In response to this:

/override ci/prow/e2e-gcp-op

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@kikisdeliveryservice
Copy link
Contributor

thanks bot for re-running all the tests again, including the one we told you not to run...

/override ci/prow/e2e-gcp-op

@kikisdeliveryservice
Copy link
Contributor

i think it has to fail before i can override, ill check in later

@openshift-ci-robot
Copy link
Contributor

@hexfusion: The following tests failed, say /retest to rerun them all:

Test name Commit Details Rerun command
ci/prow/e2e-aws-scaleup-rhel7 492fac6 link /test e2e-aws-scaleup-rhel7
ci/prow/e2e-gcp-op 492fac6 link /test e2e-gcp-op

Full PR test history. Your PR dashboard. Please help us cut down on flakes by linking to an open issue when you hit one in your PR.

Details

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here.

@kikisdeliveryservice
Copy link
Contributor

/override ci/prow/e2e-gcp-op

@openshift-ci-robot
Copy link
Contributor

@kikisdeliveryservice: Overrode contexts on behalf of kikisdeliveryservice: ci/prow/e2e-gcp-op

Details

In response to this:

/override ci/prow/e2e-gcp-op

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

approved Indicates a PR has been approved by an approver from all required OWNERS files. lgtm Indicates that a PR is ready to be merged. size/L Denotes a PR that changes 100-499 lines, ignoring generated files.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

6 participants