@vrutkovs (Contributor) commented Aug 5, 2019

MCO image names have been updated, see openshift/machine-config-operator#1028

@openshift-ci-robot added the size/XS label Aug 5, 2019
@openshift-ci-robot added the approved and size/S labels and removed the size/XS label Aug 5, 2019
@vrutkovs force-pushed the mco-image-name branch 2 times, most recently from d5af79e to 6d01847 Aug 5, 2019 10:54
@vrutkovs changed the title from "Update MCO image name" to "Update MCO image name and entrypoint" Aug 5, 2019
@mtnbikenc (Member) left a comment

/lgtm

@openshift-ci-robot added the lgtm label Aug 5, 2019
@vrutkovs (Contributor, Author) commented Aug 5, 2019

/retest

@vrutkovs (Contributor, Author) commented Aug 5, 2019

Failing tests:
[Feature:Prometheus][Conformance] Prometheus when installed on the cluster should report less than two alerts in firing or pending state [Suite:openshift/conformance/parallel/minimal]
[sig-autoscaling] [HPA] Horizontal pod autoscaling (scale resource: CPU) [sig-autoscaling] ReplicationController light Should scale from 2 pods to 1 pod [Suite:openshift/conformance/parallel] [Suite:k8s] 

/retest

@openshift-bot commented

/retest

Please review the full test history for this PR and help us cut down flakes.

5 similar comments

@vrutkovs (Contributor, Author) commented Aug 6, 2019

Alerts: machine ci-op-mfwy9hl8-716a3-p8hcp-worker-us-east-1a-centos-x5ffc does not have valid node reference

@runcom any idea why this is happening to RHEL7 nodes?

@vrutkovs (Contributor, Author) commented Aug 6, 2019

/hold

until machine link alert is resolved

@openshift-ci-robot added the do-not-merge/hold label Aug 6, 2019
@mtnbikenc changed the title from "Update MCO image name and entrypoint" to "Bug 1737806: Update MCO image name and entrypoint" Aug 6, 2019
@openshift-ci-robot added the bugzilla/valid-bug label Aug 6, 2019
@openshift-ci-robot commented

@vrutkovs: This pull request references a valid Bugzilla bug. The bug has been moved to the POST state. The bug has been updated to refer to the pull request using the external bug tracker.

Details

In response to this:

Bug 1737806: Update MCO image name and entrypoint

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

  - name: Get machine-config-operator image
    command: >
-     oc get daemonset machine-config-daemon
+     oc get daemonset machine-config-operator
Member

this looks wrong:

12:37:51 [github.com/openshift/installer] ‹master*›  oc get daemonset machine-config-daemon
NAME                    DESIRED   CURRENT   READY   UP-TO-DATE   AVAILABLE   NODE SELECTOR            AGE
machine-config-daemon   6         6         6       6            6           kubernetes.io/os=linux   16h
14:41:52 [github.com/openshift/installer] ‹master*›  oc get daemonset machine-config-operator
Error from server (NotFound): daemonsets.extensions "machine-config-operator" not found

Contributor Author

Fixed

Contributor Author

Ah, right, the daemonset name didn't change
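
For readers following this thread, here is a minimal sketch (not code from the PR) of how a lookup like the one in this task is assembled. It keeps the DaemonSet name and the container name used in the jsonpath filter as separate inputs, which is the distinction the review turns on. The function name `build_get_image_command` and the kubeconfig path are hypothetical; the flags are the ones shown in this PR's diff, and the example values are the pre-change `machine-config-daemon` names.

```python
import shlex


def build_get_image_command(kubeconfig, daemonset, container,
                            namespace="openshift-machine-config-operator"):
    """Return the argv that reads one container's image from a DaemonSet.

    `daemonset` is the object name passed to `oc get daemonset`;
    `container` is the name matched inside the jsonpath filter.
    The two are independent and do not have to be equal.
    """
    jsonpath = (
        "{.spec.template.spec.containers"
        f'[?(@.name=="{container}")].image}}'
    )
    return [
        "oc", "get", "daemonset", daemonset,
        f"--config={kubeconfig}",
        f"--namespace={namespace}",
        f"--output=jsonpath={jsonpath}",
    ]


# Hypothetical kubeconfig path; names are the ones from the pre-change lines.
cmd = build_get_image_command(
    "/path/to/kubeconfig", "machine-config-daemon", "machine-config-daemon"
)
print(shlex.join(cmd))
```

Passing the two names separately makes it harder to rename one while meaning the other.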

      --config={{ openshift_node_kubeconfig_path }}
      --namespace=openshift-machine-config-operator
-     --output=jsonpath='{.spec.template.spec.containers[?(@.name=="machine-config-daemon")].image}'
+     --output=jsonpath='{.spec.template.spec.containers[?(@.name=="machine-config-operator")].image}'
Member

this as well (wrong)

Contributor Author

Fixed
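
To illustrate what the jsonpath filter in this thread selects, here is a rough Python equivalent run against a hand-written manifest fragment. This is a sketch under assumptions: `container_images` is a hypothetical helper, and the manifest, the `sidecar` container, and the `example.invalid` image references are made up for illustration, not taken from a real cluster.

```python
def container_images(daemonset, container_name):
    """Return the images of all containers in the pod template whose
    name equals `container_name` (mirrors the `[?(@.name=="...")]` filter)."""
    containers = daemonset["spec"]["template"]["spec"]["containers"]
    return [c["image"] for c in containers if c.get("name") == container_name]


# Made-up fragment of a DaemonSet object; image values are placeholders.
example = {
    "spec": {
        "template": {
            "spec": {
                "containers": [
                    {"name": "machine-config-daemon",
                     "image": "example.invalid/mcd:placeholder"},
                    {"name": "sidecar",
                     "image": "example.invalid/sidecar:placeholder"},
                ]
            }
        }
    }
}

print(container_images(example, "machine-config-daemon"))
print(container_images(example, "machine-config-operator"))
```

In this sketch a filter naming a container that is not in the pod template simply matches nothing, so a wrong container name shows up as an empty result rather than an error.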

@openshift-ci-robot added the size/XS label and removed the lgtm and size/S labels Aug 6, 2019
@mtnbikenc (Member) commented

Failing tests:

[Feature:Prometheus][Conformance] Prometheus when installed on the cluster should report less than two alerts in firing or pending state [Suite:openshift/conformance/parallel/minimal]

Opened bug for failing test:
https://bugzilla.redhat.com/show_bug.cgi?id=1738291

@mtnbikenc (Member) commented

I tested this PR manually and found that the nodes are not coming up.

On the new hosts I found:
- machine-config-daemon-host.service failing with "Failed to start Machine Config Daemon Initial."
- network.service failing with "Failed to start LSB: Bring up/down networking."

@cgwalters (Member) commented

> I tested this PR manually and found that the nodes are not coming up.

Can you be more precise about "not coming up"? Like, kubelet.service failing, nodes not joining the cluster, something else?

@mtnbikenc (Member) commented

nodes are not joining the cluster
kubelet.service is running but unable to find the node
I'll get kubelet logs tomorrow

@mtnbikenc (Member) commented

Manually tested with the most recent build and nodes joined the cluster.
/retest

@mtnbikenc (Member) commented

Alert issue resolved now that nodes are joining the cluster.
/hold cancel
/lgtm

@openshift-ci-robot added the lgtm label and removed the do-not-merge/hold label Aug 8, 2019
@openshift-ci-robot commented

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: mtnbikenc, vrutkovs

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Details Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@mtnbikenc (Member) commented

ssh load balancer was not deployed
/retest

@openshift-merge-robot merged commit 52e6680 into openshift:master Aug 8, 2019
@openshift-ci-robot commented

@vrutkovs: All pull requests linked via external trackers have merged. The Bugzilla bug has been moved to the MODIFIED state.

Details

In response to this:

Bug 1737806: Update MCO image name and entrypoint

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.
