Switch to using cluster-hosted Ironic for worker deployments #659

hardys wants to merge 4 commits into openshift-metal3:master from
Conversation
Build SUCCESS, see build http://10.8.144.11:8080/job/dev-tools/864/
Switch to using the new ironic + baremetal operator pod.

This means we can keep the OCP-specific pieces out of the upstream repo, and prototype the integration which will ultimately be handled via openshift/machine-api-operator#302. Also set the RHCOS image URL in the configmap based on the variable from common.sh.

Co-Authored-By: Ian Main <imain@redhat.com>
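For context, a minimal sketch of how the image URL might be templated into the configmap from dev-scripts. The variable name RHCOS_IMAGE_URL, the configmap name ironic-config, and the key rhcos_image_url are illustrative assumptions, not names taken from this PR:

```sh
#!/bin/bash
# Hypothetical sketch only: push the RHCOS image URL defined in common.sh
# into the configmap the ironic pod reads. All names here are assumed.
source common.sh   # assumed to define RHCOS_IMAGE_URL

oc --config ocp/auth/kubeconfig create configmap ironic-config \
  -n openshift-machine-api \
  --from-literal=rhcos_image_url="${RHCOS_IMAGE_URL}" \
  --dry-run -o yaml | oc --config ocp/auth/kubeconfig apply -f -
```

The create/apply pipeline keeps the step idempotent, so re-running the deploy script updates the value in place rather than failing on an existing configmap.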
Build SUCCESS, see build http://10.8.144.11:8080/job/dev-tools/866/
```sh
# Kill the dnsmasq container on the host since it is performing DHCP and doesn't
# allow our pod in openshift to take over.
for name in dnsmasq ironic-inspector ; do
```
What about the rest of the ironic containers?
If we kill all the containers we lose the ironic database, and cleanup will not work; e.g. `make clean` in dev-scripts will fail to remove the ironic-managed masters.
OK, expanding the comment would help. I imagine all of this is going away shortly anyway once the bootstrap VM ironic work lands.
Yeah, that's correct, but then we'll also need to modify `make clean` to not rely on the terraform cleanup (until we figure out how to reimplement destroy).
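To make the suggestion concrete, here is one way the expanded comment might read. The loop body with the podman invocation is an assumption about how dev-scripts manages these containers, not code from this PR:

```sh
# Kill only the dnsmasq and ironic-inspector containers: dnsmasq is doing DHCP
# on the host and would race with our pod in openshift, and inspection moves
# into the cluster. Deliberately leave the other ironic containers (and with
# them the ironic database) running, so that `make clean` can still ask
# ironic to tear down the ironic-managed masters.
for name in dnsmasq ironic-inspector ; do
    if sudo podman ps --format "{{.Names}}" | grep -q -x "$name"; then
        sudo podman kill "$name"
    fi
done
```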
```sh
POD_NAME=$(oc --config ocp/auth/kubeconfig get pods -n openshift-machine-api | grep metal3-baremetal-operator | cut -f 1 -d ' ')
```

```sh
# Make sure our pod is running.
echo "Waiting for baremetal-operator pod to become ready"
```
Why do we need to wait? It shouldn't be required to block here
Yeah it's probably optional. I haven't tested the whole thing without the wait yet though. I'll give it a go. On the other hand we could use this to catch errors early.
FWIW in my testing this was helpful when one of the containers went into CrashLoopBackoff, it meant I could start investigating when it became clear the pod was wedged and not starting correctly.
I don't have a strong opinion but for development having some verbose monitoring of the pod startup is probably no bad thing?
yeah I'm 50/50 on it. I do kinda like how it can catch an issue with the pod, but for CI it's probably not useful.
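One possible middle ground, sketched below, is a bounded wait: verbose enough to catch a wedged pod during development, but unable to hang a CI run forever. `oc wait` is a real subcommand; the name=metal3-baremetal-operator label selector is an assumption about how the deployment labels its pod:

```sh
# Block until the operator pod reports Ready, but give up after 5 minutes
# rather than wedging the run; the label selector is assumed.
oc --config ocp/auth/kubeconfig wait pod \
  -l name=metal3-baremetal-operator \
  -n openshift-machine-api \
  --for=condition=Ready --timeout=300s
```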
```sh
oc --config ocp/auth/kubeconfig adm --as system:admin policy add-scc-to-user privileged system:serviceaccount:openshift-machine-api:baremetal-operator
oc --config ocp/auth/kubeconfig apply -f ocp/deploy/operator_ironic.yaml -n openshift-machine-api
```
```sh
# Sadly I don't see a way to get this from the json..
```

```sh
oc get pod -l name=metal3-baremetal-operator -n openshift-machine-api -o jsonpath="{.items[0].metadata.name}"
```
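Applied to the POD_NAME line above, that suggestion would replace the grep/cut pipeline with a structured query, assuming the pod carries the name=metal3-baremetal-operator label used in the reviewer's command:

```sh
# jsonpath reads the pod name from the API object instead of parsing the
# human-readable table output, so it can't break on column layout changes.
POD_NAME=$(oc --config ocp/auth/kubeconfig get pod \
  -l name=metal3-baremetal-operator \
  -n openshift-machine-api \
  -o jsonpath="{.items[0].metadata.name}")
```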
@imain how do you want to proceed with this PR vs #635? I pushed this mainly to prove things in CI, but if you're happy with the approach of breaking the dependency on metal3-io/baremetal-operator#212, we could go ahead with this one? wdyt?
It's fine. It can stay for the dev scripts. A worker couldn't deploy until this is done anyway.
Testing with some additional patches to prove #635 in CI, cc @imain.