Revert "Bump sushy-tools and allow to ignore boot device to fix CI" #609
maelk merged 1 commit into metal3-io:master
|
[APPROVALNOTIFIER] This PR is NOT APPROVED. This pull-request has been approved by: furkatgofurov7. The full list of commands accepted by this bot can be found here. Needs approval from an approver in each of these files: Approvers can indicate their approval by writing |
|
/cc @hardys |
|
The tests might fail in this PR also as master jobs are failing, so we might need to force merge it. |
|
Force merging to unblock the CI |
|
I think the way the sushy-tools image is built in metal3-dev-env should change. Currently, the integration tests always pull the latest image from quay.io. That is a problem in a case like this patch: we bump the sushy-tools version in the Dockerfile, but the integration tests do not build a new image from the bumped version. They just pull latest from quay.io, so they pass against the image that is already published there. Once the patch is merged, quay automatically builds a new sushy-tools image with the new version from the Dockerfile (which was never actually tested in CI), and that new version might break the CI again in the future. |
|
Could you please let us know what exactly broke? Some logs maybe? Otherwise we won't fix the problem and it will resurface on the next update. |
|
It sounds like https://github.com/metal3-io/metal3-dev-env/pull/606/files#r595873052 may explain the problem - but before I propose a revised PR, how do we adjust the CI workflow to actually test the sushy-tools image defined in this repo? |
Thanks, sorry for the trouble - I did test it locally but not the deprovisioning phase unfortunately. Any thoughts on how we can adjust CI to catch this kind of issue in future? |
To adjust the CI workflow, we need to make sure that when we change the Dockerfile for sushy-tools or vbmc, for example, we build local images from it, push them to a local registry, and use those in the CI workflow. Since changes in this directory are infrequent, CI currently doesn't include this. |
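To make the idea above concrete, here is a minimal sketch of how a CI job could decide between building the image locally and pulling it from quay.io. It is not the actual project-infra implementation; the function name `image_plan`, the directory `resources/sushy-tools`, the image name `quay.io/metal3-io/sushy-tools`, the `:ci` tag and the registry address `localhost:5000` are all assumptions chosen for illustration.

```python
from pathlib import PurePosixPath


def image_plan(changed_files, image_dir, remote_image, local_registry):
    """Return (image_ref, commands) for one container image in a CI run.

    If the patch touches anything under image_dir, plan a local build and
    a push to the local registry; otherwise plan a pull of the published
    image. All names passed in are supplied by the (hypothetical) CI job.
    """
    # Image name without registry/org prefix or tag, e.g. "sushy-tools".
    name = remote_image.rsplit("/", 1)[-1].split(":")[0]

    # True when at least one changed file lives under image_dir.
    touched = any(
        PurePosixPath(image_dir) in PurePosixPath(path).parents
        for path in changed_files
    )

    if touched:
        local_ref = f"{local_registry}/{name}:ci"  # ":ci" tag is an assumption
        return local_ref, [
            ["docker", "build", "-t", local_ref, image_dir],
            ["docker", "push", local_ref],
        ]
    return remote_image, [["docker", "pull", remote_image]]


if __name__ == "__main__":
    ref, commands = image_plan(
        changed_files=["resources/sushy-tools/Dockerfile"],  # hypothetical path
        image_dir="resources/sushy-tools",                   # hypothetical path
        remote_image="quay.io/metal3-io/sushy-tools",        # assumed image name
        local_registry="localhost:5000",                     # assumed registry
    )
    print(ref)
    for command in commands:
        print(" ".join(command))
```

The function only plans the commands and does not run them, so the decision logic can be checked without Docker. A real job would execute the returned commands and export the resulting image reference for the test run.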
PTAL here https://jenkins.nordix.org/view/Airship/job/airship_master_v1a3_integration_test_centos/544/, you'll find the logs in the zip file there. We didn't see anything in particular; we only saw the redfish BMHs waiting indefinitely in clean_wait in ironic. And the sushy-tools log had a warning - |
@dtantsur while deprovisioning, one of the BMHs, specifically one with a redfish BMC (not ipmi), got stuck in the clean_wait state and never left it. The logs did not give much visibility except some error messages in the node serial logs: Since it was related to booting, we suspected that bumping sushy-tools could have broken it, and that was the case. |
Right, I think that's how we test patches to the Dockerfile in the ironic-image repo. Maybe we can do something similar here too? We will probably need to export a variable for sushy-tools in the project-infra repo, as we do for ironic-image. |
|
Let's continue the discussion on #610, because I think a closed PR is not the best place to discuss the problem. |
These container images are defined in the metal3-dev-env repo but currently aren't tested in CI, leading to potential regressions such as metal3-io/metal3-dev-env#609. We should always build/test these for metal3-dev-env CI, to catch issues before changes to the Dockerfiles land.
…to fix CI" This reverts metal3-io#609 and restores metal3-io#606 FIXME - testing CI with the original broken version, then the issue identified will be fixed before merging: metal3-io#606 (comment) This reverts commit 505dcf0.
…x CI" This reverts metal3-io#609 and restores metal3-io#606 Fix added for issue mentioned in: https://github.com/metal3-io/metal3-dev-env/pull/606/files#r595873052 This will also be tested by CI since: metal3-io/project-infra#175 This reverts commit 505dcf0.
This reverts metal3-io#609 and restores metal3-io#606 Fix added for issue mentioned in: https://github.com/metal3-io/metal3-dev-env/pull/606/files#r595873052 This will also be tested by CI since: metal3-io/project-infra#175 This reverts commit 505dcf0.
This is a revert of #606
We are struggling to deprovision the BMH with redfish as follows:
Ironic always hangs in clean_wait state for node-1: