-
Notifications
You must be signed in to change notification settings - Fork 462
Add Nutanix Platform to Machine Config Operator #2942
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add Nutanix Platform to Machine Config Operator #2942
Conversation
|
/test verify |
6fbcb47 to
32abbf2
Compare
|
|
Trying to narrow down the offending commit. The unit tests don't fail when bumping the API dependency to 58db72a40994aaa03e79fd9d7ec791987a5fb1dc. However, when bumping it to the the commit next in chronological order de2aae00abd3f89af713a9c718eeb0ad1a12905c they start failing. Failing tests: |
32abbf2 to
e270c7d
Compare
|
/retest |
kikisdeliveryservice
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
would you mind squashing your newest commit(abca3c7e6251c0a8e86ab898b9619ec70118acca) into the prev one(e270c7d49fa9a3c5c108dfed3df0ef1652773cad)
|
adding a hold to let us get the api dependency sorted out between the 2 prs. once https://github.com/openshift/machine-config-operator/pull/2949/files lands shortly we can update this one. /hold |
abca3c7 to
4ef1ea6
Compare
kikisdeliveryservice
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for the updates @thunderboltsid
|
Since they'll be approvers for this platform: @yanhua121 @adiantum can you PTAL |
|
/retest |
|
/retest-required |
1 similar comment
|
/retest-required |
/lgtm |
|
@adiantum: changing LGTM is restricted to collaborators DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
|
/retest-required Please review the full test history for this PR and help us cut down flakes. |
9 similar comments
|
/retest-required Please review the full test history for this PR and help us cut down flakes. |
|
/retest-required Please review the full test history for this PR and help us cut down flakes. |
|
/retest-required Please review the full test history for this PR and help us cut down flakes. |
|
/retest-required Please review the full test history for this PR and help us cut down flakes. |
|
/retest-required Please review the full test history for this PR and help us cut down flakes. |
|
/retest-required Please review the full test history for this PR and help us cut down flakes. |
|
/retest-required Please review the full test history for this PR and help us cut down flakes. |
|
/retest-required Please review the full test history for this PR and help us cut down flakes. |
|
/retest-required Please review the full test history for this PR and help us cut down flakes. |
|
/retest-required |
|
/retest-required Please review the full test history for this PR and help us cut down flakes. |
3 similar comments
|
/retest-required Please review the full test history for this PR and help us cut down flakes. |
|
/retest-required Please review the full test history for this PR and help us cut down flakes. |
|
/retest-required Please review the full test history for this PR and help us cut down flakes. |
|
There are some IAM issues that are causing some of these jobs to fail. I'll go investigate and put a hold on this for now to avoid us all getting spammed with retests and failures. /hold |
|
/skip |
|
/hold cancel |
|
/test e2e-aws |
|
/retest-required Please review the full test history for this PR and help us cut down flakes. |
1 similar comment
|
/retest-required Please review the full test history for this PR and help us cut down flakes. |
|
@thunderboltsid: The following tests failed, say
Full PR test history. Your PR dashboard. DetailsInstructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here. |
|
/retest-required Please review the full test history for this PR and help us cut down flakes. |
1 similar comment
|
/retest-required Please review the full test history for this PR and help us cut down flakes. |
Discussion in [1] has some details, but trying to unpack "all on-prem IPI except baremetal IPI" into specifics, [2] is in an on-prem directory configuring keepalived, and it switches on onPremPlatformAPIServerInternalIP for enabled vs. disabled. onPremPlatformAPIServerInternalIP is true (enabling the keepalived configuration) for: * BareMetal (4.10 [3] and 4.11 [4]) * oVirt (4.10 [3] and 4.11 [4]) * OpenStack (4.10 [3] and 4.11 [4]) * VSphere (4.10 [3] and 4.11 [4]), * KubeVirt (4.10 [3], dropped in 4.11 [4,5]) * Nutanix (new in 4.11 [4,6,7]). Before 4.11, ENABLE_UNICAST was conditional on onPremPlatformKeepalivedEnableUnicast [8], but since 4.11, it has always been 'yes' [9]. The platforms that were unicast on 4.10's onPremPlatformKeepalivedEnableUnicast were BareMetal and KubeVirt [10]. Putting this all together, AWS and other platforms that don't match the onPremPlatformAPIServerInternalIP logic aren't impacted, because they don't enable the keepalived configuration. BareMetal is not impacted by 4.10-to-4.11 updates, because any to-unicast transition issues will already have been resolved by 4.10. Remaining onPremPlatformAPIServerInternalIP platforms which occur in both 4.10 and 4.11 are interested, and I match them here. Generated by writing the 4.11.0 declaration by hand, and then copying out to other 4.11 releases with: $ curl -s 'https://api.openshift.com/api/upgrades_info/graph?channel=candidate-4.11' | jq -r '.nodes[].version' | grep '^4[.]11[.]' | grep -v '^4[.]11[.]0$' | while read V; do sed "s/4[.]11[.]0/${V}/g" blocked-edges/4.11.0-KeepalivedMulticastSkew.yaml > "blocked-edges/${V}-KeepalivedMulticastSkew.yaml"; done $ git add blocked-edges/4.11.*KeepalivedMulticastSkew.yaml [1]: https://issues.redhat.com/browse/OPNET-296 [2]: https://github.com/openshift/machine-config-operator/blame/8fa0b7e8903226b3cfb76e6c6f49409cfc0dd0e7/templates/common/on-prem/files/keepalived.yaml#L2 [3]: https://github.com/openshift/machine-config-operator/blob/afb47c916680dd5870e48e5c9cf819f59e12ff4d/pkg/operator/render.go#L282-L294 [4]: https://github.com/openshift/machine-config-operator/blob/8fa0b7e8903226b3cfb76e6c6f49409cfc0dd0e7/pkg/operator/render.go#L282-L294 [5]: openshift/machine-config-operator#3084 [6]: openshift/machine-config-operator#2942 [7]: https://docs.openshift.com/container-platform/4.11/release_notes/ocp-4-11-release-notes.html#ocp-4-11-nutanix [8]: https://github.com/openshift/machine-config-operator/blob/afb47c916680dd5870e48e5c9cf819f59e12ff4d/templates/common/on-prem/files/keepalived.yaml#L155-L156 [9]: openshift/machine-config-operator@84d0bae#diff-c4a27bc4c14847dd581f495e992f67cf49b430644e8f113aabfa879de076564dL156 [10]: https://github.com/openshift/machine-config-operator/blob/afb47c916680dd5870e48e5c9cf819f59e12ff4d/pkg/operator/render.go#L249-L250
Discussion in [1] has some details, but trying to unpack "all on-prem IPI except baremetal IPI" into specifics, [2] is in an on-prem directory configuring keepalived, and it switches on onPremPlatformAPIServerInternalIP for enabled vs. disabled. onPremPlatformAPIServerInternalIP is true (enabling the keepalived configuration) for: * BareMetal (4.10 [3] and 4.11 [4]) * oVirt (4.10 [3] and 4.11 [4]) * OpenStack (4.10 [3] and 4.11 [4]) * VSphere (4.10 [3] and 4.11 [4]), * KubeVirt (4.10 [3], dropped in 4.11 [4,5]) * Nutanix (new in 4.11 [4,6,7]). Before 4.11, ENABLE_UNICAST was conditional on onPremPlatformKeepalivedEnableUnicast [8], but since 4.11, it has always been 'yes' [9]. The platforms that were unicast on 4.10's onPremPlatformKeepalivedEnableUnicast were BareMetal and KubeVirt [10]. Putting this all together, AWS and other platforms that don't match the onPremPlatformAPIServerInternalIP logic aren't impacted, because they don't enable the keepalived configuration. BareMetal is not impacted by 4.10-to-4.11 updates, because any to-unicast transition issues will already have been resolved by 4.10. Remaining onPremPlatformAPIServerInternalIP platforms which occur in both 4.10 and 4.11 are interested, and I match them here. Generated by writing the 4.11.0 declaration by hand, and then copying out to other 4.11 releases with: $ curl -s 'https://api.openshift.com/api/upgrades_info/graph?channel=candidate-4.11' | jq -r '.nodes[].version' | grep '^4[.]11[.]' | grep -v '^4[.]11[.]0$' | while read V; do sed "s/4[.]11[.]0/${V}/g" blocked-edges/4.11.0-KeepalivedMulticastSkew.yaml > "blocked-edges/${V}-KeepalivedMulticastSkew.yaml"; done $ git add blocked-edges/4.11.*KeepalivedMulticastSkew.yaml [1]: https://issues.redhat.com/browse/OPNET-296 [2]: https://github.com/openshift/machine-config-operator/blame/8fa0b7e8903226b3cfb76e6c6f49409cfc0dd0e7/templates/common/on-prem/files/keepalived.yaml#L2 [3]: https://github.com/openshift/machine-config-operator/blob/afb47c916680dd5870e48e5c9cf819f59e12ff4d/pkg/operator/render.go#L282-L294 [4]: https://github.com/openshift/machine-config-operator/blob/8fa0b7e8903226b3cfb76e6c6f49409cfc0dd0e7/pkg/operator/render.go#L282-L294 [5]: openshift/machine-config-operator#3084 [6]: openshift/machine-config-operator#2942 [7]: https://docs.openshift.com/container-platform/4.11/release_notes/ocp-4-11-release-notes.html#ocp-4-11-nutanix [8]: https://github.com/openshift/machine-config-operator/blob/afb47c916680dd5870e48e5c9cf819f59e12ff4d/templates/common/on-prem/files/keepalived.yaml#L155-L156 [9]: openshift/machine-config-operator@84d0bae#diff-c4a27bc4c14847dd581f495e992f67cf49b430644e8f113aabfa879de076564dL156 [10]: https://github.com/openshift/machine-config-operator/blob/afb47c916680dd5870e48e5c9cf819f59e12ff4d/pkg/operator/render.go#L249-L250
Update controllerconfig CRD and relevant switch statements
in pkg to handle Nutanix platform.