OCPBUGS-55317: Check capabilities before watching OLM resource #1232
@Miciah: This pull request references Jira Issue OCPBUGS-55317, which is invalid:
The bug has been updated to refer to the pull request using the external bug tracker.
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository. |
|
/jira refresh |
|
@Miciah: This pull request references Jira Issue OCPBUGS-55317, which is invalid:
|
/jira refresh |
|
@Miciah: This pull request references Jira Issue OCPBUGS-55317, which is valid. The bug has been moved to the POST state. 3 validation(s) were run on this bug
No GitHub users were found matching the public email listed for the QA contact in Jira (iamin@redhat.com), skipping review request.
|
/test ci-4.19-e2e-aws-ovn-no-capabilities |
|
@rhamini3: The specified target(s) for The following commands are available to trigger optional jobs: Use
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. |
c018b1e to 195381f
|
@Miciah: This pull request references Jira Issue OCPBUGS-55317, which is valid. 3 validation(s) were run on this bug
No GitHub users were found matching the public email listed for the QA contact in Jira (iamin@redhat.com), skipping review request.
|
https://github.com/openshift/cluster-ingress-operator/compare/c018b1e2288c5a4127e8842662d4691bee15be6d..195381f489dfe6525117a05c8855e2deee29b560 changes the gatewayapi controller not to start the gatewayclass, gateway-service-dns, or gateway-labeler controllers if the marketplace and OLM capabilities are not enabled. As @alebedev87 pointed out to me, the gatewayclass controller watches subscriptions and installplans, both of which are OLM resources. |
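The gating described in this comment can be sketched roughly as follows. This is a minimal illustration and not the code from the PR: `hasCapabilities` and `controllersToStart` are hypothetical helpers, and the list of enabled capabilities is assumed to have been read already (on a real cluster it would typically come from the ClusterVersion status).

```go
package main

import "fmt"

// Capability names as quoted in the discussion.
const (
	capMarketplace = "marketplace"
	capOLM         = "OperatorLifecycleManager"
)

// hasCapabilities reports whether every capability in required
// appears in enabled. (Hypothetical helper, not from the PR.)
func hasCapabilities(enabled []string, required ...string) bool {
	set := make(map[string]struct{}, len(enabled))
	for _, c := range enabled {
		set[c] = struct{}{}
	}
	for _, r := range required {
		if _, ok := set[r]; !ok {
			return false
		}
	}
	return true
}

// controllersToStart returns the names of the dependent controllers
// that are safe to start. It returns nil unless both capabilities are
// enabled, because those controllers would otherwise try to watch
// resources that do not exist. (Hypothetical helper.)
func controllersToStart(enabled []string) []string {
	if !hasCapabilities(enabled, capMarketplace, capOLM) {
		return nil
	}
	return []string{"gatewayclass", "gateway-service-dns", "gateway-labeler"}
}

func main() {
	fmt.Println(controllersToStart([]string{capMarketplace, capOLM}))
	fmt.Println(controllersToStart([]string{capOLM}))
}
```

Requiring both capabilities matches the conservative choice discussed in the review thread; relaxing the check to only "OperatorLifecycleManager" would be a one-argument change to the `hasCapabilities` call.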
// The subscriptions resource only exists if the "marketplace" and
// "OperatorLifecycleManager" capabilities are enabled. We cannot
Only the OperatorLifecycleManager capability owns the Subscription CRD. The marketplace capability helps with the management of the "out of the box" catalog sources.
So it seems the operator could check just the "OperatorLifecycleManager" capability. The operator would then watch subscriptions if that capability were enabled but the "marketplace" capability weren't, which would let the cluster-admin manage the catalog sources manually. Is that something we should support?
> Is that something we should support?
I don't know. In the context of this bug, I would rather stay on the safe side, which is requiring both capabilities to be enabled. Support for custom catalog sources can be raised as a separate requirement.
My initial remark was about the phrasing of the comment.
Ah, I understand now. I have fixed the comments in https://github.com/openshift/cluster-ingress-operator/compare/d3f3c9080be8af794c17318ceddfa4334dca8214..30dda056da6c3184b4ac60c983b2f1cda7d7b169.
Should we also guard the retrieval of the subscription object in getOperatorState() when the OLM capability is disabled?
Right, we should. The periodic job run by Hongan failed during installation with this error (log file):
2025-05-13T07:29:00.307Z ERROR operator.init controller/controller.go:288 Reconciler error {"controller": "status_controller", "object": {"name":"default","namespace":"openshift-ingress-operator"}, "namespace": "openshift-ingress-operator", "name": "default", "reconcileID": "f0b5ca1b-4b3b-4273-8624-8725c5422b09", "error": "failed to get operator state: failed to get subscription \"openshift-operators/servicemeshoperator3\": failed to get restmapping: no matches for kind \"Subscription\" in version \"operators.coreos.com/v1alpha1\""}
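A guard of the kind discussed here could look roughly like the sketch below. It is only an illustration: the real signature of getOperatorState() is not shown in this conversation, so `getOperatorStateSketch`, `operatorState`, `subscriptionGetter`, and the `olmCapabilitiesEnabled` flag are all hypothetical stand-ins, and the lookup is abstracted behind a function so the example runs without a cluster.

```go
package main

import (
	"errors"
	"fmt"
)

// operatorState is a hypothetical, trimmed-down stand-in for the
// state that the status controller computes.
type operatorState struct {
	haveOSSMSubscription bool
}

// subscriptionGetter abstracts the subscription lookup (hypothetical).
// It returns whether the subscription was found.
type subscriptionGetter func(namespace, name string) (bool, error)

// errNoMatch imitates the restmapping failure from the log above.
var errNoMatch = errors.New(`no matches for kind "Subscription" in version "operators.coreos.com/v1alpha1"`)

// getOperatorStateSketch only looks up the subscription when the
// required capabilities are enabled; otherwise the Subscription kind
// is not served and the lookup would fail.
func getOperatorStateSketch(olmCapabilitiesEnabled bool, get subscriptionGetter) (operatorState, error) {
	var state operatorState
	if !olmCapabilitiesEnabled {
		// Skip the lookup entirely and report "no subscription".
		return state, nil
	}
	found, err := get("openshift-operators", "servicemeshoperator3")
	if err != nil {
		return state, fmt.Errorf("failed to get subscription: %w", err)
	}
	state.haveOSSMSubscription = found
	return state, nil
}

func main() {
	// A getter that fails the way a cluster without OLM would.
	failing := func(namespace, name string) (bool, error) { return false, errNoMatch }

	_, err := getOperatorStateSketch(true, failing)
	fmt.Println("without guard:", err)

	_, err = getOperatorStateSketch(false, failing)
	fmt.Println("with guard:", err)
}
```

With the guard in place, a cluster without the capabilities yields an empty state instead of a reconciler error.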
|
@rhamini3: |
|
/payload-job periodic-ci-openshift-release-master-ci-4.20-e2e-aws-ovn-no-capabilities |
|
@lihongan: trigger 1 job(s) for the /payload-(with-prs|job|aggregate|job-with-prs|aggregate-with-prs) command
See details on https://pr-payload-tests.ci.openshift.org/runs/ci/4c9a0040-2fc1-11f0-9ec8-e921e9a09dce-0 |
|
@lihongan: This PR was included in a payload test run from openshift/origin#29784
See details on https://pr-payload-tests.ci.openshift.org/runs/ci/20550170-2fc5-11f0-9635-4ac681776cf3-0 |
195381f to d3f3c90
|
/payload-job periodic-ci-openshift-release-master-ci-4.20-e2e-aws-ovn-no-capabilities |
|
@Miciah: trigger 1 job(s) for the /payload-(with-prs|job|aggregate|job-with-prs|aggregate-with-prs) command
See details on https://pr-payload-tests.ci.openshift.org/runs/ci/699e67c0-2ff4-11f0-894c-248eefb7cf18-0 |
|
/assign |
The results have 77 failures, but it's a known issue that many things break with no-capabilities. Of interest to us, all the "[OCPFeatureGate:GatewayAPI]" tests passed, and all the "[OCPFeatureGate:GatewayAPIController]" tests failed, as expected. (openshift/origin#29784 should change the failures to skips.) |
|
/payload-job periodic-ci-openshift-release-master-ci-4.20-e2e-aws-ovn-no-capabilities |
|
@Miciah: trigger 1 job(s) for the /payload-(with-prs|job|aggregate|job-with-prs|aggregate-with-prs) command
See details on https://pr-payload-tests.ci.openshift.org/runs/ci/6189b7e0-3014-11f0-9474-41ad5f91c087-0 |
|
/assign |
Again, the results have 77 failures, all the "[OCPFeatureGate:GatewayAPI]" tests passed, and all the "[OCPFeatureGate:GatewayAPIController]" tests failed, as expected. |
|
/payload-job periodic-ci-openshift-openshift-tests-private-release-4.20-multi-nightly-aws-upi-basecap-none-arm-f7
Trying to trigger one QE job. |
|
@lihongan: trigger 1 job(s) for the /payload-(with-prs|job|aggregate|job-with-prs|aggregate-with-prs) command
See details on https://pr-payload-tests.ci.openshift.org/runs/ci/808ee6b0-306e-11f0-956d-80de5ebb75ad-0 |
|
/payload-job periodic-ci-openshift-openshift-tests-private-release-4.20-multi-nightly-aws-upi-basecap-none-arm-f7 Hit |
|
@lihongan: trigger 1 job(s) for the /payload-(with-prs|job|aggregate|job-with-prs|aggregate-with-prs) command
See details on https://pr-payload-tests.ci.openshift.org/runs/ci/0f155170-308e-11f0-8ef2-78e7e4411319-0 |
|
/lgtm Holding for @candita to take a look. |
|
/retest-required |
|
[APPROVALNOTIFIER] This PR is APPROVED
This pull-request has been approved by: alebedev87
The full list of commands accepted by this bot can be found here. The pull request process is described here.
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
|
/payload-job periodic-ci-openshift-openshift-tests-private-release-4.20-arm64-nightly-aws-ipi-disc-priv-sts-basecap-none-mini-perm-f14 |
|
@lihongan: trigger 1 job(s) for the /payload-(with-prs|job|aggregate|job-with-prs|aggregate-with-prs) command
See details on https://pr-payload-tests.ci.openshift.org/runs/ci/5a2f0260-30a6-11f0-83ee-b1fa8d4a286c-0 |
|
@Miciah: The following tests failed, say
Full PR test history. Your PR dashboard. |
|
/test e2e-aws-ovn-serial
The job timed out; openshift/release#64896 should help with the timeout issues. |
|
/unhold
Given the urgency of this fix, a single LGTM is enough. We can follow up later. |
|
e2e-aws-operator-techpreview failed because |
|
e2e-aws-ovn-single-node failed because I filed OCPBUGS-56196 for this issue. |
f0e62c3 into openshift:master
|
@Miciah: Jira Issue OCPBUGS-55317: Some pull requests linked via external trackers have merged: The following pull requests linked via external trackers have not merged:
These pull requests must merge or be unlinked from the Jira bug in order for it to move to the next state. Once unlinked, request a bug refresh with
Jira Issue OCPBUGS-55317 has not been moved to the MODIFIED state.
|
/cherry-pick release-4.19 |
|
@Miciah: new pull request created: #1233
|
[ART PR BUILD NOTIFIER] Distgit: ose-cluster-ingress-operator |
Only try to watch subscriptions and installplans if the "marketplace" and "OperatorLifecycleManager" capabilities are enabled.
If these capabilities are not enabled, the installplans and subscriptions resources do not exist. Before this change, the operator would fail to initialize when these capabilities were not enabled, because the status and gatewayclass controllers would try, and fail, to watch the nonexistent resources.