NE-2183: Openshift conditions on Gateway API status #1871

rikatz · 2025-10-21T18:12:17Z

This enhancement proposal adds the Ingress Controller Conditions (LoadBalancerManaged, LoadBalancerReady, DNSManaged and DNSReady) to Gateway API resources that created with Openshift Gateway Class and on openshift-ingress namespace.

This proposal was partially generated with the help of Claude/AI

openshift-ci-robot · 2025-10-21T18:12:20Z

@rikatz: This pull request references NE-2183 which is a valid jira issue.

In response to this:

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository.

openshift-ci-robot · 2025-10-22T19:03:53Z

@rikatz: This pull request references NE-2183 which is a valid jira issue.

In response to this:

This enhancement proposal adds the Ingress Controller Conditions (LoadBalancerManaged, LoadBalancerReady, DNSManaged and DNSReady) to Gateway API resources that created with Openshift Gateway Class and on openshift-ingress namespace.

This proposal was partially generated with the help of Claude/AI

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository.

Miciah · 2025-10-30T14:47:50Z

/assign

candita · 2025-10-30T20:35:08Z

/assign

candita · 2025-10-30T20:36:31Z

/retest

rikatz · 2025-10-31T11:16:39Z

the error is valid, it is due to the metadata not containing real approvers and reviewers for now. Once I get some review and approval I can fix it

enhancements/ingress/add-dns-and-loadbalancer-conditions-to-managed-gateway.md

alebedev87 · 2025-11-06T15:01:03Z

Forgot to add a description to the review. I was about to look at openshift/cluster-ingress-operator#1294 but then recalled that there is an EP. So I started from the EP.

openshift-ci · 2025-11-17T19:27:10Z

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by:
Once this PR has been reviewed and has the lgtm label, please ask for approval from candita. For more information see the Code Review Process.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

enhancements/ingress/OWNERS

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

enhancements/ingress/add-dns-and-loadbalancer-conditions-to-managed-gateway.md

candita · 2025-11-18T19:23:45Z

enhancements/ingress/add-dns-and-loadbalancer-conditions-to-managed-gateway.md

+* Adding these conditions to user-managed Gateway resources outside the 
+`openshift-ingress` namespace
+* Modifying or changing existing IngressController condition behavior or semantics
+* Introducing custom condition types beyond DNS and LoadBalancer at this time


So this means we don't mark IngressController as Degraded if there are problems with the Gateway?

that's right, this proposal is just about adding conditions to the Gateway resource, not changing anything else

enhancements/ingress/add-dns-and-loadbalancer-conditions-to-managed-gateway.md

candita · 2025-11-18T19:29:00Z

enhancements/ingress/add-dns-and-loadbalancer-conditions-to-managed-gateway.md

+IngressController by:
+1. Creating a shared `pkg/resources/status` package with condition computation
+functions
+2. Refactoring existing IngressController status code to use this shared package


Does this mean we mark the IngressController as Degraded if DNSReady and/or LoadBalancerReady are false?

No, this means that we are moving the conditions function previously used by IngressController only to a common place that can also be reused by Gateway API. It is about the condition calculation functions (given DNSRecord, LoadBalancer service, etc what should be the Gateway resource conditions) but we don't touch IngressController behavior

enhancements/ingress/add-dns-and-loadbalancer-conditions-to-managed-gateway.md

candita · 2025-11-18T19:34:12Z

enhancements/ingress/add-dns-and-loadbalancer-conditions-to-managed-gateway.md

+3. Cloud Provider API provisions the LoadBalancer successfully
+4. LoadBalancer service status is updated with external IP/hostname
+5. Cluster Ingress Operator detects the Gateway resource and begins reconciliation
+6. Cluster Ingress Operator initiates DNS record provisioning through its own dns controller


nit: There is a Gateway API dns record creater controller alongside the cluster ingress one.

right, and it is on Cluster Ingress Operator. Am I missing something more explicit here? (like the package name?)

enhancements/ingress/add-dns-and-loadbalancer-conditions-to-managed-gateway.md

openshift-ci · 2025-11-18T19:37:45Z

@rikatz: all tests passed!

Full PR test history. Your PR dashboard.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here.

candita · 2025-11-18T19:38:28Z

enhancements/ingress/add-dns-and-loadbalancer-conditions-to-managed-gateway.md

+zone, quota exceeded, provider API error)
+4. Cluster Ingress Operator DNS controller reports failure status in the 
+DNSRecord resource
+5. Gateway Status Controller updates Gateway condition `DNSManaged=True`


Is there any reason DNSManaged=False? Maybe reserved for the future, if another DNS management system is selected, such as ExternalDNS?

From CIO code:

// In case there is no managed DNS zone configured, return a single condition // that DNSManaged=False because no zone is configured

any other specific cases (like IngressController endpointpublishingstrategy) are not verified by Gateway API, so we intentionally skip it.

But managed can yes, be false in case the DNSConfig doesn't specify a proper public or private zone, even on Gateway API

candita · 2025-11-18T19:40:40Z

enhancements/ingress/add-dns-and-loadbalancer-conditions-to-managed-gateway.md

+5. Gateway Status Controller updates Gateway condition `DNSManaged=True`
+(DNS should be managed, configuration is correct)
+6. Gateway Status Controller updates Gateway condition `DNSReady=False` with
+reason `FailedZones` and detailed error message from DNS provider


Suggested change

reason `FailedZones` and detailed error message from DNS provider

reason and error message as detailed in the section on Implementation Details.

hum, I think it is fine to keep it here. The implementation details are more about "How" we will implement, but it does make sense IMO keeping the conditions that will be used on the failure flow.

candita · 2025-11-18T19:58:14Z

enhancements/ingress/add-dns-and-loadbalancer-conditions-to-managed-gateway.md

+* DNS conditions apply regardless of platform if DNS records are being managed
+
+**MicroShift:**
+* MicroShift typically does not use Gateway API or cloud LoadBalancer services


I remember hearing about a MicroShift integration, handled by MicroShift, so maybe remove the first line?

candita · 2025-11-18T19:59:17Z

enhancements/ingress/add-dns-and-loadbalancer-conditions-to-managed-gateway.md

+* No impact on MicroShift resource consumption or configuration
+
+**Resource Impact:**
+* Minimal CPU/memory impact: only adds condition updates during reconciliation


It doesn't watch DNS records? During reconciliation of what object?

it watches, but these watches are negligible from a resource impact perspective IMO. I am not sure it is worth mentioning it here (also this is related to SNO deployments, so it is not different from other resource impacts considered above)

candita · 2025-11-18T20:01:25Z

enhancements/ingress/add-dns-and-loadbalancer-conditions-to-managed-gateway.md

+
+**Resource Impact:**
+* Minimal CPU/memory impact: only adds condition updates during reconciliation
+* No additional controllers or processes required


There is a new gateway-status controller mentioned below.

changed to "* A new gateway-status controller is created on existing Cluster Ingress Operator"

candita · 2025-11-18T20:20:03Z

enhancements/ingress/add-dns-and-loadbalancer-conditions-to-managed-gateway.md

+namespace that have the OpenShift Gateway Class as their `.spec.gatewayClassName` controller
+* Associated DNSRecord and Service resources are discovered using the
+`gateway.networking.k8s.io/gateway-name` label
+* Only the first matching DNSRecord and Service in the same namespace are used


What's the reasoning behind only the first matching DNSRecord being used? Why not check all DNSRecords for the Gateway and report if one or more are in a failure status?

well:

For services we know that Istio will provision just one Service (of type LoadBalancer) for the Gateway

For DNS, it is following the same logics from CIO, that receives the "wildcard DNS record" only. Maybe this assumption is wrong for Gateway API, and we should compute the DNS record from all of the provisioned DNS Records (we do watch all of the DNS Records related to the Gateway).

I will fix the EP here, as we need to watch all the DNSRecords from the Gateway, good catch!

candita · 2025-11-18T20:24:34Z

enhancements/ingress/add-dns-and-loadbalancer-conditions-to-managed-gateway.md

+
+*DNSReady Condition:*
+* Set to `Unknown` when DNSManagementPolicy is `Unmanaged` (OpenShift doesn't manage DNS, so status is unknown)
+* Set to `False` with reason `RecordNotFound` when the associated DNSRecord resource cannot be found


Is this more precise?

Suggested change

* Set to `False` with reason `RecordNotFound` when the associated DNSRecord resource cannot be found

* Set to `False` with reason `RecordNotFound` when one or more of the associated DNSRecord resources cannot be found

candita · 2025-11-18T21:36:01Z

enhancements/ingress/add-dns-and-loadbalancer-conditions-to-managed-gateway.md

+
+**Condition Lifecycle:**
+* Conditions are added when a Gateway is reconciled in the `openshift-ingress` namespace
+* Conditions are updated in-place using `condutils.SetStatusCondition()` to preserve transition times


Why preserve transition times? Transition time should be updated if the condition changes, at least if it changes from true to false or vice versa.

candita · 2025-11-18T21:37:57Z

enhancements/ingress/add-dns-and-loadbalancer-conditions-to-managed-gateway.md

+* Maximum of 8 total conditions are maintained per Gateway to prevent unbounded growth
+
+**Permissions:**
+* The cluster-ingress-operator service account is granted RBAC permissions to:


I would like to see it say:

Suggested change

* The cluster-ingress-operator service account is granted RBAC permissions to:

* The cluster-ingress-operator service account uses existing RBAC permissions to:

enhancements/ingress/add-dns-and-loadbalancer-conditions-to-managed-gateway.md

candita · 2025-11-18T21:42:03Z

enhancements/ingress/add-dns-and-loadbalancer-conditions-to-managed-gateway.md

+* No additional controllers or processes required
+* Negligible increase in etcd storage for condition status (~1KB per Gateway)
+
+### Implementation Details/Notes/Constraints


One more implementation detail - how about making sure the status gets added to the must-gather troubleshooting document?

candita · 2025-11-18T21:44:31Z

enhancements/ingress/add-dns-and-loadbalancer-conditions-to-managed-gateway.md

+**Not applicable to all environments:**
+* The LoadBalancer condition is only meaningful on cloud platforms or platforms with
+`LoadBalancer` support.
+* Users on bare metal may see persistent `False` or `Unknown` status which could


We should make sure that either the messages are clearly indicating that status isn't supported (e.g. "Bare metal clusters don't measure Gateway status"), or that they are at worst Unknown, not False.

so, what we discussed so far is that as Load balancers are not supported on Bare metal, but we also "do not provide" the Load Balancer managed condition on Gateway API, we will simply feed the "LoadBalancerReady" condition. This condition will reflect the current behavior of CCM / Baremetal provisioner, so let's say you are on baremetal:

If you have metallb, it will work fine

If you don't have a Load balancer controller, the status of the LoadBalancerReady condition will be false and the reason will be the LoadBalancer is pending, which means you don't have a LoadBalancer controller on your environment, and reflects the same behavior of CIO.

IMO as we don't have a clear definition yet on bare metal loadbalancer, I think this is the most meaningful information we can provide to users without being misleading, wdyt?

(also it should be considered that this is the "Drawbacks" section, meaning this may be a known drawback)

enhancements/ingress/add-dns-and-loadbalancer-conditions-to-managed-gateway.md

candita · 2025-11-18T21:49:26Z

enhancements/ingress/add-dns-and-loadbalancer-conditions-to-managed-gateway.md

+* Test `ComputeGatewayAPIDNSStatus` wrapper correctly converts internal conditions to Gateway API conditions
+* Test `ComputeGatewayAPILoadBalancerStatus` wrapper correctly converts internal conditions to Gateway API conditions
+* Test condition computation with DNSManagementPolicy set to Managed vs Unmanaged
+* Test ObservedGeneration is correctly set on conditions


Maybe also: Test transition times?

enhancements/ingress/add-dns-and-loadbalancer-conditions-to-managed-gateway.md

candita · 2025-11-18T21:53:15Z

enhancements/ingress/add-dns-and-loadbalancer-conditions-to-managed-gateway.md

+ * On the same test, verify the condition count is consistent with Istio and Openshift
+added conditions
+* Create Gateway out of `openshift-ingress` and verify that no Openshift condition is added
+* Create Gateway with wrong DNS Domain and verify that Openshift conditions reflect the failue


How about:
Create Gateway with multiple listeners, only one of which can get a successful DNS record. I expect the dns ready status to be False.

candita · 2025-11-18T21:54:13Z

enhancements/ingress/add-dns-and-loadbalancer-conditions-to-managed-gateway.md

+* No CSI, CRI, or CNI changes are involved
+
+**Compatibility:**
+* Feature works with Gateway API v1 (both support custom conditions)


nit: What does the "both" refer to here?

halucination, apparently. Let me remove the word

candita · 2025-11-18T21:56:05Z

enhancements/ingress/add-dns-and-loadbalancer-conditions-to-managed-gateway.md

+* Negligible impact on API throughput: condition updates happen during normal reconciliation
+* No new API calls introduced; only status updates to existing Gateway resources
+* Expected number of managed Gateways in `openshift-ingress` namespace: typically 1-10 per cluster
+* Condition updates are rate-limited to prevent excessive writes


The rate-limiting is automatic, no coding needed, right?

yeah, it is part of the maximum reconciliation that we define, and client-go throttling (or it should).

I am not sure where claude got this, so I would be happy to also remove this line if we feel it may be misleading.

candita · 2025-11-18T21:58:08Z

enhancements/ingress/add-dns-and-loadbalancer-conditions-to-managed-gateway.md

+
+**Detecting Issues:**
+
+*Symptom: Gateway conditions show `DNSManaged=False`*


This is not necessarily an error condition. Some users may choose to not have DNS be managed.

candita · 2025-11-18T22:04:45Z

@rikatz Overall I think this looks great. Do you think we should use the generator for other EP/KEP/GEPs?

I have a few nits and questions, and one major question: can you discuss the decision not to propagate condition status up to the ingress controller status? I can see pros and cons for both, but we should document that decision. Thanks.

openshift-ci-robot added the jira/valid-reference Indicates that this PR references a valid Jira ticket of any type. label Oct 21, 2025

openshift-ci bot added the do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. label Oct 21, 2025

openshift-ci bot requested review from knobunc and rfredette October 21, 2025 18:12

rikatz mentioned this pull request Oct 21, 2025

[NE-2195] Add initial add-enhancement command #1870

Merged

rikatz force-pushed the ne-2183-ep branch from 520ec81 to 31a44a8 Compare October 22, 2025 19:02

rikatz changed the title ~~WIP: NE-2183: Initial write on Gateway API conditions~~ NE-2183: Initial write on Gateway API conditions Oct 22, 2025

openshift-ci bot removed the do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. label Oct 22, 2025

rikatz changed the title ~~NE-2183: Initial write on Gateway API conditions~~ NE-2183: Openshift conditions on Gateway API status Oct 22, 2025

openshift-ci bot assigned Miciah Oct 30, 2025

openshift-ci bot assigned candita Oct 30, 2025

alebedev87 reviewed Nov 6, 2025

View reviewed changes

rikatz added 3 commits November 17, 2025 16:25

NE-2183: Initial write on Gateway API conditions

bdf1eac

Add enhancement from code

089dcec

NE-2183: Improve wording on EP

a97bf83

rikatz force-pushed the ne-2183-ep branch from 31a44a8 to a97bf83 Compare November 17, 2025 19:26

candita reviewed Nov 18, 2025

View reviewed changes

enhancements/ingress/add-dns-and-loadbalancer-conditions-to-managed-gateway.md Show resolved Hide resolved

NE-2183: Remove unnecessary LB conditions

ee01e3e

rikatz force-pushed the ne-2183-ep branch from 4ee7bcf to ee01e3e Compare November 18, 2025 19:15

candita reviewed Nov 18, 2025

View reviewed changes

enhancements/ingress/add-dns-and-loadbalancer-conditions-to-managed-gateway.md Show resolved Hide resolved

candita reviewed Nov 18, 2025

View reviewed changes

enhancements/ingress/add-dns-and-loadbalancer-conditions-to-managed-gateway.md Show resolved Hide resolved

candita reviewed Nov 18, 2025

View reviewed changes

enhancements/ingress/add-dns-and-loadbalancer-conditions-to-managed-gateway.md Show resolved Hide resolved

candita reviewed Nov 18, 2025

View reviewed changes

enhancements/ingress/add-dns-and-loadbalancer-conditions-to-managed-gateway.md Show resolved Hide resolved

candita reviewed Nov 18, 2025

View reviewed changes

enhancements/ingress/add-dns-and-loadbalancer-conditions-to-managed-gateway.md Show resolved Hide resolved

candita reviewed Nov 18, 2025

View reviewed changes

enhancements/ingress/add-dns-and-loadbalancer-conditions-to-managed-gateway.md Show resolved Hide resolved

candita reviewed Nov 18, 2025

View reviewed changes

enhancements/ingress/add-dns-and-loadbalancer-conditions-to-managed-gateway.md Show resolved Hide resolved

candita reviewed Nov 18, 2025

View reviewed changes

	reason `FailedZones` and detailed error message from DNS provider
	reason and error message as detailed in the section on Implementation Details.

	* Set to `False` with reason `RecordNotFound` when the associated DNSRecord resource cannot be found
	* Set to `False` with reason `RecordNotFound` when one or more of the associated DNSRecord resources cannot be found

	* The cluster-ingress-operator service account is granted RBAC permissions to:
	* The cluster-ingress-operator service account uses existing RBAC permissions to:


		Detecting Issues:

		Symptom: Gateway conditions show `DNSManaged=False`

NE-2183: Openshift conditions on Gateway API status #1871

Are you sure you want to change the base?

NE-2183: Openshift conditions on Gateway API status #1871

Uh oh!

Conversation

rikatz commented Oct 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

openshift-ci-robot commented Oct 21, 2025 • edited by openshift-ci bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

openshift-ci-robot commented Oct 22, 2025 • edited by openshift-ci bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Miciah commented Oct 30, 2025

Uh oh!

candita commented Oct 30, 2025

Uh oh!

candita commented Oct 30, 2025

Uh oh!

rikatz commented Oct 31, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

alebedev87 commented Nov 6, 2025

Uh oh!

openshift-ci bot commented Nov 17, 2025

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

candita Nov 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

openshift-ci bot commented Nov 18, 2025

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

rikatz Nov 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

candita Nov 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

rikatz commented Oct 21, 2025 •

edited

Loading

openshift-ci-robot commented Oct 21, 2025 •

edited by openshift-ci bot

Loading

openshift-ci-robot commented Oct 22, 2025 •

edited by openshift-ci bot

Loading

candita Nov 18, 2025 •

edited

Loading

rikatz Nov 19, 2025 •

edited

Loading

candita Nov 18, 2025 •

edited

Loading

candita Nov 18, 2025 •

edited

Loading

candita Nov 18, 2025 •

edited

Loading