Add status field to Kubernetes clusters to support discovery workflows by tigrato · Pull Request #62161 · gravitational/teleport

tigrato · 2025-12-11T12:51:54Z

Adds a new Status field to KubernetesClusterV3 containing discovery-related information. The status includes cloud provider-specific data, starting with AWS which tracks the ARN for access setup, integration name, and the assumed role used during discovery.

This enables the discovery service to persist state about clusters it discovers, which is needed to properly cleanup created access entries when a cluster is removed or no longer matches the label filtering. Without this information, dangling AWS resources would be left behind after discovery changes.

Changelog: Added cleanup of access entries for EKS auto-discovered clusters when they no longer match the filtering criteria and are removed.

Adds a new Status field to KubernetesClusterV3 containing discovery-related information. The status includes cloud provider-specific data, starting with AWS which tracks the ARN for access setup, integration name, and the assumed role used during discovery. This enables the discovery service to persist state about clusters it discovers, which is needed to properly cleanup created access entries when a cluster is removed or no longer matches the label filtering. Without this information, dangling AWS resources would be left behind after discovery changes. Signed-off-by: Tiago Silva <tiago.silva@goteleport.com>

api/proto/teleport/legacy/types/types.proto

espadolini · 2025-12-11T13:53:10Z

api/proto/teleport/legacy/types/types.proto

+  // Status is the resource status.
+  KubernetesClusterStatus Status = 6 [
+    (gogoproto.nullable) = false,
+    (gogoproto.jsontag) = "status"
+  ];


Please don't add gogoproto options if they're unnecessary.

Also, style: protobuf field names and protobuf docstrings should follow protobuf styling.

Suggested change

// Status is the resource status.

KubernetesClusterStatus Status = 6 [

(gogoproto.nullable) = false,

(gogoproto.jsontag) = "status"

];

// the resource status, intended to be ignored by IaC tools

KubernetesClusterStatus status = 6;

<the reason why I added json tags is mostly because the resource is always marshaled using json.Marshal which might create inconsistent namings

The default encoding/json field name is the protobuf field name, so unless we're trying to match a specific go struct field name for the sake of existing code (which is not the case here) we can just pick the field name for the json and live with whatever field name ends up in the go codegen.

api/types/kubernetes.go

api/proto/teleport/legacy/types/types.proto

espadolini

Is this something that fits the "status" field or should it just be in "spec"?

No matter where we put the additional field, what can break if the field is discarded (because the auth might not support it) in the cleanup logic? Will it just keep the existing behavior?

Is it possible for this field to change between writes and thus trigger more writes and reconciliations? Should we change (*KubernetesClusterV3).IsEqual?

lib/srv/discovery/fetchers/eks.go

tigrato · 2025-12-18T14:27:32Z

Is this something that fits the status field, or should it live in spec?

This belongs in the status field. These values should not be user editable and should not be treated as part of the desired state. They are owned and managed by the discovery process itself. While we allow users to manually create dynamic Kubernetes clusters, we don’t want them to modify these fields, as they are not assumed or consumed unless discovery is attempting to delete the resource.

Regardless of where we place the additional field, what could break if it’s discarded (for example, if auth doesn’t support it) during cleanup? Would the behavior remain the same?

If the field is discarded, the current behavior remains unchanged. The access entry will simply be left in place for future generations.

Could this field change between writes and cause additional writes or reconciliations? Do we need to update (*KubernetesClusterV3).IsEqual?

This field cannot change as part of normal writes. The only way it can change is via updates to the discovery configuration or teleport.yaml. Both cases already trigger reconciliation, so this does not introduce additional churn.

It also does not affect Kubernetes heartbeats, since this field along with other sensitive fields are discarded during heartbeating.

PS: I forgot that our goderive stuff ignores status fields. Updated

espadolini · 2025-12-18T15:27:40Z

lib/srv/discovery/kube_watcher.go

+				if kc1.GetStatus().IsEqual(kc2.GetStatus()) {
+					return services.Equal
+				}
+				return services.Different


nit for consistency

Suggested change

if kc1.GetStatus().IsEqual(kc2.GetStatus()) {

return services.Equal

}

return services.Different

if !kc1.GetStatus().IsEqual(kc2.GetStatus()) {

return services.Different

}

return services.Equal

smallinsky · 2025-12-18T15:36:42Z

lib/srv/discovery/kube_watcher.go

+					return res
+				}
+				// Additionally compare Status field using its IsEqual method.
+				// This is needed because CompareResources ignores Status field of KubeCluster and for most


.. CompareResources ignores Status field of KubeCluster

Are you sure that this is true ?

based on the code status is ignored only for DatabaseV3 and UserSpecV2 and AccessList types:

func CompareResources[T any](resA, resB T) int { var equal bool if hasEqual, ok := any(resA).(compare.IsEqual[T]); ok { equal = hasEqual.IsEqual(resB) } else { equal = cmp.Equal(resA, resB, ignoreProtoXXXFields(), cmpopts.IgnoreFields(types.Metadata{}, "Revision"), cmpopts.IgnoreFields(types.DatabaseV3{}, "Status"), cmpopts.IgnoreFields(types.UserSpecV2{}, "Status"), cmpopts.IgnoreFields(accesslist.AccessList{}, "Status"), cmpopts.IgnoreFields(header.Metadata{}, "Revision"), cmpopts.IgnoreUnexported(headerv1.Metadata{}), // Managed by IneligibleStatusReconciler, ignored by all others. cmpopts.IgnoreFields(accesslist.AccessListMemberSpec{}, "IneligibleStatus"), cmpopts.IgnoreFields(accesslist.Owner{}, "IneligibleStatus"), cmpopts.EquateEmpty(), ) } if equal { return Equal } return Different }

reference:
https://github.com/gravitational/teleport/blob/master/lib/services/compare.go#L35

So the whole kc1.GetStatus().IsEqual(kc2.GetStatus()) call seems to be redundant here.

The KubeCluster type implements an IsEqual method, which causes the comparison logic to use hasEqual.IsEqual(resB) instead of falling back to cmp.Equal.

This IsEqual method is generated by Teleport's Go derive plugin, which is responsible for generating comparison code. The plugin intentionally skips status fields during comparison, as shown in this code

Ah, thanks. The CompareResources ignores Status statment suggested that the ignore logic is implemented in CompareResources like in case of filed like: IgnoreFields(types.DatabaseV3{}, "Status"),

backport-bot-workflows · 2026-01-02T19:52:14Z

@tigrato See the table below for backport results.

Branch	Result
branch/v17	Failed
branch/v18	Failed

#62161) * Add status field to Kubernetes clusters to support discovery workflows Adds a new Status field to KubernetesClusterV3 containing discovery-related information. The status includes cloud provider-specific data, starting with AWS which tracks the ARN for access setup, integration name, and the assumed role used during discovery. This enables the discovery service to persist state about clusters it discovers, which is needed to properly cleanup created access entries when a cluster is removed or no longer matches the label filtering. Without this information, dangling AWS resources would be left behind after discovery changes. Signed-off-by: Tiago Silva <tiago.silva@goteleport.com> * handle code review comments * handle code review comments * add comment --------- Signed-off-by: Tiago Silva <tiago.silva@goteleport.com>

#62161) (#62599) * Add status field to Kubernetes clusters to support discovery workflows Adds a new Status field to KubernetesClusterV3 containing discovery-related information. The status includes cloud provider-specific data, starting with AWS which tracks the ARN for access setup, integration name, and the assumed role used during discovery. This enables the discovery service to persist state about clusters it discovers, which is needed to properly cleanup created access entries when a cluster is removed or no longer matches the label filtering. Without this information, dangling AWS resources would be left behind after discovery changes. * handle code review comments * handle code review comments * add comment --------- Signed-off-by: Tiago Silva <tiago.silva@goteleport.com>

#62161) (#62598) * Add status field to Kubernetes clusters to support discovery workflows Adds a new Status field to KubernetesClusterV3 containing discovery-related information. The status includes cloud provider-specific data, starting with AWS which tracks the ARN for access setup, integration name, and the assumed role used during discovery. This enables the discovery service to persist state about clusters it discovers, which is needed to properly cleanup created access entries when a cluster is removed or no longer matches the label filtering. Without this information, dangling AWS resources would be left behind after discovery changes. * handle code review comments * handle code review comments * add comment --------- Signed-off-by: Tiago Silva <tiago.silva@goteleport.com>

tigrato added backport/branch/v17 backport/branch/v18 labels Dec 11, 2025

github-actions bot requested review from boxofrad and ryanclark December 11, 2025 12:52

github-actions bot added discovery size/md labels Dec 11, 2025

espadolini reviewed Dec 11, 2025

View reviewed changes

handle code review comments

eb13e2c

boxofrad approved these changes Dec 18, 2025

View reviewed changes

espadolini reviewed Dec 18, 2025

View reviewed changes

lib/srv/discovery/fetchers/eks.go Show resolved Hide resolved

tigrato added 3 commits December 18, 2025 14:49

handle code review comments

3d2e849

Merge branch 'master' into tigrato/eksautocleanup

38cb6d5

add comment

410e1fa

espadolini approved these changes Dec 18, 2025

View reviewed changes

public-teleport-github-review-bot bot removed the request for review from ryanclark December 18, 2025 15:32

smallinsky reviewed Dec 18, 2025

View reviewed changes

smallinsky approved these changes Dec 18, 2025

View reviewed changes

Merge branch 'master' into tigrato/eksautocleanup

c510933

tigrato enabled auto-merge January 2, 2026 16:57

tigrato added this pull request to the merge queue Jan 2, 2026

Merged via the queue into master with commit 533286b Jan 2, 2026
47 of 49 checks passed

tigrato deleted the tigrato/eksautocleanup branch January 2, 2026 19:50

This was referenced Jan 5, 2026

[v18] Add status field to Kubernetes clusters to support discovery workflows #62598

Merged

[v17] Add status field to Kubernetes clusters to support discovery workflows #62599

Merged

Conversation

tigrato commented Dec 11, 2025

Uh oh!

Uh oh!

Uh oh!

espadolini Dec 11, 2025

Choose a reason for hiding this comment

Uh oh!

tigrato Dec 11, 2025

Choose a reason for hiding this comment

Uh oh!

espadolini Dec 11, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

espadolini left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

tigrato commented Dec 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

espadolini Dec 18, 2025

Choose a reason for hiding this comment

Uh oh!

smallinsky Dec 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

tigrato Dec 18, 2025

Choose a reason for hiding this comment

Uh oh!

smallinsky Dec 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

backport-bot-workflows bot commented Jan 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

tigrato commented Dec 18, 2025 •

edited

Loading

smallinsky Dec 18, 2025 •

edited

Loading

smallinsky Dec 18, 2025 •

edited

Loading