feat: use informer cache for Get method in Kubernetes backend #525

juanxiu · 2025-08-20T03:37:52Z

What does this PR do / why we need it:
This PR enhances the KubernetesBackend by modifying the Get method to use the informer cache instead of querying the Kubernetes API server directly when retrieving Application resources. It leverages the generic Informer's Lister() method to efficiently access cached resources, thereby reducing load on the API server and improving performance. Corresponding unit tests have been added to verify correct informer cache usage. This change creates a foundation for more efficient resource management via the informer caching mechanism.

Which issue(s) this PR fixes:

Fixes #251

How to test changes / Special notes to the reviewer:

Unit tests utilize a fake clientset combined with informer creation to validate that the Get method correctly retrieves Application resources from the informer cache instead of making direct API calls.
Manual or integration tests may confirm informer startup, cache synchronization, and API call reduction.
Reviewer should verify that the Get method no longer calls the API server directly and uses cached data.

Checklist

Documentation update is required by this PR (and has been updated) OR no documentation update is required.

Signed-off-by: yeonsoo <[email protected]>

jannfis

Thanks @juanxiu for this PR.

I have a comment requiring some more discussion, PTAL.

jannfis · 2025-08-20T12:42:24Z

internal/backend/kubernetes/application/kubernetes.go

+	if !ok {
+		return nil, fmt.Errorf("object is not an Application: %T", obj)
+	}
+	return app, nil


Items returned from the cache need to be treated as read-only. I believe that's why unrelated unit tests are failing and panicking.

There are two options imho:

1. We clearly document that objects returned by this function must be treated as read-only, because they are retrieved directly from the cache, and that the caller needs to make a copy before modifying them, or

2. We return a copy of the object retrieved from the cache, to take the burden off the caller.

The first option puts more responsibility on the caller but is resource-efficient. The second option lets the caller handle the objects freely, at the cost of increased memory consumption.

I have not yet made up my mind with regards to which solution I'd prefer. Which one do you think makes more sense?
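
To make the trade-off concrete, the following sketch contrasts the two options using a plain in-memory map. Everything in it is an assumption for illustration: `store`, `GetShared`, `GetCopy`, `newStore`, and the reduced `Application` struct are hypothetical, and the hand-written `DeepCopy` only imitates what a generated deep-copy method would do for the real type.

```go
package main

import "fmt"

// Application is a reduced, hypothetical stand-in for the real type.
type Application struct {
	Name   string
	Labels map[string]string
}

// DeepCopy returns a copy that shares no memory with the receiver.
func (a *Application) DeepCopy() *Application {
	if a == nil {
		return nil
	}
	out := &Application{Name: a.Name}
	if a.Labels != nil {
		out.Labels = make(map[string]string, len(a.Labels))
		for k, v := range a.Labels {
			out.Labels[k] = v
		}
	}
	return out
}

// store is a hypothetical stand-in for the informer cache.
type store struct {
	items map[string]*Application
}

func newStore() *store {
	return &store{items: map[string]*Application{
		"guestbook": {Name: "guestbook", Labels: map[string]string{"env": "prod"}},
	}}
}

// GetShared is option 1: return the cached pointer; callers must not mutate it.
func (s *store) GetShared(name string) *Application {
	return s.items[name]
}

// GetCopy is option 2: return a private copy; callers may mutate it freely.
func (s *store) GetCopy(name string) *Application {
	return s.items[name].DeepCopy()
}

func main() {
	s := newStore()

	// Mutating a copy leaves the cache untouched.
	s.GetCopy("guestbook").Labels["env"] = "dev"
	fmt.Println(s.items["guestbook"].Labels["env"]) // prints "prod"

	// Mutating the shared pointer silently changes the cached object.
	s.GetShared("guestbook").Labels["env"] = "dev"
	fmt.Println(s.items["guestbook"].Labels["env"]) // prints "dev"
}
```

The second mutation is the kind of write that option 1 would forbid by documentation only, while option 2 rules it out at the cost of one allocation per call.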

In option 1, when objects are retrieved through multiple informers, there is a risk that callers may forget to use DeepCopy(). Such oversights can make it difficult to trace and resolve bugs. On the other hand, callers can use resources efficiently and have the flexibility to decide when to create copies.

Option 2 always returns a copy of the object from the function, so callers cannot control when the copy is made. However, callers can still customize behavior by registering event handlers with the informer. Returning a copy from the function enforces consistent object usage and prevents inconsistent handling of copying across different callers.

Personally, I chose option 2 because I believe minimizing the potential for bugs and ensuring consistent and stable behavior throughout the codebase is important.

For these reasons, I have modified the code to return app.DeepCopy(), nil.
Additionally, I plan to update the List methods in the Kubernetes backend for Application and AppProject resources to use the informer cache as well. I would appreciate it if we could merge this after those changes are completed.

It seems there are other problems with this approach, at least regarding the tests. Some of them are still failing.

There is no issue with loading the informer cache itself. However, in the current unit tests, a different problem arises. To load the cache, informer.Run must be executed, which requires calling the Start method of either the manager or the server. Until now, the code has been written without assuming that Start would be called in the test code. As a result, timing issues related to goroutine execution occur during test runs. How can we resolve this situation?

Signed-off-by: yeonsoo <[email protected]>

juanxiu · 2025-08-23T07:57:24Z

principal/event_test.go

 		wq.On("Get").Return(&ev, false)
 		wq.On("Done", &ev)
 		s, err := NewServer(context.Background(), fac, "argocd", WithGeneratedTokenSigningKey(), WithAutoNamespaceCreate(true, "", nil))
+		s.Start(context.Background(), make(chan error))


When testing this way, Start must be called so that the informer runs and its cache is populated.

@jannfis Can you take a look at this issue and let me know if you have any ideas to resolve it?

juanxiu added 2 commits August 20, 2025 12:29

feat: add Lister() to expose informer cache bia GenericLister

f462434

Signed-off-by: yeonsoo <[email protected]>

feat: use informr cache in KubernetesBackend Get method

d809156

Signed-off-by: yeonsoo <[email protected]>

juanxiu requested review from jannfis, jgwest and chetan-rns as code owners August 20, 2025 03:37

test: add unit tests for Get method using informer cache

29659aa

Signed-off-by: yeonsoo <[email protected]>

jannfis reviewed Aug 20, 2025

juanxiu added 2 commits August 21, 2025 15:02

fix: Return deep copy of Application object in Get()

894e20a

Signed-off-by: yeonsoo <[email protected]>

test: add informer start

5cbd95c

Signed-off-by: yeonsoo <[email protected]>

juanxiu commented Aug 23, 2025

Merge branch 'argoproj-labs:main' into main

829de82
