KEDA should check whether scale target is already managed by another ScaledObject #3755

zroubalik · 2022-10-17T13:22:44Z

Proposal

When a new ScaledObject is reconciled by Operator, it should be checked whether the scale target referenced by the new ScaledObject is being already managed by another ScaledObject, if so an error should be raised.

Use-Case

No response

Anything else?

No response

JorTurFer · 2022-10-17T13:42:57Z

Nice idea!
What are you thinking about? Maybe an admission controller?

zroubalik · 2022-10-17T14:01:20Z

As a start, an error while creating the new ScaledObject is enough imho.

JorTurFer · 2022-10-20T19:31:35Z

BTW, this is duplicated I think #3087

yuvalweber · 2022-11-28T14:52:03Z

I think you can implement this using GateKeeper policy if you would like to

JorTurFer · 2022-11-28T14:55:07Z

We prefer to manage it internally better than using a 3rd party software. Something like checking it during the HPA creation or an admission hook.
Are you willing to contribute? 😄

yuvalweber · 2022-11-28T14:59:16Z

I actually want to contribute but I think I prefer to contribute in areas that I am more familiar with 😄

JorTurFer · 2022-11-28T16:02:48Z

makes sense totally, don't worry :)

JorTurFer · 2022-12-10T15:35:25Z

I have been thinking about this, and I guess that an admission controller is the best option, because the person who deploys the SO could not be the person with access to KEDA logs. In companies with SRE teams managing the clusters, developers maybe don't have access to KEDA logs because it's in other namespace and they don't have access to check the logs. With an admission hook, we can give the feedback to those users during the deployment process.
IMO this component should be optional, WDYT? Maybe we can just add another component inside keda repo, or maybe in another split repo we can extend in the future with new features for this hook (like checking not only other SOs but also HPAs).

If we agree with the admission hook, I'm open to do a PoC to check it

zroubalik · 2022-12-11T08:16:38Z

+1

Admissions controller inside this repo is the best approach imho. We can then extend this with another checks (find out whether there is HPA targeting the same resource) and we can also move some validation from the controller to admission controller.

tomkerkhove · 2022-12-23T08:29:51Z

Given we already have a discussion on #3087 and this is a duplicate, can we merge the conversation please?

tomkerkhove · 2022-12-23T08:32:18Z

As per the issues on https://github.com/kedacore/keda/issues?q=is%3Aopen+label%3Aprevention+sort%3Aupdated-desc, I believe that blocking creation is the only user-friendly way as error logs will not be consumed indeed.

The resource should be prevented from being created.

@JorTurFer I see you started working on this; would you mind making sure that the approach is extensible so that the other scenarios can be easily added in the future please?

As per #3087 we discussed to make it configurable and make it an opt-in. However, I think we can make it an opt-out instead as this would reduce the support cases given the nature of using multiple SOs and how of an anti-pattern that is.

JorTurFer · 2022-12-23T09:23:14Z

The resource should be prevented from being created.

Exactly, the webhooks' server logs the error, but the k8s api blocks the resource creation with a clear message like:
the workload 'XXX' of type 'YYY/ZZZ' is already managed by the ScaledObject 'foo' or the workload 'XXX' of type 'YYY/ZZZ' is already managed by the hpa 'foo'

@JorTurFer I see you started working on this; would you mind making sure that the approach is extensible so that the other scenarios can be easily added in the future please?

I thought in this and that's why I split the admission hook in other (the 3rd) deployment instead of including it in the operator. We will have the webhooks' server and we can include all new validating (or mutating) webhooks in the future, just adding the new code and registering them in the cluster, but without any "major change" as now.

As per #3087 we discussed to make it configurable and make it an opt-in. However, I think we can make it an opt-out instead as this would reduce the support cases given the nature of using multiple SOs and how of an anti-pattern that is.

Using kubectl, I have to install the webhook because we use those manifest for e2e tests (and obviously, I'm covering it with e2e test), but in this case, I don't block the resource if the webhook isn't alive, only if it is ready and blocks the validation (which I think is correct, because there is an error indeed).
My idea for helm chart is to make it configurable, deployed as default in the same way as the manifest, but supporting the restrictive mode (the webhook MUST validate the resource and k8s blocks it if the webhook isn't available) and also not installing the webhook (for multi-tenant scenarios where you can only have just 1 webhook for all the tenants due to k8s webhook api is a single endpoint registration at cluster scope)

tomkerkhove · 2023-01-03T08:05:37Z

Sounds good to me, thanks. For Helm, I would make it on by default though since it has it's own cycle and they can just turn it off when need be.

yuvalweber · 2023-01-03T08:12:30Z

I have question regarding the implementation.
I looked at this feature on gatekeeper and I see that in gatekeeper you need to cache all of the hpas and scaledobject in the cluster and then you can filter only by namespace to check if there is HPA or scaledobject that points to your scaleTarget (cause both are namespaced object).

In real production environment the amount of hpas and scaledobjects can get up to thousands of objects.
How do you solve the slowliness in your implementation??

tomkerkhove · 2023-01-03T08:25:06Z

Would you mind moving this to the PR to keep the conversation close to that please?

tomkerkhove · 2023-01-13T12:24:41Z

Just to be sure; are we effectively checking this today or was it closed by accident? (I think it's fine but just checking)

JorTurFer · 2023-01-13T12:37:00Z

It was closed automatically because the feature is fully implemented.
Feature + Docs + Helm chart changes are merged (helm changes merge closed this)

hanna-liashchuk · 2023-07-19T13:52:56Z

hi,
Is there a reason why it's not allowed to have 2 or more ScaledObjects that control one Deployment/etc?
We are using keda to scale our database servers and since one Scaled object can go either UP or DOWN, we have two, because we scale up on one condition and down on another. this is not optimal in terms of resource utilization ofc, but we cannot afford to kill user connects.
and after 2.10 it doesn't work anymore

JorTurFer · 2023-07-19T14:27:36Z

Is there a reason why it's not allowed to have 2 or more ScaledObjects that control one Deployment/etc?

Yes, there is 😄
Under the hood, KEDA generates an HPA for each ScaledObject, and the HPA controller doesn't take care about if a deployment is scaled by different HPA. It treats each HPA as independent on each cycle and 2 HPAs could produce (and produce) cases where the HPA 1 requires X instances and the HPA 2 requires Y instances, so every 15 seconds you'd see your workload doing X->Y->wait 15 sec->X->Y->.....

In fact, Kubernetes (1.26) has added a check in HPA Controller that disables the autoscaling if the HPA detects multiple deployments (based on labels). I mean, even if we'd remove the validation, it'll fail in k8s >= 1.26 because the HPA Controller disables the HPA

Could you share your user case more detailed? Maybe there is other option to address it

hanna-liashchuk · 2023-07-19T15:02:04Z

sure, gladly
so we have database servers that (let's say) can handle 10 users each. we have a ScaledObject that is triggered by Prometheus metric and if the average number of connections across servers is higher than the threshold - keda scales the servers up. if there is no connection left - another scaledObject scales them down.

JorTurFer · 2023-07-19T16:36:48Z

I'm quite sure that both ScaledObjects requires the same replicas or you should see the workload flapping between the required replicas for ScaledObject A and the required replicas for ScaledObject B.
Could you share them? Maybe I'm missing something 🤔

hanna-liashchuk · 2023-07-19T17:09:33Z

Yeah, they are having the same number of target replicas. I forgot to mention one thing - The downscaling object has upscale disabled and vice versa.

…

On Wed, Jul 19, 2023, 7:36 PM Jorge Turrado Ferrero < ***@***.***> wrote: I'm quite sure that both ScaledObjects requires the same replicas or you should see the workload flapping between the required replicas for ScaledObject A and the required replicas for ScaledObject B. Could you share them? Maybe I'm missing something 🤔 — Reply to this email directly, view it on GitHub <#3755 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/ALNTT466VYL5F3HJJIF77H3XRAEKXANCNFSM6AAAAAARHCFPVM> . You are receiving this because you commented.Message ID: ***@***.***>

JorTurFer · 2023-07-19T17:43:18Z

You're right, I was missing something xD
But in any case, it won't work on kubernetes >= 1.26

In fact, Kubernetes (1.26) has added a check in HPA Controller that disables the autoscaling if the HPA detects multiple deployments (based on labels).

I don't think that we should remove the validation if the upstream doesn't roll back that feature.
For the moment, you can disable the webhooks because they are optional, there is a helm value to disable them :)

yuvalweber · 2023-07-19T17:47:18Z

But if you specify a threshold and minReplicas you should be good.

If for example your threshold is 10 and you minReplicas is 4 then if the threshold is not met then it will scaleDown to minimum 4 replicas but if the threshold is met than it will upload more pods.

You can use same scaledObject for both upward and downward

hanna-liashchuk · 2023-07-19T18:13:21Z

The problem is that there is a chance that there is a connection on the 5th server and scaling down to 4 will disconnect the user. Sure, you shouldn't remove the check, especially cause k8s no longer supports it. I'll work around the Prometheus query, maybe something can be done there. Thanks for your prompt reply :)

…

On Wed, Jul 19, 2023, 8:47 PM yuval weber ***@***.***> wrote: But if you specify a threshold and minReplicas you should be good. If for example your threshold is 10 and you minReplicas is 4 then if the threshold is not met then it will scaleDown to minimum 4 replicas but if the threshold is met than it will upload more pods. You can use same scaledObject for both upward and downward — Reply to this email directly, view it on GitHub <#3755 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/ALNTT47VNHTMKUENRBPFLSDXRAMTBANCNFSM6AAAAAARHCFPVM> . You are receiving this because you commented.Message ID: ***@***.***>

yuvalweber · 2023-07-19T18:49:31Z

If the connections of the users are not super long then you can add a preStop hook to the pod which will maybe wait for termination or will terminate the connection gracefully.

If you want you can provide a bit more details regarding your use-case and we will try to help

tomkerkhove · 2023-07-20T10:04:09Z

Even if HPA would support it, I don't think having multiple autoscaling definitions tell 1 app to scale is a good idea.

yuvalweber · 2023-07-20T10:08:43Z

I know but there are some things I would do differently.
For example it would be nice if you could configure different scaling policies for different metrics.

Cause maybe to cpu I want to bring 2 more pods at a time but for memory I want to bring 4 pods at a time

JorTurFer · 2023-07-20T10:21:10Z

I know but there are some things I would do differently. For example it would be nice if you could configure different scaling policies for different metrics.

Cause maybe to cpu I want to bring 2 more pods at a time but for memory I want to bring 4 pods at a time

I think that here is where KEDA could try to extend the HPA for bringing that capabilities somehow and where KEDA can add value. I mean, KEDA is who is exposing the metrics to the HPA Controller and who manages the HPA, nothing blocks that KEDA modifies the HPA based on some conditions and modify the exposed values.

I mean, IMO that's a good idea and it's something that we can try to address in KEDA side, more than going against the upstream with multiple HPAs. In fact, didn't we created an issue for adding dynamic scaling rules @tomkerkhove ?

tomkerkhove · 2023-07-20T12:49:17Z

Yes - #2614

zroubalik added needs-discussion feature-request All issues for new features that have not been committed to labels Oct 17, 2022

zroubalik added this to Roadmap - KEDA Core Oct 17, 2022

zroubalik moved this to Proposed in Roadmap - KEDA Core Oct 17, 2022

JorTurFer added the stale-bot-ignore All issues that should not be automatically closed by our stale bot label Oct 17, 2022

JorTurFer added Hacktoberfest help wanted Looking for support from community labels Oct 17, 2022

tomkerkhove removed the Hacktoberfest label Nov 14, 2022

zroubalik changed the title ~~KEDA should whether scale target is already managed by another ScaledObject~~ KEDA should check whether scale target is already managed by another ScaledObject Dec 11, 2022

JorTurFer self-assigned this Dec 11, 2022

JorTurFer mentioned this issue Dec 12, 2022

feat: New validation hook to check if scale target is already managed #4001

Merged

6 tasks

JorTurFer mentioned this issue Jan 2, 2023

feat: add support to validating webhooks kedacore/charts#352

Merged

4 tasks

tomkerkhove mentioned this issue Jan 3, 2023

docs: validating webhook kedacore/keda-docs#1027

Merged

1 task

tomkerkhove moved this from Proposed to In Review in Roadmap - KEDA Core Jan 3, 2023

tomkerkhove added the prevention label Jan 3, 2023

JorTurFer closed this as completed in kedacore/charts#352 Jan 13, 2023

github-project-automation bot moved this from In Review to Ready To Ship in Roadmap - KEDA Core Jan 13, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

KEDA should check whether scale target is already managed by another ScaledObject #3755

KEDA should check whether scale target is already managed by another ScaledObject #3755

zroubalik commented Oct 17, 2022

JorTurFer commented Oct 17, 2022

zroubalik commented Oct 17, 2022 •

edited

Loading

JorTurFer commented Oct 20, 2022

yuvalweber commented Nov 28, 2022

JorTurFer commented Nov 28, 2022 •

edited

Loading

yuvalweber commented Nov 28, 2022

JorTurFer commented Nov 28, 2022

JorTurFer commented Dec 10, 2022 •

edited

Loading

zroubalik commented Dec 11, 2022 •

edited

Loading

tomkerkhove commented Dec 23, 2022

tomkerkhove commented Dec 23, 2022

JorTurFer commented Dec 23, 2022 •

edited

Loading

tomkerkhove commented Jan 3, 2023

yuvalweber commented Jan 3, 2023

tomkerkhove commented Jan 3, 2023

tomkerkhove commented Jan 13, 2023

JorTurFer commented Jan 13, 2023

hanna-liashchuk commented Jul 19, 2023

JorTurFer commented Jul 19, 2023

hanna-liashchuk commented Jul 19, 2023

JorTurFer commented Jul 19, 2023

hanna-liashchuk commented Jul 19, 2023 via email

JorTurFer commented Jul 19, 2023 •

edited

Loading

yuvalweber commented Jul 19, 2023

hanna-liashchuk commented Jul 19, 2023 via email

yuvalweber commented Jul 19, 2023

tomkerkhove commented Jul 20, 2023

yuvalweber commented Jul 20, 2023

JorTurFer commented Jul 20, 2023

tomkerkhove commented Jul 20, 2023

KEDA should check whether scale target is already managed by another ScaledObject #3755

KEDA should check whether scale target is already managed by another ScaledObject #3755

Comments

zroubalik commented Oct 17, 2022

Proposal

Use-Case

Anything else?

JorTurFer commented Oct 17, 2022

zroubalik commented Oct 17, 2022 • edited Loading

JorTurFer commented Oct 20, 2022

yuvalweber commented Nov 28, 2022

JorTurFer commented Nov 28, 2022 • edited Loading

yuvalweber commented Nov 28, 2022

JorTurFer commented Nov 28, 2022

JorTurFer commented Dec 10, 2022 • edited Loading

zroubalik commented Dec 11, 2022 • edited Loading

tomkerkhove commented Dec 23, 2022

tomkerkhove commented Dec 23, 2022

JorTurFer commented Dec 23, 2022 • edited Loading

tomkerkhove commented Jan 3, 2023

yuvalweber commented Jan 3, 2023

tomkerkhove commented Jan 3, 2023

tomkerkhove commented Jan 13, 2023

JorTurFer commented Jan 13, 2023

hanna-liashchuk commented Jul 19, 2023

JorTurFer commented Jul 19, 2023

hanna-liashchuk commented Jul 19, 2023

JorTurFer commented Jul 19, 2023

hanna-liashchuk commented Jul 19, 2023 via email

JorTurFer commented Jul 19, 2023 • edited Loading

yuvalweber commented Jul 19, 2023

hanna-liashchuk commented Jul 19, 2023 via email

yuvalweber commented Jul 19, 2023

tomkerkhove commented Jul 20, 2023

yuvalweber commented Jul 20, 2023

JorTurFer commented Jul 20, 2023

tomkerkhove commented Jul 20, 2023

zroubalik commented Oct 17, 2022 •

edited

Loading

JorTurFer commented Nov 28, 2022 •

edited

Loading

JorTurFer commented Dec 10, 2022 •

edited

Loading

zroubalik commented Dec 11, 2022 •

edited

Loading

JorTurFer commented Dec 23, 2022 •

edited

Loading

JorTurFer commented Jul 19, 2023 •

edited

Loading