
Support for kubelet config options: imageGCHighThresholdPercent, imageGCLowThresholdPercent #2065

Closed
DZDomi opened this issue Apr 12, 2022 · 4 comments · Fixed by #2219

DZDomi commented Apr 12, 2022

What I'd like:

We were running into an issue on our development EKS cluster when we tried to update multiple deployments through Argo CD on the same EKS node. The node hit the DiskPressure condition for one to two minutes during each parallel deployment before reverting to NoDiskPressure. We tracked the issue down to the following timeline:

  • New images (read: >20 images at approximately the same time) are pushed through CI/CD to different ECR repos
  • Argo CD Image Updater detects these new images and changes the image tags in each deployment
  • Kubernetes tries to schedule some of these pods on a specific node
  • The node is currently sitting at around 82-84% disk usage
  • The kubelet tries to pull the new images (each approx. 150-200 MB)
  • The kubelet image garbage collector kicks in and starts deleting images, since usage crossed the default 85% threshold; deleting all of them takes a few minutes
  • Meanwhile the new images are pulled in parallel, tipping disk usage above 90%, so the hard eviction threshold is met
  • The node changes state to DiskPressure and stops scheduling new pods
  • The kubelet garbage collector finishes deleting images (after a few minutes)
  • The node changes back to NoDiskPressure

These garbage collection thresholds can be tuned via the kubelet configuration, but Bottlerocket does not currently expose them. The following kubelet options should be configurable through Bottlerocket settings:

imageGCHighThresholdPercent: xx
imageGCLowThresholdPercent: xx

AWS itself allows configuring these options on its EKS-optimized AMIs: https://aws.amazon.com/premiumsupport/knowledge-center/eks-worker-nodes-image-cache/
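
Something along these lines in the Bottlerocket user data would be ideal. The setting names below are just guesses mirroring the kubelet option names and the existing kebab-case convention, not settings that exist today:

[settings.kubernetes]
# Hypothetical setting names; the final names and value types
# depend on how the settings end up being modeled.
image-gc-high-threshold-percent = 80
image-gc-low-threshold-percent = 60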

Any alternatives you've considered:

We had to lower the hard eviction thresholds from their defaults in order to not run into this condition:

[settings.kubernetes.eviction-hard]
"nodefs.available" = "5%"
"imagefs.available" = "10%"

This is not really optimal, since it lets the node get much closer to running out of disk space before eviction kicks in.


zmrow commented Apr 13, 2022

Thanks for the report! We'll consider surfacing those options.

Could nodes be launched with a slightly larger data volume to avoid running so close to the thresholds?


DZDomi commented Apr 13, 2022

Hey, thanks for the fast reply! Yes, that would certainly be an option and we will consider it, thanks! But in the end it is just a workaround for the underlying issue. Also, if you are running hundreds of nodes, storage costs will certainly increase (especially if you run a lot of smaller nodes).

Would be happy to see this as an option in the user data config. Also happy to make the PR myself if you can guide me to the right parts of the code.

zmrow added the area/kubernetes and area/core labels Apr 13, 2022
bcressey commented

> Also happy to make the PR myself if you can guide me to the right parts of the code.

Certainly! It looks like we haven't added more of these settings in a while, but #1659 has the basic structure:

  1. changes to the model and modeled types to add the settings and any necessary validation
  2. changes to documentation to describe the new settings
  3. changes to all the relevant kubelet config templates to render the settings (see the sketch after this list)
  4. migrations, to ensure that the new settings are erased on downgrade
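
For step 3, Bottlerocket renders config files like the kubelet's config.yaml from handlebars templates, so the new settings would only be emitted when a user actually sets them. A rough, hypothetical fragment (the setting names and exact template are assumptions, not the final implementation):

{{#if settings.kubernetes.image-gc-high-threshold-percent}}
imageGCHighThresholdPercent: {{settings.kubernetes.image-gc-high-threshold-percent}}
{{/if}}
{{#if settings.kubernetes.image-gc-low-threshold-percent}}
imageGCLowThresholdPercent: {{settings.kubernetes.image-gc-low-threshold-percent}}
{{/if}}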

On that last point: migrations are frankly just painful and we're currently iterating on approaches to make them either unnecessary or more bearable.

The problem they solve is that new releases of Bottlerocket will have your changes and know about the new settings, but if someone upgrades to that new release and later downgrades to an older release, the older release will not understand the new settings and will choke when it encounters them. This is by design, to avoid similar accidents where a security- or performance-critical setting contains a typo and gets silently ignored rather than applied, but it is also very unintuitive.

To work around this, whenever we add new settings, we create migration binaries that remove those settings on downgrade. There are helper macros and build system integrations to make this less of a chore, but it's still not intuitive. Up to you whether you want to go down this rabbit hole in your PR; if you'd prefer to ignore it, we're still delighted to have the contribution and can address it later during release prep.
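
As a rough illustration of what such a migration looks like, new-setting migrations are typically tiny Rust binaries built on the migration-helpers crate. A hypothetical sketch for these two settings (the setting names and helper choice are assumptions, not the final code):

use migration_helpers::common_migrations::AddSettingsMigration;
use migration_helpers::{migrate, Result};
use std::process;

// Forward migration is a no-op; on downgrade, the listed settings are
// removed so older releases never see keys they don't understand.
fn run() -> Result<()> {
    migrate(AddSettingsMigration(&[
        "settings.kubernetes.image-gc-high-threshold-percent",
        "settings.kubernetes.image-gc-low-threshold-percent",
    ]))
}

fn main() {
    if let Err(e) = run() {
        eprintln!("{}", e);
        process::exit(1);
    }
}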


zmrow commented May 2, 2022

@DZDomi - let us know if you have any questions about the above! We're happy to help with a PR if you think you'd like to try putting one together.

kdaula assigned mchaker and unassigned zmrow Jun 1, 2022
kdaula added this to the 1.9.0 milestone Jun 1, 2022
kdaula added this to 1.9.0 Jun 1, 2022
kdaula modified the milestones: 1.9.0, 1.10.0 Jun 2, 2022
kdaula removed this from 1.9.0 Jun 2, 2022
kdaula added this to 1.9.0 Jun 2, 2022
kdaula modified the milestones: 1.10.0, 1.9.0 Jun 2, 2022
mchaker moved this to Todo in 1.9.0 Jun 13, 2022
mchaker moved this from Todo to In Progress in 1.9.0 Jun 15, 2022
Repository owner moved this from In Progress to Done in 1.9.0 Jul 14, 2022