cap cpu shares to the max allowed value on Linux #4204
Merged
Summary
We've recently had reach-outs from customers using very large instance types who are experiencing runtime errors. The u-9tb1.112xlarge instance type, for example, has 448 vCPUs.
Error (truncated in the ECS describe-task API response)
Customers defining more than 256 vCPUs as their container CPU in an ECS task definition will see an error during cgroup creation.
The Linux kernel defines a min and max value for the CPU share: ref (2^1 - 2^18). Currently we have a `minimumCPUShares` that is set to 2. This PR defines a `maximumCPUShares` equal to 262144.

If you give the kernel anything below the min or above the max, the kernel will make sure the value is not out of range and automatically clamp it. Systemd, however, has an additional check that prevents the cgroup creation. A similar patch has been merged into the k8s kubelet as well: kubernetes/kubernetes#93248.
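For illustration, here is a minimal Go sketch of that bounding. `minimumCPUShares` and `maximumCPUShares` are the values discussed in this PR, while `clampCPUShares` is a placeholder helper name rather than the agent's actual code:

```go
package main

import "fmt"

const (
	// Kernel-enforced bounds for cpu.shares: 2^1 and 2^18.
	minimumCPUShares uint64 = 2
	maximumCPUShares uint64 = 262144
)

// clampCPUShares bounds a requested CPU share value so that the
// systemd/cgroup creation never sees an out-of-range value.
func clampCPUShares(shares uint64) uint64 {
	if shares < minimumCPUShares {
		return minimumCPUShares
	}
	if shares > maximumCPUShares {
		return maximumCPUShares
	}
	return shares
}

func main() {
	// 448 vCPUs at 1024 shares per vCPU = 458752, which exceeds 2^18.
	fmt.Println(clampCPUShares(448 * 1024)) // prints 262144
}
```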
In an ECS task definition, the `container.CPU` is a CPU reservation value (not a hard limit). For example, a container defined with 1 vCPU, on the u-9tb1.112xlarge instance, is free to take up the entire 448 vCPUs. The `task.CPU`, on the other hand, is a hard limit; the task will be throttled beyond the defined value.

We use CPU shares in 2 places:

- when `container.CPU` is defined
- when `task.CPU` is not defined

These are explained below in detail. This PR makes changes to both of the above places to account for `maximumCPUShares`.
Implementation details
1. When `container.CPU` is defined, cap it before sending it to Docker.

This handles the case where a task definition has container CPU defined. If the container CPU share is more than the max, we cap it to the max. This is also consistent with how we bound the container CPU share to the min value today.
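A rough sketch of where this cap applies, assuming the translation lands in Docker's `HostConfig.CPUShares` (the `HostConfig` type is from the Docker API; the surrounding function is illustrative and not the agent's exact code):

```go
// Sketch only: cap the per-container CPU shares before handing them to Docker.
package dockerclient

import (
	dockercontainer "github.com/docker/docker/api/types/container"
)

const (
	minimumCPUShares int64 = 2      // existing lower bound
	maximumCPUShares int64 = 262144 // new upper bound added in this PR
)

func applyContainerCPUShares(hostConfig *dockercontainer.HostConfig, containerCPU uint) {
	cpuShares := int64(containerCPU)
	if cpuShares < minimumCPUShares {
		// Existing behaviour: bound the share to the kernel minimum.
		cpuShares = minimumCPUShares
	}
	if cpuShares > maximumCPUShares {
		// New behaviour: bound the share to the kernel maximum as well.
		cpuShares = maximumCPUShares
	}
	hostConfig.CPUShares = cpuShares
}
```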
2. When `task.CPU` is not defined, cap it before creating the task cgroup.

If an ECS task definition does not have `task.CPU` defined, the ECS agent sums up the `container.CPU` values and uses the sum as the task's CPU share. This is handled by the `buildImplicitLinuxCPUSpec` method - implicit because the task CPU was not defined explicitly. When customers explicitly define `task.CPU`, we do not create the task cgroup with a CPU share value; we use CPU quota and period (ref) to hard-limit the task.
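A simplified sketch of that implicit computation (the types and the `buildImplicitCPUSpec` function below are illustrative stand-ins, not the agent's actual `buildImplicitLinuxCPUSpec` code):

```go
// Sketch only: when task.CPU is absent, sum the container-level CPU values and
// bound the result before it becomes the task cgroup's CPU share.
package taskcgroup

const (
	minimumCPUShares uint64 = 2
	maximumCPUShares uint64 = 262144
)

type containerDefinition struct {
	CPU uint
}

type linuxCPUSpec struct {
	Shares uint64
}

func buildImplicitCPUSpec(containers []containerDefinition) linuxCPUSpec {
	var taskShares uint64
	for _, c := range containers {
		taskShares += uint64(c.CPU)
	}
	if taskShares < minimumCPUShares {
		taskShares = minimumCPUShares
	}
	if taskShares > maximumCPUShares {
		taskShares = maximumCPUShares
	}
	return linuxCPUSpec{Shares: taskShares}
}
```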
Testing
New tests cover the changes: yes
Added unit tests to cover the new code. Unfortunately, I was unable to find capacity for a large instance type (>256 vCPUs), so we can't test this on a real instance. The unit tests suffice for testing the new code added.
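For example, the capping behaviour lends itself to a table-driven test along these lines, exercising the hypothetical `clampCPUShares` helper from the earlier sketch (not the actual tests added in this PR):

```go
package taskcgroup

import "testing"

// Illustrative table-driven test for the capping behaviour.
func TestClampCPUShares(t *testing.T) {
	cases := []struct {
		name string
		in   uint64
		want uint64
	}{
		{"below kernel minimum", 0, 2},
		{"within range", 1024, 1024},
		{"above kernel maximum (448 vCPUs x 1024)", 448 * 1024, 262144},
	}
	for _, tc := range cases {
		t.Run(tc.name, func(t *testing.T) {
			if got := clampCPUShares(tc.in); got != tc.want {
				t.Errorf("clampCPUShares(%d) = %d, want %d", tc.in, got, tc.want)
			}
		})
	}
}
```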
Description for the changelog
enhancement: cap cpu shares to the max allowed value on Linux.
Does this PR include breaking model changes? If so, have you added transformation functions?
no
Licensing
By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.