Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add new ECS variants #2286

Closed
wants to merge 3 commits into from

Conversation

arnaldo2792
Copy link
Contributor

Issue number:
Closes #2164

Description of changes:

packages: add kmod-5.15-nvidia sources

This adds the sources to compile the 515 NVIDIA driver for the 5.15
kernel.  This version only supports the GPU architectures Maxwell,
Pascal, Volta, Turing, Ampere, and forward.  The driver will use the GPU
System Processor (GSP) feature if the underlying hardware supports it
by loading the binary file `/lib/firmware/nvidia/<version>/gsp.bin`.
variants: add aws-ecs-1.1 variant
variants: add aws-ecs-1.1-nvidia variant

Testing done:

  • Launched hosts with new ECS variants, and confirm tasks were scheduled in those hosts
  • Tested different types of instances except for p2 instances since those are no longer supported for in this driver:
[   32.672564] NVRM: The NVIDIA Tesla K80 GPU installed in this system is
[   32.672564] NVRM:  supported through the NVIDIA 470.xx Legacy drivers

Terms of contribution:

By submitting this pull request, I agree that this contribution is dual-licensed under the terms of both the Apache License, version 2.0, and the MIT license.

This adds the sources to compile the 515 NVIDIA driver for the 5.15
kernel.  This version only supports the GPU architectures Maxwell,
Pascal, Volta, Turing, Ampere, and forward.  The driver will use the GPU
System Processor (GSP) feature if the underlying hardware supports it
by loading the binary file `/lib/firmware/nvidia/<version>/gsp.bin`.

Signed-off-by: Arnaldo Garcia Rincon <[email protected]>
Signed-off-by: Arnaldo Garcia Rincon <[email protected]>
Signed-off-by: Arnaldo Garcia Rincon <[email protected]>
Copy link
Contributor

@mchaker mchaker left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

👍

Comment on lines +874 to +875
* 470.X
* 515.X
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Would it be helpful to add the corresponding Bottlerocket versions alongside the NVIDIA driver numbers?

e.g.

470.X: Bottlerocket v0.X - v0.Y
515.X: Bottlerocket v0.A - v0.B

* 515.X

The official AMIs for these variants can be used with EC2 GPU-equipped instance types such as: `p2`, `p3`, `p4`, `g3`, `g4dn`, `g5` and `g5g`.
Some of these instance types are built with hardware that is no longer supported by the 515.X NVIDIA driver.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I might rephrase this as:

Suggested change
Some of these instance types are built with hardware that is no longer supported by the 515.X NVIDIA driver.
Note that older instance types, such as 'p2', are not supported by NVIDIA driver 515.X and above.

Comment on lines 4 to +7
"aws-ecs-1",
"aws-ecs-1.1",
"aws-ecs-1-nvidia",
"aws-ecs-1.1-nvidia",
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

nit. While neither this, nor the above, is quite alphabetical, this might be more consistent with the rest:

Suggested change
"aws-ecs-1",
"aws-ecs-1.1",
"aws-ecs-1-nvidia",
"aws-ecs-1.1-nvidia",
"aws-ecs-1",
"aws-ecs-1-nvidia",
"aws-ecs-1.1",
"aws-ecs-1.1-nvidia",

Same goes for the README.md.

@arnaldo2792
Copy link
Contributor Author

I'm closing this since we are still working on our tooling to improve the release process for ECS variants.

@arnaldo2792 arnaldo2792 deleted the ecs-variants branch June 19, 2023 18:35
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

add a new aws-ecs variant for kernel 6.1
3 participants