Support for Resource Claims and DRA #15332
Labels
area/API: API objects and controllers
kind/feature: Well-understood/specified features, ready for coding.
lifecycle/frozen: Indicates that an issue or PR should not be auto-closed due to staleness.
Describe the feature
As the K8s community is actively moving toward resource claims, it would be great to add support for them at some point (filing this for future reference).
The feature has been Alpha since 1.26 and will stay Alpha in 1.31, but it is being worked on aggressively (see kubernetes/enhancements#3063 (comment)), so it seems likely to move to Beta and then GA soon.
In Knative, trying to set a resource claim currently fails validation, as expected:
/area API
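To illustrate the validation failure described above, here is a minimal Python sketch. It assumes the rejection comes from an allow-list check on the fields of the pod spec. The allow-list contents, the `disallowed_fields` helper, the image and claim names, and the message wording are all hypothetical and are not taken from Knative's code or output. Only the `resourceClaims` and `resources.claims` field names come from the upstream Kubernetes Pod API.

```python
# Hypothetical allow-list of top-level PodSpec fields.
# This is NOT Knative's actual list; it only illustrates the idea.
ALLOWED_POD_SPEC_FIELDS = {"containers", "volumes", "serviceAccountName"}


def disallowed_fields(pod_spec, allowed=ALLOWED_POD_SPEC_FIELDS):
    """Return the top-level PodSpec fields that are not allow-listed, sorted."""
    return sorted(key for key in pod_spec if key not in allowed)


# PodSpec fragment that requests a resource claim.
# The reference from the claim entry to a ResourceClaim or template is
# omitted here because its exact shape differs between Kubernetes versions.
pod_spec = {
    "containers": [
        {
            "name": "user-container",
            "image": "example.com/app:latest",  # illustrative image name
            "resources": {"claims": [{"name": "gpu"}]},
        }
    ],
    "resourceClaims": [{"name": "gpu"}],
}

rejected = disallowed_fields(pod_spec)
if rejected:
    # Illustrative message only; the real error text may differ.
    print("validation failed, fields not allowed: " + ", ".join(rejected))
```

Under this assumption, supporting resource claims would mean adding `resourceClaims` (and the per-container `resources.claims` entries) to whatever set of fields is permitted, possibly behind a feature flag.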
References
Unleashing the Power of DRA (Dynamic Resource Allocation) for Just-in-Time GPU Slicing
What Can I Get You? An Introduction to Dynamic Resource Allocation - Freddy Rolland & Adrian Chiris
Deploy vLLM server on Kubernetes using NVIDIA Kubernetes DRA driver
KCSEU 2024 - Dynamic Resource Allocation - the path towards GA - Kevin Klues Patrick Ohly
Meeting notes from K8s Serving WG
K8s issues/KEPs:
Dynamic Resource Allocation with Control Plane Controller
DRA: structured parameters