feat: deploy SLA profiler to k8s #2030
…-create-deploy-crds-for-vllm_v1-example
… of github.com:ai-dynamo/dynamo into hannahz/dep-216-create-deploy-crds-for-vllm_v1-example
… of https://github.com/ai-dynamo/dynamo into hannahz/dep-216-create-deploy-crds-for-vllm_v1-example
… of https://github.com/ai-dynamo/dynamo into hzhou/profile_vllmv1_k8s
…mo/dynamo into hzhou/profile_vllmv1_k8s
@hhzhang16 Could you hold off for a few minutes and then rebase? The files will be in
sure, thanks for letting us know!
It's ready!
@tedzhouhk @grahamking rebased!
I, euh, maybe a tiny bit changed them again. #2055
…-deploy-sla-profiler-to-k8s
Rebased again haha
dep-233-deploy-sla-profiler-to-k8s
…github.com/ai-dynamo/dynamo into hannahz/dep-233-deploy-sla-profiler-to-k8s
Overview:
This MR revamps the SLA profiler to use DynamoGraphDeployments (DGDs) and moves it from running locally to running in Kubernetes. Key improvements include error handling, result caching, and K8s integration.
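To illustrate what result caching can look like in a profiler of this kind, here is a minimal sketch. It is not the code from this MR: `cached_profile`, `result_dir`, `config_name`, and `run_profile` are hypothetical names, and the one-JSON-file-per-configuration layout is an assumption. The idea is simply to skip a profiling run when a readable result file for that configuration already exists.

```python
import json
from pathlib import Path
from typing import Callable


def cached_profile(
    result_dir: Path,
    config_name: str,
    run_profile: Callable[[], dict],
) -> dict:
    """Return cached results for config_name if present, else run and store them.

    All names here are hypothetical; the layout (one JSON file per
    configuration) is an assumption made for illustration only.
    """
    result_file = result_dir / f"{config_name}.json"

    if result_file.exists():
        try:
            # Reuse the earlier result instead of re-running the profile.
            return json.loads(result_file.read_text())
        except json.JSONDecodeError:
            # A corrupt or partially written file is treated as a cache miss.
            pass

    result = run_profile()
    result_dir.mkdir(parents=True, exist_ok=True)
    result_file.write_text(json.dumps(result))
    return result
```

Treating an unreadable file as a cache miss means an interrupted run is simply repeated on the next attempt, rather than failing or returning partial data.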
Details:
Where should the reviewer start?
Related Issues: (use one of the action keywords Closes / Fixes / Resolves / Relates to)
Summary by CodeRabbit
New Features
Enhancements
Bug Fixes
Chores