[Docs][KubeRay] Add Volcano RayJob gang scheduling example #58320
| Original file line number | Diff line number | Diff line change | ||||
|---|---|---|---|---|---|---|
|
|
@@ -324,3 +324,146 @@ Finally, clean up the remaining cluster and queue: | |||||
| kubectl delete raycluster test-cluster-1 | ||||||
| kubectl delete queue kuberay-test-queue | ||||||
| ``` | ||||||
|
|
||||||
| ### Step 6: Use Volcano for RayJob gang scheduling | ||||||
|
|
||||||
| Starting with KubeRay 1.5.0, KubeRay supports gang scheduling for RayJob custom resources. | ||||||
|
|
||||||
| First, create a queue with a capacity of 4 CPUs and 6Gi of RAM, and a RayJob with a head node (1 CPU + 2Gi of RAM), two workers (1 CPU + 1Gi of RAM each), and a submitter pod (0.5 CPU + 200Mi of RAM), for a total of 3500m CPU and 4296Mi of RAM. | ||||||
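As a quick sanity check on the totals quoted above, the arithmetic can be sketched in shell. The variable names are illustrative only and aren't part of the sample manifest; the sketch assumes 1 CPU = 1000m and 1Gi = 1024Mi.

```shell
# Per-pod requests as described in the text (illustrative variable names).
head_cpu_m=1000;      head_mem_mi=$((2 * 1024))   # head: 1 CPU, 2Gi
worker_cpu_m=1000;    worker_mem_mi=1024          # each worker: 1 CPU, 1Gi
workers=2
submitter_cpu_m=500;  submitter_mem_mi=200        # submitter: 0.5 CPU, 200Mi

# Sum the requests across the head, all workers, and the submitter.
total_cpu_m=$((head_cpu_m + workers * worker_cpu_m + submitter_cpu_m))
total_mem_mi=$((head_mem_mi + workers * worker_mem_mi + submitter_mem_mi))

echo "total: ${total_cpu_m}m CPU, ${total_mem_mi}Mi RAM"
# prints "total: 3500m CPU, 4296Mi RAM"
```

Both totals fit within the queue capacity of 4 CPUs (4000m) and 6Gi (6144Mi), so a single RayJob of this shape can be gang-scheduled in the queue.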
|
|
||||||
| ```shell | ||||||
| curl -LO https://raw.githubusercontent.com/ray-project/kuberay/master/ray-operator/config/samples/ray-job.volcano-scheduler-queue.yaml | ||||||
|
||||||
| curl -LO https://raw.githubusercontent.com/ray-project/kuberay/master/ray-operator/config/samples/ray-job.volcano-scheduler-queue.yaml | |
| curl -LO https://raw.githubusercontent.com/ray-project/kuberay/v1.5.0/ray-operator/config/samples/ray-job.volcano-scheduler-queue.yaml |
I think a fixed version would be better, cc @win5923
Yes, changed to use v1.5.0-rc.0.
b58c683
I think it's a little bit weird to use `Step 6`. Instead, can we just use `Use Volcano for RayJob gang scheduling`?
https://anyscale-ray--58320.com.readthedocs.build/en/58320/cluster/kubernetes/k8s-ecosystem/volcano.html#step-6-use-volcano-for-rayjob-gang-scheduling
Sure, this is better. Fixed in b58c683