Conversation

@virtualluke
Contributor

What do these changes do?

Added a `type: ray` label to the deployments' pod metadata and keyed pod anti-affinity on hosts off that label. This prevents Kubernetes from scheduling more than one pod of type `ray` onto the same host.
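A minimal sketch of what this looks like in a deployment spec, assuming the `type: ray` label described above; the deployment name, topology key, and exact selector structure are illustrative rather than copied from this PR:

```yaml
apiVersion: apps/v1
kind: Deployment
metadata:
  name: ray-worker          # hypothetical name, not necessarily the one in this PR
spec:
  template:
    metadata:
      labels:
        type: ray           # the label anti-affinity keys off of
    spec:
      affinity:
        podAntiAffinity:
          # Hard rule: the scheduler refuses to place two 'ray' pods on the same host.
          requiredDuringSchedulingIgnoredDuringExecution:
            - labelSelector:
                matchExpressions:
                  - key: type
                    operator: In
                    values: ["ray"]
              topologyKey: kubernetes.io/hostname
```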

@AmplabJenkins

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/12228/

Contributor

@ericl ericl left a comment

How about using the soft form, preferredDuringSchedulingIgnoredDuringExecution, instead?
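(For comparison, a sketch of the soft form being suggested here; the weight and selector details are assumptions, not taken from the PR:)

```yaml
affinity:
  podAntiAffinity:
    # Soft rule: the scheduler prefers to spread 'ray' pods across hosts,
    # but will still co-locate them if no other node fits.
    preferredDuringSchedulingIgnoredDuringExecution:
      - weight: 100
        podAffinityTerm:
          labelSelector:
            matchExpressions:
              - key: type
                operator: In
                values: ["ray"]
          topologyKey: kubernetes.io/hostname
```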

@virtualluke
Contributor Author

It seems like you would always want the anti-affinity during both scheduling and execution, not just during execution.

@ericl
Contributor

ericl commented Feb 25, 2019 via email

@robertnishihara robertnishihara self-assigned this Mar 6, 2019
@virtualluke
Contributor Author

I definitely prefer the harder form (requiredDuringSchedulingIgnoredDuringExecution) over the softer one (preferredDuringSchedulingIgnoredDuringExecution). I am looking at cluster stability and want a hard rule about scheduling of ray pods. If others want it as a cluster scheduling suggestion (which is the softer form), then I am ok with that; I will just use the hard version on our cluster.

Contributor

@ericl ericl left a comment

Ok, I think either is fine if you think there's a strong benefit, so this LGTM.

@AmplabJenkins

Can one of the admins verify this patch?

@robertnishihara
Collaborator

I just tried this out and it works for me.

One issue I ran into when running this out of the box (pre-existing, I think, and probably unrelated to this PR) is that `kubectl create -f ray/kubernetes/submit.yaml` didn't succeed because it required too much memory on the workers. We could increase the amount of memory requested by the pods, but then it would be harder to run out of the box (e.g., on minikube the pods don't get scheduled when more resources are requested).
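(For context, a hypothetical example of the per-container resource request being discussed; the values are illustrative and not the ones in ray/kubernetes/submit.yaml:)

```yaml
resources:
  requests:
    memory: "512Mi"   # lower request -> easier to schedule on minikube
    cpu: "1"
  limits:
    memory: "2Gi"
```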

@robertnishihara robertnishihara merged commit 08a4769 into ray-project:master Mar 11, 2019
@robertnishihara
Collaborator

Thanks @virtualluke!
