The problem is that Ray does not detect the correct CPU count when CPU bursting is enabled on our OpenShift cluster. If I start a pod with 1 CPU and CPU bursting is not available, then /sys/fs/cgroup/cpu/cpu.cfs_quota_us contains the right value. However, with CPU bursting on, it contains -1.
In this case, I think /sys/fs/cgroup/cpu/cpu.shares contains the "guaranteed" millicores assigned to the pod, but Ray does not currently read that file.
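To make the suggested fallback concrete, here is a minimal sketch of how the CPU count could be derived from the cgroup v1 files mentioned above. It is not Ray's actual detection code: `estimate_container_cpus` and `_read_int` are hypothetical names, and the mapping of 1024 shares to one CPU is an assumption based on the usual Kubernetes convention for converting CPU requests into `cpu.shares`.

```python
import os
from pathlib import Path
from typing import Optional

# cgroup v1 CPU controller directory, as referenced in the report above.
CGROUP_CPU_DIR = Path("/sys/fs/cgroup/cpu")


def _read_int(path: Path) -> Optional[int]:
    """Return the integer stored in `path`, or None if it cannot be read."""
    try:
        return int(path.read_text().strip())
    except (OSError, ValueError):
        return None


def estimate_container_cpus(cgroup_dir: Path = CGROUP_CPU_DIR) -> float:
    """Hypothetical helper: estimate the CPUs available to this container."""
    quota = _read_int(cgroup_dir / "cpu.cfs_quota_us")
    period = _read_int(cgroup_dir / "cpu.cfs_period_us")

    # A positive quota is a hard limit: CPUs = quota / period.
    if quota is not None and quota > 0 and period:
        return quota / period

    # A quota of -1 means "no limit", which is the CPU bursting case.
    # Fall back to cpu.shares as a proxy for the guaranteed CPU request.
    shares = _read_int(cgroup_dir / "cpu.shares")
    # Assumption: 1024 shares correspond to one full CPU.
    if shares is not None and shares > 0:
        return shares / 1024

    # Last resort: the number of CPUs visible on the host.
    return float(os.cpu_count() or 1)


if __name__ == "__main__":
    print(f"Estimated CPUs: {estimate_container_cpus():.2f}")
```

Note that `cpu.shares` is a relative scheduling weight rather than a hard cap, so the fallback value is only an estimate of the guaranteed share; a bursting pod may be able to use more than that.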
Versions / Dependencies
master
Reproduction script
See above
Issue Severity
No response
jjyao added the bug (something that is supposed to be working, but isn't), core (issues that should be addressed in Ray Core), and P2 (important issue, but not time-critical) labels and removed the triage label on Aug 17, 2022.
What happened + What you expected to happen
This is from https://discuss.ray.io/t/ray-init-fails-to-register-workers/6044