You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The problem is in our case magnified since we are using actors, which as I understand it shuts down the worker they are running on when they are done.
Here is the output of the script
Make it fail: False
2021-11-03 12:46:37,597 INFO services.py:1270 -- View the Ray dashboard at http://127.0.0.1:8265
10
10
Make it fail: True
2021-11-03 12:46:41,696 INFO services.py:1270 -- View the Ray dashboard at http://127.0.0.1:8265
Traceback (most recent call last):
File "play_ray_issue_2.py", line 55, in <module>
asyncio.run(main(True))
File "/usr/lib/python3.8/asyncio/runners.py", line 44, in run
return loop.run_until_complete(main)
File "/usr/lib/python3.8/asyncio/base_events.py", line 616, in run_until_complete
return future.result()
File "play_ray_issue_2.py", line 49, in main
print(await process_data.remote(ref))
ray.exceptions.RayTaskError: ray::process_data() (pid=3242475, ip=10.0.0.63)
At least one of the input arguments for this task could not be computed:
ray.exceptions.OwnerDiedError: Failed to retrieve object 32cccd03c567a254ffffffffffffffffffffffff0100000002000000. To see information about where this ObjectRef was created in Python, set the environment variable RAY_record_ref_creation_sites=1 during `ray start` and `ray.init()`.
The object's owner has exited. This is the Python worker that first created the ObjectRef via `.remote()` or `ray.put()`. Check cluster logs (`/tmp/ray/session_latest/logs/*8011c02e5d0655287e75d4d9920c4238139f2b0956d9a752874fa6d8*` at IP address 10.0.0.63) for more information about the Python worker failure.
Are you willing to submit a PR?
Yes I am willing to submit a PR!
The text was updated successfully, but these errors were encountered:
birgerbr
added
bug
Something that is supposed to be working; but isn't
triage
Needs triage (eg: priority, bug/not-bug, and owning component)
labels
Nov 3, 2021
Search before asking
Ray Component
Ray Core
What happened + What you expected to happen
An object stored using
ray.put
is lost when the owner dies, even though there still exist references to the object.I was expecting the object to be copied during the worker shutdown, such that it is not lost.
Versions / Dependencies
Ray 1.8.0, Python 3.8.10, Ubuntu 20.04
Reproduction script
Anything else
The problem is in our case magnified since we are using actors, which as I understand it shuts down the worker they are running on when they are done.
Here is the output of the script
Are you willing to submit a PR?
The text was updated successfully, but these errors were encountered: