[Core] Thread leaking when submitting actor tasks to actors, leading to reach to the system limit #33957
Labels
bug
Something that is supposed to be working; but isn't
core
Issues that should be addressed in Ray Core
core-correctness
Leak, crash, hang
core-worker
P1
Issue that should be fixed within a few weeks
Ray-2.7
size-large
What happened + What you expected to happen
I have an actor that keeps track of file locations. It has exactly one instance running, and is an async actor. However, as training runs and more and more remote calls are made to the actor the thread count used by the actor process keeps increasing, until eventually it hits the system limit. I can increase the system limit, but I would like to understand what is happening.
Versions / Dependencies
master
Reproduction script
n/a
Issue Severity
High: It blocks me from completing my task.
The text was updated successfully, but these errors were encountered: