
[rl] add import torch to provisioner bootstrap to avoid concurrent dlopen … #3220

Merged
shuhuayu merged 1 commit into pytorch:main from shuhuayu:opt on May 5, 2026

Conversation

@shuhuayu (Contributor) commented May 5, 2026

When Monarch spawns sub-processes and multiple actor threads begin unpickling messages concurrently, they all try to import torch at the same time, causing a race condition in dlopen of torch._C.so. This results in the misleading error torch._C is not a package, even though the import works fine when done sequentially. The fix is to import torch in the Provisioner's bootstrap function, which runs once per sub-process before any threading starts, ensuring torch._C.so is fully loaded before concurrent unpickling begins. It's unclear whether this is a PyTorch bug (concurrent dlopen should be thread-safe) or a Monarch bug (imports during unpickling should be serialized), so we've added a TODO to remove the workaround once the upstream fix lands.
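For illustration, here is a minimal sketch of the workaround described above. It is not the actual Monarch code: the Provisioner bootstrap hook name (`provisioner_bootstrap`) and the threaded-unpickling setup are assumptions made only to demonstrate the idea of loading `torch._C.so` once before any concurrent unpickling starts.

```python
# Minimal sketch of the workaround; names and structure are hypothetical,
# not the real Monarch Provisioner API.
import pickle
import threading


def provisioner_bootstrap() -> None:
    """Runs once per sub-process, before any actor threads exist."""
    # Force torch._C.so to be dlopen'ed exactly once, on this thread, so
    # later concurrent unpickling never races on the shared-library load.
    # TODO: remove once the upstream fix (PyTorch or Monarch) lands.
    import torch  # noqa: F401


def actor_thread(payload: bytes) -> None:
    # Unpickling a message may lazily `import torch`; after the bootstrap
    # above this is just a sys.modules cache hit instead of a concurrent dlopen.
    pickle.loads(payload)


if __name__ == "__main__":
    provisioner_bootstrap()
    payload = pickle.dumps({"msg": "hello"})
    threads = [threading.Thread(target=actor_thread, args=(payload,)) for _ in range(8)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
```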

@meta-cla bot added the CLA Signed label (managed by the Meta Open Source bot) on May 5, 2026
@wwwjn (Contributor) commented May 5, 2026

@shuhuayu You can go ahead and land this fix first; the failing RL integration test is fixed in #3041.

@shuhuayu (Contributor, Author) commented May 5, 2026

> @shuhuayu You can go ahead and land this fix first; the failing RL integration test is fixed in #3041.

Sounds good. Tested it locally and it passed; I'll merge it first.

shuhuayu merged commit 2ae1340 into pytorch:main on May 5, 2026
7 of 11 checks passed

Labels: ciflow/8gpu, CLA Signed
