-
-
Notifications
You must be signed in to change notification settings - Fork 633
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix tcp error with pytorch 1.9.1 #2211
Conversation
@sdesrozis can you provide please a link to the docs or code about that ? |
That's what I experimented. From now, default env variables are set by
|
OK, I see your point. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM!
Can you send another PR to fix another CI issue ?
By the way, in the fixed test we are using torch.distributed.launch
and not torch.distributed.run
. Should we update that as well ?
I'm on it.
I will explore but |
Ouch I tried using PyTorch 1.8.1 1.9.0 and 1.9.1 but nightly does a ValueError assertion. I will fix and add a comment. |
We had to also update circle ci but there is no yet 1.9.1 docker images |
Fixes #2210
Description:
It seems that using PyTorch 1.9.1,
tcp
port should have the same value thanmaster_port
. Default is29500
.Check list: