runtime: warn before using singleton mode #10289
If PMIx is unreachable, but a PMI1/2 or SLURM environment is detected, issue a warning before "falling back" to singleton mode.

Refs. open-mpi#10286

Signed-off-by: Gilles Gouaillardet <[email protected]>
@bwbarrett here is an idea on how to tackle #10286. We could also add an MCA parameter to select which action should be taken:
FWIW, I tested both SLURM and PMI in order to support
        rank_str = getenv("SLURM_PROCID");
    }
    int rank = (NULL != rank_str)?atoi(rank_str):0;
    if (0 == rank) {
What if this really is rank 0? Should -1 be used instead?
The rationale for defaulting to 0 is that we do our best to limit the warning message to rank 0. But if we cannot figure out the rank (likely because of a busted environment?), I'd rather have all the ranks print the warning message than none.
I agree with @ggouaillardet on this one.
    if (NULL == size_str) {
        size_str = getenv("SLURM_NPROCS");
    }
    int size = (NULL != size_str)?atoi(size_str):1;
you made my brain hurt for no reason :).
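For readers following the two review threads above, here is a minimal standalone sketch of the rank and size fallbacks under discussion. It is not the code from this PR. The quoted diff only shows `SLURM_PROCID` and `SLURM_NPROCS` as the second lookup; `PMI_RANK` and `PMI_SIZE` as the first lookup are an assumption, and the function names `first_env`, `guess_rank`, `guess_size` and `should_warn` are hypothetical.

```c
#include <stdio.h>
#include <stdlib.h>

/* Return the value of `primary` if set, else the value of `fallback`,
 * else NULL. (Hypothetical helper, not part of Open MPI.) */
const char *first_env(const char *primary, const char *fallback)
{
    const char *value = getenv(primary);
    if (NULL == value) {
        value = getenv(fallback);
    }
    return value;
}

/* Best-effort rank. PMI_RANK is an assumed variable name; SLURM_PROCID
 * appears in the quoted diff. When neither is set the rank defaults to 0,
 * so every process with an unknown rank still prints the warning. */
int guess_rank(void)
{
    const char *rank_str = first_env("PMI_RANK", "SLURM_PROCID");
    return (NULL != rank_str) ? atoi(rank_str) : 0;
}

/* Best-effort job size. PMI_SIZE is an assumed variable name;
 * SLURM_NPROCS appears in the quoted diff. Defaults to 1. */
int guess_size(void)
{
    const char *size_str = first_env("PMI_SIZE", "SLURM_NPROCS");
    return (NULL != size_str) ? atoi(size_str) : 1;
}

/* Nonzero when this process should emit the singleton warning. */
int should_warn(void)
{
    return 0 == guess_rank();
}
```

Defaulting to -1 instead, as suggested in the first thread, would make `should_warn()` false for every process whose rank cannot be determined, so a broken environment would produce no warning at all.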