-
Notifications
You must be signed in to change notification settings - Fork 769
{mpi}[GCC/13.3.0] OpenMPI v5.0.3, PMIx v5.0.2 #17561
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
{mpi}[GCC/13.3.0] OpenMPI v5.0.3, PMIx v5.0.2 #17561
Conversation
|
I don't think there is really anything new to do with regards to CUDA. Just continue to patch in support for internal header. |
easybuild/easyconfigs/o/OpenMPI/OpenMPI-5.0.0rc10-GCC-12.2.0.eb
Outdated
Show resolved
Hide resolved
|
is this PR going to be merged soon? I would be interested in using this version of OpenMPI. |
…penMPI500rc10 bump PMIx and OpenMPI to 5.0.1 and use GCC 13.2.0
|
My remaining question here is, whether we want to add the CUDA-related patches first, or merge this PR as is and add the CUDA-related patches in a follow-up PR? |
|
@boegelbot please test @ jsc-zen3 |
|
@SebastianAchilles: Request for testing this PR well received on jsczen3l1.int.jsc-zen3.fz-juelich.de PR test command '
Test results coming soon (I hope)... - notification for comment with ID 1904058315 processed Message to humans: this is just bookkeeping information for me, |
|
Test report by @boegelbot |
|
I can have a look this week to see how hard it is to port over the internal CUDA patches... |
|
@boegelbot please test @ jsc-zen3 |
|
@SebastianAchilles: Request for testing this PR well received on jsczen3l1.int.jsc-zen3.fz-juelich.de PR test command '
Test results coming soon (I hope)... - notification for comment with ID 1904168663 processed Message to humans: this is just bookkeeping information for me, |
|
Test report by @boegelbot |
This patch has changed since libcuda is no longer dlopen()'ed by Open MPI. Instead we can generate a stub library, and at runtime the CUDA-dependent DSO's (but not the main libmpi.so library) load libcuda.so. This is then consistent with https://docs.open-mpi.org/en/v5.0.x/tuning-apps/networking/cuda.html (but --enable-mca-dso=<comma-delimited-list-of-cuda-components> is done by default already)
|
Test report by @bedroge |
|
Test report by @bedroge |
|
Perhaps we should set |
|
We should probably also pass See: |
|
Test report by @SebastianAchilles Note: Using |
| # to enable SLURM integration (site-specific) | ||
| # configopts += '--with-slurm --with-pmi=/usr/include/slurm --with-pmi-libdir=/usr' |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is obsolete now, as Open MPI 5 only supports PMIx, no more PMI-1 or PMI-2.
The --with-slurm option also applies to PMIx instead (selected by default on all OSes that Slurm supports, no need to set it manually)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Done in boegel#97
…errors=no in OpenMPI-5.0.3-GCC-13.3.0.eb
…penMPI503 remove outdated comment about Slurm support and add --with-show-load-errors=no in OpenMPI-5.0.3-GCC-13.3.0.eb
|
@boegelbot please test @ jsc-zen3 |
|
@bedroge: Request for testing this PR well received on jsczen3l1.int.jsc-zen3.fz-juelich.de PR test command '
Test results coming soon (I hope)... - notification for comment with ID 2136006743 processed Message to humans: this is just bookkeeping information for me, |
|
Test report by @branfosj |
|
Test report by @branfosj |
|
Test report by @branfosj |
|
Test report by @boegelbot |
|
@boegelbot please test @ generoso |
|
@bedroge: Request for testing this PR well received on login1 PR test command '
Test results coming soon (I hope)... - notification for comment with ID 2136035919 processed Message to humans: this is just bookkeeping information for me, |
|
Test report by @branfosj |
|
Test report by @boegelbot edit: ah, I see that's expected on generoso. |
|
Test report by @bedroge |
|
Test report by @bedroge |
|
Test report by @bedroge |
bartoldeman
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Lgtm
|
Test report by @bedroge |
(created using
eb --new-pr)WIP since we're using release candidates here, not final releases.I had to strip out the CUDA-related patches we are using for OpenMPI 4.1.5 to get the build working, we'll need to figure out how to move forward there (cc @Micket, @bartoldeman)requires: