added non-blocking root communicator #1478

gberg617 · 2024-12-11T20:19:41Z

Summary

This PR adds a communicator that sends messages from any rank to the root rank without using collective calls. This is useful when an arbitrary rank hits an error that must be forwarded to the root rank and written to a file.

gberg617 · 2024-12-11T20:20:14Z

Unit testing and documentation will be added to this PR in follow-up commits.

rhornung67 · 2024-12-11T20:24:21Z

src/axom/lumberjack/MPIUtility.cpp

+  MPI_Status mpiStatus;
+
+  // Get size and source of MPI message
+  int mpiFlag = true;


Similar comment here and below. You could define static constexpr integer variables that use names containing true and false to make the code more readable and avoid magic numbers.
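As a minimal sketch of that suggestion, the flag could be initialized from and compared against named constants. The names `LJ_FLAG_FALSE`, `LJ_FLAG_TRUE`, `initialProbeFlag`, and `messageIsWaiting` are hypothetical and are not taken from the PR:

```cpp
// Hypothetical named constants for the integer flag that a probe call fills in.
// These names are illustrative; they do not come from the PR.
static constexpr int LJ_FLAG_FALSE = 0;
static constexpr int LJ_FLAG_TRUE = 1;

// Readable initialization: assume nothing is pending until a probe says otherwise.
inline int initialProbeFlag() { return LJ_FLAG_FALSE; }

// Returns true when the flag reports a pending message.
// Any non-zero flag counts as "true", so compare against the false constant.
inline bool messageIsWaiting(int mpiFlag) { return mpiFlag != LJ_FLAG_FALSE; }
```

Comparing against the "false" constant rather than the "true" one keeps the check valid for any non-zero flag value.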

src/axom/lumberjack/MPIUtility.cpp

rhornung67 · 2024-12-11T20:29:10Z

src/axom/lumberjack/MPIUtility.hpp

+ * \param [in] comm The MPI Communicator.
+ *****************************************************************************
+ */
+const char* mpiNonBlockingReceiveMessages(MPI_Comm comm, const int tag = 0);


Why is the tag argument here?

The MPI communication calls currently use the value associated with LJ_TAG by default (defined in MPIUtility.cpp). The non-blocking receives used by the new communicator in this PR work better when we use another tag in order to not conflict with other communicators. I added logic into the MPI utility functions to check whether the tag was overridden (i.e. non-zero). In those cases, the sends/receives will use the tag value passed in. Otherwise, we revert to the default LJ_TAG for MPI communication. Setting this default in the function declarations prevents us from having to change all the existing calls to these methods by other communicators.

Ah. Got it. Thanks for the explanation.
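The tag fallback described above could be sketched roughly as follows. The helper name `resolveTag` is hypothetical, and the value given to `LJ_TAG` here is only a placeholder for illustration (the real constant is defined in MPIUtility.cpp):

```cpp
// Placeholder value for illustration only; the real LJ_TAG lives in MPIUtility.cpp.
constexpr int LJ_TAG = 32766;

// Hypothetical helper: a tag of 0 means "not overridden", so fall back to LJ_TAG.
// Any non-zero tag passed by the caller is used as-is.
constexpr int resolveTag(int tag)
{
  return (tag == 0) ? LJ_TAG : tag;
}

// Existing callers that omit the tag keep their current behavior.
static_assert(resolveTag(0) == LJ_TAG, "default tag falls back to LJ_TAG");
```

One consequence of treating 0 as "not overridden" is that a caller cannot explicitly request tag 0 under this scheme.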

rhornung67 · 2024-12-11T20:29:44Z

src/axom/lumberjack/MPIUtility.hpp

 *****************************************************************************
 */
 void mpiNonBlockingSendMessages(MPI_Comm comm,
                                int destinationRank,
-                                const char* packedMessagesToBeSent);
+                                const char* packedMessagesToBeSent,
+                                const int tag = 0);


Similar comment about why tag arg is here.

Similar response as above

src/axom/lumberjack/NonBlockingRootCommunicator.cpp

bmhan12 · 2024-12-11T22:28:54Z

src/axom/lumberjack/MPIUtility.cpp

+    // Receive packed Message
+    MPI_Recv(charArray,
+            messageSize,
+            MPI_CHAR,
+            mpiStatus.MPI_SOURCE,
+            mpiTag,
+            comm,
+            &mpiStatus);


As I understand the MPI API, this is actually a blocking MPI_Recv call? So this mpiNonBlockingReceiveMessages function is currently blocking to receive messages.

Yes, that's correct. The non-blocking part is the call to MPI_Iprobe, but then the Recv is blocking. My intent here is to be sure that the receive is fully finished before anything else is done, but to not block any further execution if there are no messages to be received (i.e. when mpiFlag is false). I can change the function name to clarify the intent here.

To clarify the above point, MPI_Iprobe is used instead of MPI_Probe because the former returns immediately with an mpiFlag value regardless of whether a message is waiting, whereas the latter is a blocking call that returns only once there is a message to be received.

Gotcha, the combination of MPI_Iprobe + MPI_Recv makes sense now!
I had tunnel vision comparing the non-blocking and blocking MPI interfaces.
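To make the control flow of that combination concrete, here is a small model that uses an in-memory queue instead of MPI. `Mailbox`, `probeNow`, and `receiveIfAvailable` are hypothetical stand-ins: `probeNow` plays the role of the non-blocking probe, and the dequeue plays the role of the blocking receive that only runs once a message is known to be waiting. It is a sketch of the pattern, not the PR's implementation:

```cpp
#include <queue>
#include <string>

// Stand-in for the messages pending on a communicator (not an MPI type).
struct Mailbox
{
  std::queue<std::string> pending;
};

// Models the non-blocking probe: returns immediately and reports whether
// a message is waiting.
inline bool probeNow(const Mailbox& box) { return !box.pending.empty(); }

// Models the combined pattern: poll once, and only receive if the probe
// reported a pending message. Returns false when there is nothing to receive,
// which corresponds to the function in the PR returning nullptr.
inline bool receiveIfAvailable(Mailbox& box, std::string& message)
{
  if(!probeNow(box))
  {
    return false;  // nothing pending: do not block the caller
  }
  message = box.pending.front();  // the receive completes before returning
  box.pending.pop();
  return true;
}
```

A caller would invoke `receiveIfAvailable` whenever it reaches a convenient point and simply continue if it returns false.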

This does also make me think about renaming this communicator to something like "NonCollectiveCommunicator" rather than "NonBlockingCommunicator". It's true that it calls these non-blocking functions, but I think the main feature is actually that we don't rely on collective calls to communicate messages to root.

I like that idea.

bmhan12 · 2024-12-11T22:32:16Z

src/axom/lumberjack/MPIUtility.cpp

+
+  // Get size and source of MPI message
+  int mpiFlag = true;
+  MPI_Iprobe(MPI_ANY_SOURCE, tag, comm, &mpiFlag, &mpiStatus);


MPI_Iprobe is nonblocking here, so is there a chance the mpiFlag is not set to true when it is expected to be? Would it be better to have this be a blocking MPI_Probe? Basing this comment off this stackoverflow post: https://stackoverflow.com/questions/43823458/mpi-iprobe-vs-mpi-probe

Additionally, if using MPI_Iprobe, should mpiFlag default be set to false, so it can be set to true only by a successful function call?

I think the mpiFlag will be set in either context to either true or false, but to your point, it is safer to initialize this as false.

The Stack Overflow example illustrates an interesting but slightly different approach from what I intend. It calls MPI_Iprobe in a while loop that does not exit until the returned flag is non-zero. In my case, I check only once whether any messages need to be received, and if there are none, the function exits by returning nullptr. The intent in the Stack Overflow example is to monitor the status continuously, whereas I only intend to monitor it periodically, whenever the code path enters this function.

Both could be relevant to the problem I'm trying to solve with this communicator, where the root rank needs to learn from other ranks that they are aborting. I had a preference toward the latter option (periodically monitoring the status whenever the root rank reaches a point where it enters this code path) because it seemed to me like the more efficient option, even if it comes at a cost of sometimes not receiving the status before the program aborts. But I'm not really sure which option is best for this scenario. I'd be curious to hear your thoughts.

> I had a preference toward the latter option (periodically monitoring the status whenever the root rank reaches a point where it enters this code path) because it seemed to me like the more efficient option, even if it comes at a cost of sometimes not receiving the status before the program aborts.

I agree. I would expect the latter option to have less overhead, since it does a single poll with MPI_Iprobe instead of spinning on MPI_Iprobe until the status is updated. Nevertheless, I might be missing something, so I am also curious whether others have ideas.
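The trade-off between the two options can be illustrated with a toy comparison that counts probe calls. Here `probe` is a hypothetical stand-in for any non-blocking check such as MPI_Iprobe, and `PollResult`, `pollOnce`, and `spinUntilFound` are illustrative names rather than anything from the PR:

```cpp
#include <functional>

// Outcome of a polling strategy: whether a message was seen, and how many
// probe calls it took.
struct PollResult
{
  bool found;
  int probeCalls;
};

// Single poll: one probe call, then return regardless of the outcome.
inline PollResult pollOnce(const std::function<bool()>& probe)
{
  const bool found = probe();
  return {found, 1};
}

// Spin: keep probing until a message is seen (bounded by maxCalls here so the
// sketch cannot loop forever).
inline PollResult spinUntilFound(const std::function<bool()>& probe, int maxCalls)
{
  int calls = 0;
  while(calls < maxCalls)
  {
    ++calls;
    if(probe())
    {
      return {true, calls};
    }
  }
  return {false, calls};
}
```

In this model, the single poll always costs one probe but may miss a message that has not arrived yet, while the spin never misses it but pays for every probe until arrival.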

added non-blocking root communicator

4835c73

gberg617 requested review from white238, kennyweiss, rhornung67 and bmhan12 December 11, 2024 20:19

gberg617 self-assigned this Dec 11, 2024

rhornung67 reviewed Dec 11, 2024

src/axom/lumberjack/MPIUtility.cpp (outdated)

rhornung67 reviewed Dec 11, 2024

src/axom/lumberjack/MPIUtility.cpp (outdated)

rhornung67 reviewed Dec 11, 2024

src/axom/lumberjack/MPIUtility.cpp (outdated)

rhornung67 reviewed Dec 11, 2024

src/axom/lumberjack/NonBlockingRootCommunicator.cpp (outdated)

bmhan12 reviewed Dec 11, 2024

fixed tag and flag comparison type, renamed NonBlockingRootCommunicator to NonCollectiveRootCommunicator

926fd00

gberg617 force-pushed the feature/bergel1/lumberjack_nonblocking_communicator branch from 7921ec5 to 926fd00 Compare December 13, 2024 01:13

gberg617 added 3 commits December 12, 2024 17:15

small formatting fix

f53ca2f

added unique mpi tag for each NonCollectiveRootCommunicator instance, and added unit testing

5025063

added missing function param description

d8faf4a
