common/ofi: Add comments for the import monitor code #9387

wckzhang · 2021-09-17T16:15:25Z

Signed-off-by: William Zhang [email protected]

wckzhang · 2021-09-17T16:16:13Z

Adding some comments to make the code more understandable.

bwbarrett · 2021-09-17T17:25:11Z

opal/mca/common/ofi/common_ofi.c


+/*
+ * These no-op functions are necessary since libfabric does not allow null
+ * function pointers here or else it will segfault.


drop the "or else it will segfault". The fact that it doesn't allow NULL pointers is sufficient.
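A minimal sketch of the no-op pattern that comment describes, for illustration only. The struct `example_monitor_ops` and every function name here are hypothetical stand-ins, not libfabric's actual monitor types. The point is only that each slot of a callback table gets a real function, so a caller that never checks for NULL can safely invoke it.

```c
#include <stddef.h>

/* Hypothetical callback table; NOT libfabric's real monitor structure. */
struct example_monitor_ops {
    int  (*start)(void *ctx);
    void (*stop)(void *ctx);
    int  (*subscribe)(void *ctx, const void *addr, size_t len);
    void (*unsubscribe)(void *ctx, const void *addr, size_t len);
};

/* No-op implementations: they do nothing, but they are valid call targets. */
static int example_noop_start(void *ctx)
{
    (void) ctx;
    return 0;
}

static void example_noop_stop(void *ctx)
{
    (void) ctx;
}

static int example_noop_subscribe(void *ctx, const void *addr, size_t len)
{
    (void) ctx;
    (void) addr;
    (void) len;
    return 0;
}

static void example_noop_unsubscribe(void *ctx, const void *addr, size_t len)
{
    (void) ctx;
    (void) addr;
    (void) len;
}

/* Every slot is populated, so no slot is ever NULL. */
static const struct example_monitor_ops example_ops = {
    .start = example_noop_start,
    .stop = example_noop_stop,
    .subscribe = example_noop_subscribe,
    .unsubscribe = example_noop_unsubscribe,
};

/* Hypothetical consumer that calls each slot without a NULL check. */
static int example_consumer_run(const struct example_monitor_ops *ops)
{
    int ret = ops->start(NULL);
    if (0 != ret) {
        return ret;
    }
    ret = ops->subscribe(NULL, NULL, 0);
    ops->unsubscribe(NULL, NULL, 0);
    ops->stop(NULL);
    return ret;
}
```

Calling `example_consumer_run(&example_ops)` exercises all four slots; with a NULL slot the same call would dereference a null function pointer.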

bwbarrett · 2021-09-17T17:27:30Z

opal/mca/common/ofi/common_ofi.c

 #endif /* OPAL_OFI_IMPORT_MONITOR_SUPPORT */

 OPAL_DECLSPEC int opal_common_ofi_init(void)
 {


All functions that do interesting work should have a doxygen style header.

You should call out things like what this function is doing and why. At some point, you should explicitly say that you're injecting a monitor into Libfabric so that Libfabric can use Open MPI's memory hooks as its memory monitor, rather than trying to use its own.

I had a comment in the header file for the function, but it isn't doxygen style. I'll update it and add more information.

bwbarrett · 2021-09-17T17:28:01Z

opal/mca/common/ofi/common_ofi.c

    int ret;

+    /*
+     * Global refcnt and boolean are used to ensure we only initialize once


This pattern is well established and doesn't need a comment about why it's here. However, you do need to explain how this fits in with the thread safety requirements of this function (and document those requirements in the function header).

I think to make it easier for users to call this function, I'll add a new opal_mutex_t lock and lock around the whole function.

why is that the right thing to do? That has its own risks, especially around initialization of the mutex in the initialization function you're currently writing. Just get the definition right.

There's already a common mutex lock in use; I think it's fine to reuse that.
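The thread-safety question in this thread can be sketched as follows, assuming a statically initialized lock that already exists before init runs, which avoids creating the mutex inside the function it protects. This uses plain pthreads and hypothetical names (`example_lock`, `example_init`, `example_fini`); it is not the Open MPI code, which has its own `opal_mutex_t` type and an existing common OFI lock.

```c
#include <pthread.h>
#include <stdbool.h>

/* Statically initialized, so it is valid before example_init() is called. */
static pthread_mutex_t example_lock = PTHREAD_MUTEX_INITIALIZER;
static int example_refcnt = 0;
static bool example_initialized = false;

/*
 * Hypothetical init: safe to call from multiple threads. Only the first
 * caller performs the one-time setup; later callers just take a reference.
 */
int example_init(void)
{
    pthread_mutex_lock(&example_lock);
    if (0 == example_refcnt) {
        /* One-time setup would go here. */
        example_initialized = true;
    }
    example_refcnt++;
    pthread_mutex_unlock(&example_lock);
    return 0;
}

/*
 * Hypothetical fini: only the caller that drops the last reference
 * performs the teardown. Extra calls are ignored rather than underflowing.
 */
int example_fini(void)
{
    pthread_mutex_lock(&example_lock);
    if (example_refcnt > 0 && 0 == --example_refcnt) {
        /* One-time teardown would go here. */
        example_initialized = false;
    }
    pthread_mutex_unlock(&example_lock);
    return 0;
}
```

Holding the lock across both the reference-count check and the setup is what keeps a second thread from observing a non-zero count before initialization has finished.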

bwbarrett · 2021-09-17T17:30:50Z

opal/mca/common/ofi/common_ofi.c

    }

+    /*
+     * This cache object doesn't do much, but is necessary for the API to work.


This doesn't really help the reader. Bigger questions would be "Why 1.13?" and "Why 'mr_cache'?". Those are the things you should comment on.

bwbarrett · 2021-09-17T17:45:39Z

opal/mca/common/ofi/common_ofi.c

 OPAL_DECLSPEC int opal_common_ofi_fini(void)
 {
+    /*
+     * Global refcnt and boolean are used to ensure we only finalize once


you don't need to do this comment.

wckzhang · 2021-09-17T18:16:56Z

Updated comments and added locking to the init/fini functions, since I don't think they were actually thread safe previously. Reused the common ofi mutex lock for this.

bwbarrett · 2021-09-30T22:56:48Z

Replaced by #9441

wckzhang mentioned this pull request Sep 17, 2021

Memory patcher conflicts with OMPI and libfabric #8822

Closed

bwbarrett requested changes Sep 17, 2021

wckzhang added 2 commits September 17, 2021 11:14

common/ofi: Add locking to the init/fini functions

ea19b56

For thread safety, we have the init/fini functions locked under the common ofi lock. Signed-off-by: William Zhang <[email protected]>

common/ofi: Add comments for the import monitor code

4e0e688

Signed-off-by: William Zhang <[email protected]>

wckzhang force-pushed the api branch from 751b53a to 4e0e688 Compare September 17, 2021 18:15

bwbarrett mentioned this pull request Sep 29, 2021

Clean up OFI common code and delay patcher initialization until needed #9441

Merged

bwbarrett closed this Sep 30, 2021
