[DRAFT] [Fix] Restore NullComm for spoofed cube-sphere run#386
[DRAFT] [Fix] Restore NullComm for spoofed cube-sphere run#386FlorianDeconinck wants to merge 1 commit into
NullComm for spoofed cube-sphere run#386Conversation
|
Poke @oelbert |
|
I didn't see/rember that That's another solution. |
I moved a copy of Spoofing a 6-rank setup on one rank sounds scary. I don't think the DSL should support/promote this behavior. I can see limited use in tests, though NDSL tests work fine without it. I'd rather not have the general DSL user know about this. If pace wants to support such workflows (e.g. for development), I think we should use/keep |
|
Closing this. Next steps for the underlying issues is:
|
Description
We definitely took
NullCommout in #350 and replace it's usage byLocalComm.One use case that cannot be covered with this replacement is running a single process spoofing 6 ranks cube-sphere
pyfv3model.Their is two problem:
LocalCommcan do halo exchange but you have to code for it (see unit test) andpyFV3isn't.AllReduceraises withLocalCommwhich is also needed forpyFV3(grid generation & tracer advection)In this PR we restore
NullCommto re-authorize the behavior.Other fixes that would not bring
NullCommback are:NullCommmakes it nonsense anyway) + no-opallreduceonLocalCommCachingCommfor the 6-ranks spoof (which should work but requires running true 6-ranks first)pyfv3proper +allreduceonLocalCommHow has this been tested?
N/A
Checklist