mpi: Enhance flexibility for custom topologies #2134
Conversation
(force-pushed 0a8ecf0 → c8d3b54)
Codecov Report
@@            Coverage Diff             @@
##           master    #2134      +/-   ##
==========================================
- Coverage   87.10%   87.07%   -0.03%
==========================================
  Files         223      223
  Lines       39813    39832      +19
  Branches     5166     5169       +3
==========================================
+ Hits        34679    34685       +6
- Misses       4556     4569      +13
  Partials      578      578
devito/mpi/distributed.py
Outdated
# Decompose the processes remaining for allocation to prime factors
alloc_procs = np.prod([i for i in items if i != '*'])
remprocs = int(input_comm.size // alloc_procs)
prime_factors = primefactors(remprocs)
This looks a bit overly intricate.
If you use factorint(remprocs) you get directly the dict of factors that you just need to split, i.e.:
factors = factorint(remprocs)
vals = [k for (k, v) in factors.items() for _ in range(v)]
vals = vals + [1 for _ in range(nstars - len(vals))]
starvals = (*vals[:nstars-1], prod(vals[nstars-1:]))
and that should be it.
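For reference, here is a minimal runnable sketch of the factorint-based splitting suggested above, using stand-in values (24 processes, 3 starred entries) rather than the real variables from distributed.py:

from math import prod
from sympy import factorint

remprocs = 24  # processes left to allocate across the starred dimensions
nstars = 3     # number of '*' entries in the requested topology

factors = factorint(remprocs)  # {2: 3, 3: 1}
vals = [k for (k, v) in factors.items() for _ in range(v)]  # [2, 2, 2, 3]
vals = vals + [1 for _ in range(nstars - len(vals))]  # pad with 1s if too few factors
starvals = (*vals[:nstars-1], prod(vals[nstars-1:]))
print(starvals)  # (2, 2, 6): all leftover factors are multiplied into the last slot

Note how the leftover factors all land in the last slot, which is what the failing tests below complain about.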
Well, not really, since that prioritises neither the outermost dimension nor the overall balance.
E.g. your approach will give:
FAILED tests/test_mpi.py::TestDistributor::test_custom_topology_3d_dummy[24-topology20-dist_topology20] - assert (2, 2, 6) == (6, 2, 2)
FAILED tests/test_mpi.py::TestDistributor::test_custom_topology_3d_dummy[32-topology21-dist_topology21] - assert (2, 2, 8) == (4, 4, 2)
That's just a minor change to the last line, to cycle instead of multiplying in the remainder:
split = np.array_split(vals, nstars)
starvals = np.prod([np.pad(s, (0, len(split[0]) - len(s)), constant_values=1) for s in split], axis=1)
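A runnable sketch of this cyclic variant, assuming the factor list is ordered largest-first so the outermost dimension stays the heaviest (the names are stand-ins, not the real variables):

import numpy as np

vals = [3, 2, 2, 2]  # prime factors of 24, largest first
nstars = 3

split = np.array_split(vals, nstars)  # [array([3, 2]), array([2]), array([2])]
width = len(split[0])                 # the longest chunk
padded = [np.pad(s, (0, width - len(s)), constant_values=1) for s in split]
starvals = [int(np.prod(p)) for p in padded]
print(starvals)  # [6, 2, 2]

With vals = [2] * 5 (i.e. 32 ranks) the same recipe gives [4, 4, 2], matching the expected results in the failures above.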
And pad with (len(split[0]) - len(s), 0) if you want it left-heavy instead.
I think I pushed a simplified version, lemme know if you like it better.
I would dare to say that I find your approach less readable, at least to my eyes.
Btw, I could not make it work. Would you mind trying your code locally?
(force-pushed 15761fe → 3ae6798)
(force-pushed 3ae6798 → 958579c)
Several improvements are possible, in my opinion:
devito/mpi/distributed.py
Outdated
for index, value in zip(int_pos, int_vals):
    processed[index] = value

if dd_list:
why do you need this if?
and it's probably doable with a comprehension
Really? Like how without iterating O(len(processed))?
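For what it's worth, one comprehension-based spelling of the zip-assignment above; it still iterates over the whole of processed, as noted, and the sample values here are made up:

int_pos = [2]          # positions of the explicit integer entries in the topology
int_vals = [1]         # the integer entries themselves
processed = [4, 4, '*']

lookup = dict(zip(int_pos, int_vals))
processed = [lookup.get(i, v) for i, v in enumerate(processed)]
print(processed)  # [4, 4, 1]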
devito/mpi/distributed.py
Outdated
the outermost dimension

Assuming N=6 and the requested topology is `('*', '*', 1)`,
since there is no integer k such that k*k == 6, we resort to the closest factors to
Could you rewrite this (with Grammarly, maybe)? It's not very clear to me at first glance.
I simplified it to let the examples speak for themselves.
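To make the docstring's example concrete, here is a hypothetical helper (not the PR's actual code) computing the "closest factors" of a given N:

from math import isqrt

def closest_factors(n):
    # Walk down from sqrt(n) to find the divisor pair nearest to a square,
    # returning the larger factor first
    for k in range(isqrt(n), 0, -1):
        if n % k == 0:
            return (n // k, k)

print(closest_factors(6))   # (3, 2), so ('*', '*', 1) with N=6 would give (3, 2, 1)
print(closest_factors(16))  # (4, 4)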
devito/mpi/distributed.py
Outdated
# Start by using the max prime factor at the first starred position,
# then cyclically-iteratively decompose as evenly as possible until
# decomposing to the number of `remprocs`
while remprocs != 1:
Nitpicking: if one passes a dummy comm with .size = 0, this remains trapped in an infinite loop. Safer to use: while remprocs > 1
Also, I see an inconsistent use of _ in variable names.
Switched to rem_procs.
I think passing 0 is bad usage; maybe we can jump straight to the assert statement?
Asking for a custom topology on 0 ranks looks bad to me.
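Something like this up-front guard would cover the degenerate case while keeping the stricter loop condition (a sketch, not the merged code):

assert input_comm.size > 0, "Custom topology requires at least one MPI rank"
while rem_procs > 1:
    ...  # decompose prime factors across the starred positions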
devito/mpi/distributed.py
Outdated
# Decompose the processes remaining for allocation to prime factors
alloc_procs = np.prod([i for i in items if i != '*'])
remprocs = int(input_comm.size // alloc_procs)
prime_factors = primefactors(remprocs)
instead of having this both here and inside the while loop, you should have it only once, at the top of the while loop...
which, if necessary, you can turn into:
while True:
    ...
    if rem_procs <= 1:
        break
thus emulating a do-while loop
probably same story with rem_procs
Dropped and improved, lemme know if it is fine
devito/mpi/distributed.py
Outdated
try:
    assert np.prod(processed) == input_comm.size
except:
    raise ValueError("Invalid `topology`", processed, " for given nprocs:",
If it's an assert, why bother with this? Just leave the assert alone and that's it.
I think it will be more helpful as a message to the user?
I can drop it if you prefer.
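If the user-facing message is kept, a single formatted ValueError without the try/except would read more cleanly; a self-contained sketch with stand-in values:

import numpy as np

processed = (4, 4, 2)
comm_size = 24  # stand-in for input_comm.size

if np.prod(processed) != comm_size:
    raise ValueError("Invalid topology %s for %d processes"
                     % (processed, comm_size))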
(force-pushed 5fd03f7 → 83e8640)
(force-pushed 14b1fdc → f26e986)
Merged, thanks
This PR enhances the available custom-topology MPI decompositions.
The requirement that the number of stars evenly divide the number of MPI processes is dropped.
Some topology-to-decomposition combinations can be seen in the added tests.
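To illustrate the resulting behaviour, here is a self-contained sketch of the greedy scheme described in the diff comments (max prime factor first, then cyclic assignment across the starred positions); it reproduces the expected values quoted in the test failures above, but it is a reconstruction, not the actual CustomTopology code:

from sympy import primefactors

def decompose(nprocs, nstars):
    # Greedily assign the largest remaining prime factor, cycling over the
    # starred positions so the outermost dimension stays the heaviest
    stars = [1] * nstars
    rem_procs = nprocs
    i = 0
    while rem_procs > 1:
        factor = max(primefactors(rem_procs))
        stars[i % nstars] *= factor
        rem_procs //= factor
        i += 1
    return tuple(stars)

print(decompose(24, 3))  # (6, 2, 2)
print(decompose(32, 3))  # (4, 4, 2)
print(decompose(6, 2))   # (3, 2), as in the docstring example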