Skip to content

Conversation

@awlauria
Copy link
Contributor

@awlauria awlauria commented Jan 8, 2021

Make a managed allocation filter a hostfile/hostlist.

If the user asks for a hostfile/hostlist inside of a managed allocation,
make sure that rmaps filters these and maps processes based on them. Otherwise,
it can result in inconsistent mappings across root and compute nodes if the
user orders their hostfile differently than the resource manager.

Signed-off-by: Austen Lauria [email protected]
(cherry picked from commit e14f80d)

Fix bug where orte under a managed allocation does not honor -host.

For example:

$. bsub -n 40 -m "node1 node2" mpirun -np 6 -host node1:2,node2:4 hostname

would not map two hostname processes to node1 and four to node2.
Instead, it would still think that each node1
and node2 had (for example) 20 cpu resources, and map accordingly.

Signed-off-by: Austen Lauria [email protected]
(cherry picked from commit 35cf87a)

If the user asks for a hostfile/hostlist inside of a managed allocation,
make sure that rmaps filters these and maps processes based on them. Otherwise,
it can result in inconsistent mappings across root and compute nodes if the
user orders their hostfile differently than the resource manager.

Signed-off-by: Austen Lauria <[email protected]>
(cherry picked from commit e14f80d)
For example:

$. bsub -n 40 -m "node1 node2" mpirun -np 6 -host node1:2,node2:4 hostname

would not map two hostname processes to node1 and four to node2.
Instead, it would still think that each node1
and node2 had (for example) 20 cpu resources, and map accordingly.

Signed-off-by: Austen Lauria <[email protected]>
(cherry picked from commit 35cf87a)
@awlauria awlauria changed the title v4.1.x: Managed allocation v4.1.x v4.1.x: Fix a couple managed allocation issues. Jan 8, 2021
@awlauria
Copy link
Contributor Author

awlauria commented Jan 8, 2021

Cherry-picked from #8352
prrte/master: openpmix/prrte#718

@jsquyres jsquyres merged commit 90b70d5 into open-mpi:v4.1.x Jan 11, 2021
@awlauria awlauria deleted the managed_allocation_v4.1.x branch March 17, 2022 17:24
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants