Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improve progress engine binding flow #181

Merged
merged 2 commits into from
Mar 8, 2023
Merged

Improve progress engine binding flow #181

merged 2 commits into from
Mar 8, 2023

Conversation

ndryden
Copy link
Collaborator

@ndryden ndryden commented Mar 8, 2023

This splits apart how binding the progress engine is done to avoid MPI collectives coming from the progress engine thread. This could cause issues when not all ranks participate in calls, leading to deadlocks. (There might also be multi-threading issues.)

This also fixes an unguarded use of a variable which might lead to compile errors.

@ndryden ndryden added the bug Something isn't working label Mar 8, 2023
@ndryden ndryden requested a review from benson31 March 8, 2023 17:02
Copy link
Collaborator

@benson31 benson31 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM. Fixes observed hang in LBANN.

@ndryden ndryden merged commit a5fdb08 into master Mar 8, 2023
@ndryden ndryden deleted the fix-binding branch March 8, 2023 17:29
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants