Skip to content

Conversation

@awlauria
Copy link
Contributor

No description provided.

@awlauria awlauria requested a review from rhc54 August 23, 2022 13:50
#if PRTE_ENABLE_FT
PMIX_OPTION_DEFINE(PRTE_CLI_ENABLE_RECOVERY, PMIX_ARG_NONE),
PMIX_OPTION_DEFINE(PRTE_CLI_MAX_RESTARTS, PMIX_ARG_REQD),
PMIX_OPTION_DEFINE(PRTE_CLI_DISABLE_RECOVERY, PMIX_ARG_NONE),
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hmmm...what if the default for PRRTE has been set by MCA parameter to enable recovery, but the user doesn't want to do that for this job? Are you sure you want to remove this option?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It was undefined here:
8d735f5#diff-95468cad0a5594de72258b4d462586e17493568348689d12b0b612141a9312daL113

alternatively we could just add this back - oversight?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yeah, that was an error on my part. I'm working right now on how to handle all these runtime controls - many of them are currently set by MCA param (e.g., the abort-on-non-zero-exit behavior), which really isn't right for PRRTE as it is primarily a DVM and not every job it runs will want such behaviors. I'm creating a new --runtime-options CLI for this purpose.

So we have a couple of options here:

  • you could restore the definition for now, which is fine with me
  • you could decide to add the --runtime-options directive to your cmd line and include "disablerecover" (or whatever you would like to call it) there. You'd have to wait for me to commit that directive and its supporting code - should be later today or tomorrow.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks - I just put it back as it's a compile error.

Restore PRTE_CLI_DISABLE_RECOVERY definition for now.

Signed-off-by: Austen Lauria <[email protected]>
@rhc54
Copy link
Contributor

rhc54 commented Aug 23, 2022

bot:ibm:retest

@rhc54 rhc54 merged commit c63d743 into openpmix:master Aug 23, 2022
@awlauria awlauria deleted the fix_minor_things branch August 23, 2022 15:22
@awlauria awlauria mentioned this pull request Aug 23, 2022
14 tasks
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants