Skip to content
This repository has been archived by the owner on Nov 17, 2023. It is now read-only.

[v1.4.x] Support populating errors back to MXNet engine in callback #13932

Merged
merged 2 commits into from
Jan 18, 2019

Conversation

yuxihu
Copy link
Member

@yuxihu yuxihu commented Jan 18, 2019

This PR adds an optional dmlc::Error* argument in MXNet engine callback functions. The callers can leverage this argument to populate errors back to MXNet engine through callback such that the errors can be handled properly by MXNet engine. It is needed to populate the errors detected in Horovod back to MXNet engine for Hovorod and MXNet integration (see PR). This change does not affect existing use cases.

Copy link
Contributor

@apeforest apeforest left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@yuxihu
Copy link
Member Author

yuxihu commented Jan 18, 2019

@mxnet-label-bot update [pr-awaiting-merge]

@marcoabreu marcoabreu added the pr-awaiting-merge Review and CI is complete. Ready to Merge label Jan 18, 2019
@yuxihu
Copy link
Member Author

yuxihu commented Jan 18, 2019

@marcoabreu Could you please help merge this PR to v1.4.x branch? Thanks.

Copy link
Member

@lanking520 lanking520 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

#13922 Mirror of the PR, LGTM

@lanking520 lanking520 merged commit 191f50a into apache:v1.4.x Jan 18, 2019
@yuxihu yuxihu deleted the mx_error_14x branch January 25, 2019 04:58
lanking520 pushed a commit to lanking520/incubator-mxnet that referenced this pull request Feb 18, 2019
…pache#13932)

* add an optional error_msg in engine on_complete callbcak

* use dmlc::Error struct to make error population extendable
@anirudh2290 anirudh2290 mentioned this pull request Apr 4, 2019
7 tasks
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
pr-awaiting-merge Review and CI is complete. Ready to Merge
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants