-
Notifications
You must be signed in to change notification settings - Fork 3.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Set MLFlowLogger
status to FAILED when training raises an error
#12292
Set MLFlowLogger
status to FAILED when training raises an error
#12292
Conversation
for more information, see https://pre-commit.ci
The commit has succeeded to fix the bug, but failed to pass an existing unit test, which I couldn't find a solution to. |
…lightning into bugfix/12291_mlflow
I would appreciate it if you approve running the workflows 🙂 |
Most of team was OOO. Sorry about the delay.
Which test was it? |
@awaelchli
It was |
@awaelchli @daniellepintz |
@ritsuki1227 I created a fix in #14762 to unblock this PR for the remaining issues. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
mind add a changelog?
@ritsuki1227 before we merge, would you mind checking again your MLFlow use case, that the status is reported as you expect? Thank you! |
@awaelchli Thank you for your commits! I've checked mlflow logger worked correctly on the use case. |
Awesome, let's get it in then! Thanks a lot for your help in making the integration with PL better! |
…2292) Co-authored-by: Ritsuki Yamada <[email protected]> Co-authored-by: Adrian Wälchli <[email protected]> Co-authored-by: Jirka <[email protected]> Co-authored-by: Carlos Mocholí <[email protected]>
What does this PR do?
If a trainer with MLFlowLogger raises an error, the user should be able to see the MLflow's screen to check the training has been failed.
MLflow's status remains "RUNNING" even after trainer.fit raises an error in the current implementation, so the user cannot know whether the training is still in progress or failed.
Fixes #12291
Does your PR introduce any breaking changes? If yes, please list them.
Before submitting
PR review
Anyone in the community is welcome to review the PR.
Before you start reviewing make sure you have read Review guidelines. In short, see the following bullet-list:
Did you have fun?
Make sure you had fun coding 🙃