-
Notifications
You must be signed in to change notification settings - Fork 6.8k
Experiment with CI cudnn versions [Do not merge] #15847
Conversation
Seems CI was stuck. And I have resolved several conflicts in PR #15741. Would you mind rebasing on it and triggering CI again? Thanks. BTW, I also found that |
Any update on this or CI versions irrelevant, pls? |
Thanks for your patience on this. I began working on the issues of your PR with the commit 'bump cudnn version..." and found that the CI passed for the platform that was causing you trouble. However, I could not get a passing CI due to a number of issues. Those issues I finally fixed with merged PR #15922. I am currently out-of-the-office this week. If you don't mind pursuing a remedy on your own, I suggest you re-merge your PR with master (to pick up 15922), then cherry-pick my 'bump cudnn version..." commit. Please let me know if that stabilizes your PR. |
@DickJC123 Sure. I will have a try. Thanks for your reply. |
With CuDNN 7.6.0.4, CI also failed for several times. test_operator_gpu.test_lstm_bidirectional test_gluon_gpu.test_rnn_layer |
@mxnet-label-bot add [pr-awaiting-review] |
Description
While trying to track down rnn test flakiness that surfaced in PR#15741, I noticed that the CI for the unix-gpu runners is running against cudnn 7.5, while the libmxnet.so was built against 7.6. This PR is an attempt to see if the flakiness is correlated with the cudnn lib version.
Checklist
Essentials
Please feel free to remove inapplicable items for your PR.
Changes
Comments