-
Notifications
You must be signed in to change notification settings - Fork 6.8k
Conversation
Hey @szha , Thanks for submitting the PR
CI supported jobs: [centos-gpu, edge, windows-cpu, website, centos-cpu, unix-cpu, unix-gpu, windows-gpu, miscellaneous, sanity, clang] Note: |
2c73dcf
to
ccf2342
Compare
How about the random seed handling and reproducibility? I think you'll need to adapt https://github.com/dmlc/gluon-nlp/blob/v0.9.x/conftest.py#L41 |
d09779c
to
b7ab4a2
Compare
@leezu thanks. Added. |
f94b92f
to
e848924
Compare
@@ -10,16 +10,16 @@ jobs: | |||
uses: actions/checkout@v2 | |||
- name: Install Dependencies | |||
run: | | |||
brew install nasm automake ninja libtool cmake pkgconfig protobuf | |||
brew install nasm automake ninja libtool cmake pkgconfig protobuf hdf5 zlib | |||
python3 -m pip install --user -r ci/docker/install/requirements |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I suggest not to use ci/docker/install/requirements
here but list the dependencies explicitly. Most of the things in ci/docker/install/requirements
are not needed for this test and having a single requirements file that is used from an opaque number of locations for an variety of purposes makes it very hard to maintain that file.
Specifically, here we can just replace python3 -m pip install --user nose nose-timer nose-exclude numpy scipy
with python3 -m pip install --user pytest numpy scipy
a few lines below.
@mxnet-bot run ci [unix-gpu] |
Jenkins CI successfully triggered : [unix-gpu] |
@szha there is a bug with the unix-gpu dist kvstore tests Runtime on master is 6 minutes http://jenkins.mxnet-ci.amazon-ml.com/blue/organizations/jenkins/mxnet-validation%2Funix-gpu/detail/master/1887/pipeline/426 but more than 1 hour in this PR http://jenkins.mxnet-ci.amazon-ml.com/blue/organizations/jenkins/mxnet-validation%2Funix-gpu/detail/PR-18025/56/pipeline |
Ok, the difference is due to hanging issue.. [2020-04-22T17:06:49.477Z] AssertionError: (None, array([[0.6, 0.6, 0.6], [2020-04-22T17:06:49.477Z] [0.6, 0.6, 0.6]], dtype=float32), 41.0) [2020-04-22T17:06:49.477Z] terminate called without an active exception [2020-04-22T17:06:49.477Z] terminate called without an active exception [2020-04-22T17:06:49.477Z] terminate called without an active exception [2020-04-22T17:06:49.477Z] terminate called without an active exception Thanks @marcoabreu to retrigger |
The stability of our test suite is really a pain in the butt :( You're welcome |
This does not seem to be a stability issue, but a bug. Same failure again:
|
Still
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
good to me
Woohoo! |
|
||
# https://github.com/apache/incubator-mxnet/issues/11801 | ||
# if [[ ${mxnet_variant} = "cpu" ]] || [[ ${mxnet_variant} = "mkl" ]]; then | ||
# integrationtest_ubuntu_cpu_dist_kvstore | ||
# fi | ||
|
||
if [[ ${mxnet_variant} = cu* ]]; then | ||
$nose_cmd $NOSE_TIMER_ARGUMENTS --verbose tests/python/gpu | ||
pytest --durations=50 --verbose tests/python/gpu |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
why are we showing top 50 slowest duration only?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Why not? The others are not really of interest
* switch nose with pytest * switch centos python to 3.6 * disable dist kvstore tests * skip hanging test
Description
switch nose with pytest
Checklist
Essentials
Please feel free to remove inapplicable items for your PR.
Changes
Comments