This repository has been archived by the owner on Nov 17, 2023. It is now read-only.
-
Notifications
You must be signed in to change notification settings - Fork 6.8k
[v1.x] test_gluon_data unit tests failing #19877
Labels
Comments
josephevans
pushed a commit
to josephevans/mxnet
that referenced
this issue
Feb 10, 2021
Since we're trying to unblock the v1.x CI pipeline, I am disabling these 2 tests for now in #19872. |
sandeep-krishnamurthy
pushed a commit
that referenced
this issue
Feb 10, 2021
* Attempt to fix v1.x CI issues. * Re-pin scipy. * Add numpy with pinned version so other package installs don't overwrite out required version. * Use python3 (from /usr/local/bin) for tensorrt gpu tests, so it can find all required python modules. * Fix onnx tests; need to pass scalar value (not np.array) to create_const_scalar_node. * Fix pylint * Set values using np.dtype(dtype) instead of using float32 and then casting to desired type. * Skip 2 tests that are flakey, reported in issue #19877. Co-authored-by: Joe Evans <[email protected]>
this PR made changes to gluon data loader #19748 . But the issue is not reproducible on local machine though. CI is unblocked for now. This requires further investigation for root cause |
Getting another test failure, looks related to gluon data loader stuff. We really need to dig down and root cause this issue.
|
I don't think it's related, but I create #19886 to test if upgrading python (from 3.6 to 3.7) caused these unit test failures. |
This was referenced Feb 19, 2021
See my comment in the other issue about this: #19918 (comment) |
4 tasks
Thanks @ptrendx for the fix! Closing this issue. |
Sign up for free
to subscribe to this conversation on GitHub.
Already have an account?
Sign in.
Description
On the v1.x pipeline, we are seeing the following test failures consistently:
in tests/python/unittest/test_gluon_data.py:
test_multi_worker_dataloader_release_pool
test_multi_worker_forked_data_loader
Occurrences
https://jenkins.mxnet-ci.amazon-ml.com/blue/organizations/jenkins/mxnet-validation%2Funix-cpu/detail/PR-19872/7/pipeline/293/#step-776-log-1725
https://jenkins.mxnet-ci.amazon-ml.com/blue/organizations/jenkins/mxnet-validation%2Funix-cpu/detail/PR-19872/4/pipeline/296
Test failure logs:
The text was updated successfully, but these errors were encountered: