Load NDArray only to GPU if GPU is present #16432

leezu · 2019-10-11T00:11:19Z

Description

Fix #16399

Checklist

Essentials

Please feel free to remove inapplicable items for your PR.

Changes are complete (i.e. I finished coding on this PR)
All changes have test coverage:
Unit tests are added for small changes to verify correctness (e.g. adding a new operator)
Nightly tests are added for complicated/long-running ones (e.g. changing distributed kvstore)
Build tests will be added for build configuration changes (e.g. adding a new build option with NCCL)
Code is well-documented:
For user-facing API changes, API doc string has been updated.
For new C++ functions in header files, their functionalities and arguments are documented.
For new examples, README.md is added to explain the what the example does, the source of the dataset, expected performance on test set and reference to the original paper if applicable
Check the API doc at https://mxnet-ci-doc.s3-accelerate.dualstack.amazonaws.com/PR-$PR_ID/$BUILD_ID/index.html
To the my best knowledge, examples are either not affected by this change, or have been fixed to be compatible with this change

Changes

Fix crash when attempting to load a NDArray that was saved with GPU context on a MXNet runtime built with Cuda but when no GPUs are present

Comments

I'm not sure if the CI actually tests the case of "GPU build of MXNET but no GPU present". So the test added here may always pass on CI.

ZhennanQin · 2019-10-11T01:07:13Z

For a machine with GPU, if user want to run model on CPU only, will this change works for him?

leezu · 2019-10-11T04:16:23Z

This change only affects the case where a MXNet cuda-enabled build is used on a CPU-only machine. It avoids a crash that currently happens. So the use-case you describe is not affected.

ZhennanQin · 2019-10-11T08:04:19Z

So if user want to use MXNet cuda-enabled build on a GPU equipped machine, but want to run model on its CPU, is this case supported?

leezu · 2019-10-11T18:34:09Z

Yes. It is already supported currently. Nothing is changed with respect to that.

The problem addressed here, is that a serialized ndarray encodes the context it lived on before serialization. When loading it, and no GPU is present, MXNet crashes as it attempts to load the array to GPU. With this PR, we fallback to loading to CPU. This is already done for CPU-only builds of MXNet.

* Load NDArray only to GPU if GPU is present * Add test

leezu added 2 commits October 10, 2019 23:53

Load NDArray only to GPU if GPU is present

fccec0f

Add test

501c4e4

leezu mentioned this pull request Oct 14, 2019

ndarray.load crashes MXNet GPU builds on CPU machines #16399

Closed

szha approved these changes Oct 17, 2019

View reviewed changes

szha merged commit a4ea4a8 into apache:master Oct 17, 2019

leezu deleted the fix16399 branch October 17, 2019 22:54

apeforest pushed a commit that referenced this pull request Nov 14, 2019

Load NDArray only to GPU if GPU is present (#16432)

70b65cb

* Load NDArray only to GPU if GPU is present * Add test

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Load NDArray only to GPU if GPU is present #16432

Load NDArray only to GPU if GPU is present #16432

leezu commented Oct 11, 2019

ZhennanQin commented Oct 11, 2019

leezu commented Oct 11, 2019

ZhennanQin commented Oct 11, 2019

leezu commented Oct 11, 2019 •

edited

Loading

Load NDArray only to GPU if GPU is present #16432

Load NDArray only to GPU if GPU is present #16432

Conversation

leezu commented Oct 11, 2019

Description

Checklist

Essentials

Changes

Comments

ZhennanQin commented Oct 11, 2019

leezu commented Oct 11, 2019

ZhennanQin commented Oct 11, 2019

leezu commented Oct 11, 2019 • edited Loading

leezu commented Oct 11, 2019 •

edited

Loading