-
Notifications
You must be signed in to change notification settings - Fork 6.8k
MXNet Master build for CUDA with DEBUG=1 failing #14263
Comments
Hey, this is the MXNet Label Bot. |
I tried this on a p3.2xlarge and a p3.16xlarge using a DLAMI Base and saw the same error both times. Running with DEBUG off will build fine. |
I have the same issue on PPC64LE using master (c319ae5), built with CMake on Linux. I don't have this issue with 1.4.0. |
Other Approaches tried:
|
@mxnet-label-bot add [bug][build] |
@mxnet-label-bot add [build] |
I met the same problem.
my config.mk
The output of make:
|
Having the same issue. |
A fix PR has been merged. Could you please try to verify if it fixes your problem? |
@yuxihu Still Not fixed |
Try replace the in The root cause of this may be the 4GB limit of static lib generated by |
@hzfan let me try that. Thanks for the suggestion! |
@access2rohit Regarding See https://sourceware.org/bugzilla/show_bug.cgi?id=14625 However, in my local tests that didn't fix the issue when using |
The same problem for me: |
You should be able to avoid this issue by just building for a single cuda architecture. Look into specifying |
@leezu following your advice, the size of generated .so is reduced by 3/4, thanks. |
Note: Providing complete information in the most concise form is the best way to get help. This issue template serves as the checklist for essential information to most of the technical issues and bug reports. For non-technical issues and feature requests, feel free to present the information in what you believe is the best form.
For Q & A and discussion, please start a discussion thread at https://discuss.mxnet.io
Description
MXNet Master build for CUDA with DEBUG=1 failing
Environment info (Required)
AWS Base DLAMI (ubuntu 16.04) on a p2.8xlarge
Package used (Python/R/Scala/Julia):
(I'm using ...) Python
Build info (Required if built from source)
Compiler (gcc/clang/mingw/visual studio):
gcc 5.4
g++ 5.4
MXNet commit hash:
(Paste the output of
git rev-parse HEAD
here.)0af40f7
Build config:
(Paste the content of config.mk, or the build command.)
USE_CUDA=1
USE_CUDNN=1
USE_LAPACK=1
USE_BLAS = openblas
USE_OPENCV=1
USE_CUDA_PATH = /usr/local/cuda
DEBUG=1
/usr/local/cuda -> /usr/local/cuda-9.0
Error Message:
(Paste the complete error message, including stack trace.)
Minimum reproducible example
(If you are using your own code, please provide a short script that reproduces the error. Otherwise, please provide link to the existing example.)
Steps to reproduce
(Paste the commands you ran that produced the error.)
What have you tried to solve it?
The text was updated successfully, but these errors were encountered: