You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository has been archived by the owner on Nov 17, 2023. It is now read-only.
These were the same lines that caused issue#20145, as was fixed by @Zha0q1 in PR#20146. The problem is that these source lines still have a sensitivity to the driver version and cmake build flag NVML_NO_UNVERSIONED_FUNC_DEFS.
Error Message
This issue was found when we compile MXNet master on the cuda11 450.x driver, where we see:
In file included from ../src/profiler/storage_profiler.cc:22:
/usr/local/cuda/include/nvml.h:8403:127: note: initializing argument 3 of ‘nvmlReturn_t nvmlDeviceGetComputeRunningProcesses(nvmlDevice_t, unsigned int*, nvmlProcessInfo_v1_t*)’
8403 | nvmlReturn_t DECLDIR nvmlDeviceGetComputeRunningProcesses(nvmlDevice_t device, unsigned int *infoCount, nvmlProcessInfo_v1_t *infos);
Steps to reproduce
Find machine with cuda11 450.x driver
Compile mxnet
What have you tried to solve it?
This issue was found and fixed by Dick Carter
@DickJC123 has developed a general solution that avoids compilation errors no matter which signature of the nvmlDeviceGetComputeRunningProcesses() function is enabled in the code. We will be submitting this fix as a PR shortly.
The text was updated successfully, but these errors were encountered:
Welcome to Apache MXNet (incubating)! We are on a mission to democratize AI, and we are glad that you are contributing to it by opening this issue.
Please make sure to include all the relevant context, and one of the @apache/mxnet-committers will be here shortly.
If you are interested in contributing to our project, let us know! Also, be sure to check out our guide on contributing to MXNet and our development guides wiki.
Description
We would like to report a compilation issue on the master branch, related to use of NVIDIA’s NVML library. The source lines involved are: https://github.com/apache/incubator-mxnet/blob/master/src/profiler/storage_profiler.cc#L103-L111
These were the same lines that caused issue#20145, as was fixed by @Zha0q1 in PR#20146. The problem is that these source lines still have a sensitivity to the driver version and cmake build flag NVML_NO_UNVERSIONED_FUNC_DEFS.
Error Message
This issue was found when we compile MXNet master on the cuda11 450.x driver, where we see:
FAILED:
CMakeFiles/mxnet.dir/src/profiler/storage_profiler.cc.o../src/profiler/storage_profiler.cc:109:78: error: cannot convert ‘nvmlProcessInfo_st*’ to ‘nvmlProcessInfo_v1_t*’ {aka ‘nvmlProcessInfo_v1_st*’}
109 | nvmlDeviceGetComputeRunningProcesses(nvml_device, &info_count, infos.data());
In file included from ../src/profiler/storage_profiler.cc:22:
/usr/local/cuda/include/nvml.h:8403:127: note: initializing argument 3 of ‘nvmlReturn_t nvmlDeviceGetComputeRunningProcesses(nvmlDevice_t, unsigned int*, nvmlProcessInfo_v1_t*)’
8403 | nvmlReturn_t DECLDIR nvmlDeviceGetComputeRunningProcesses(nvmlDevice_t device, unsigned int *infoCount, nvmlProcessInfo_v1_t *infos);
Steps to reproduce
What have you tried to solve it?
The text was updated successfully, but these errors were encountered: