Skip to content

Profiling of the DeepMD-kit #1842

Answered by AnuragKr
AnuragKr asked this question in Q&A
Aug 2, 2022 · 2 comments · 4 replies
Discussion options

You must be logged in to vote

Solution--

  1. Issue -- ERR_NVGPUCTRPERM: Permission issue with Performance Counters
    Steps to solve (Ask System Administrator to follow these steps)--

    1. Write

      options nvidia "NVreg_RestrictProfilingToAdminUsers=0 

      into /etc/modprobe.d/nvidia-prof.conf

    2.   sudo update-initramfs -u -k all
    3. Reboot

  2. Kernel level profiling and FLOP count can be done now using Nsight compute rather than nvprof.

    CUDA_VISIBLE_DEVICES=1 ncu --target-processes all -o profile mpirun -np 1 dp train  input.json

For more advanced profiling options - https://docs.nvidia.com/nsight-compute/NsightComputeCli/index.html

Replies: 2 comments 4 replies

Comment options

You must be logged in to vote
4 replies
@AnuragKr
Comment options

@denghuilu
Comment options

@AnuragKr
Comment options

@AnuragKr
Comment options

Comment options

You must be logged in to vote
0 replies
Answer selected by AnuragKr
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants