A pod that requested a GPU can no longer find the GPU inside the container after running for a while #230

Open
wanghaowish opened this issue Sep 10, 2024 · 0 comments

Comments

@wanghaowish

wanghaowish commented Sep 10, 2024

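The pod works at first, but after it has been running for a while, the GPU sometimes disappears.
Running nvidia-smi inside the container then also reports a GPU error.
Output of nvidia-smi inside the container: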
Failed to initialize NVML: Unknown Error
[Screenshot attached: img_v3_02ej_9a6e0bc2-0f92-42d5-941a-cef533f9916g]
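
Since the failure only shows up some of the time, it may help to record when the GPU stops being visible. The following is a minimal sketch of a watcher that could run inside the container. It is not part of the original report: the names `gpu_lost`, `check_once`, and `watch`, the 60-second interval, and the 30-second timeout are all illustrative assumptions. It relies only on the `nvidia-smi` command and the error text quoted above.

```python
import subprocess
import time
from datetime import datetime

# Error text taken from the nvidia-smi output quoted in this issue.
NVML_ERROR = "Failed to initialize NVML"


def gpu_lost(output: str) -> bool:
    """Return True if the given nvidia-smi output contains the NVML init failure."""
    return NVML_ERROR in output


def check_once() -> bool:
    """Run nvidia-smi once; True means the GPU does not appear to be usable."""
    try:
        result = subprocess.run(
            ["nvidia-smi"], capture_output=True, text=True, timeout=30
        )
    except (FileNotFoundError, subprocess.TimeoutExpired):
        # Binary missing or hung: treat as "GPU not visible" (an assumption).
        return True
    return result.returncode != 0 or gpu_lost(result.stdout + result.stderr)


def watch(interval_s: float = 60.0) -> None:
    """Poll until the GPU disappears, then print a timestamp and stop."""
    while True:
        if check_once():
            print(f"{datetime.now().isoformat()} GPU not visible", flush=True)
            break
        time.sleep(interval_s)
```

Calling `watch()` in a background shell inside the affected pod would give a timestamp that can be compared against node-level events or logs from around the same time.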
