Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

GPU显存占用上去,运算并没有使用GPU,推理速度30s/字 #89

Closed
xu-mengnan opened this issue Jun 19, 2023 · 4 comments
Closed
Labels

Comments

@xu-mengnan
Copy link

xu-mengnan commented Jun 19, 2023

图片

按照教程来进行的配置安装,但是发现推理速度非常慢。
机器配置:2080ti/8G显存/5CPU/12G内存。
模型:GLM int4

@wangzhaode
Copy link
Owner

MNN编译时是否开启了CUDA选项呢

@xu-mengnan
Copy link
Author

xu-mengnan commented Jun 19, 2023

开启了,全部按照readme配置来的

@xu-mengnan
Copy link
Author

MNN编译时是否开启了CUDA选项呢

图片 除了这两项,还需要进行其他的额外配置吗

@hj8e45
Copy link

hj8e45 commented Jun 20, 2023

MNN编译时是否开启了CUDA选项呢

图片 除了这两项,还需要进行其他的额外配置吗

try this? alibaba/MNN#2321

Copy link

Marking as stale. No activity in 30 days.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

3 participants