[Bug] pipeline hangs indefinitely when loading the model, while command-line deployment works fine #3107
Comments
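It looks like this may be an OOM during tuning. The exception handling for OOM at that point is not very thorough, so the process gets stuck without reporting an error. I suggest lowering cache_max_entry_count and max_prefill_token_num. You could start with cache_max_entry_count=0.2 and max_prefill_token_num=1024.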
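The suggestion above could be applied to the pipeline roughly as follows. This is a minimal sketch, assuming the TurboMind backend and that the two values are passed through `TurbomindEngineConfig`; the helper names `suggested_engine_kwargs` and `load_pipeline` and the `model_path` argument are hypothetical and not taken from the issue.

```python
def suggested_engine_kwargs(cache_max_entry_count=0.2, max_prefill_token_num=1024):
    """Return the conservative starting values suggested in the comment above."""
    return {
        "cache_max_entry_count": cache_max_entry_count,
        "max_prefill_token_num": max_prefill_token_num,
    }


def load_pipeline(model_path):
    """Build an lmdeploy pipeline with the reduced memory settings.

    `model_path` is a placeholder for the local path or hub id of the model.
    lmdeploy is imported lazily so the helper above works without it installed.
    """
    from lmdeploy import pipeline, TurbomindEngineConfig

    engine_config = TurbomindEngineConfig(**suggested_engine_kwargs())
    return pipeline(model_path, backend_config=engine_config)
```

If loading succeeds with these values, they can be raised step by step until the hang reappears, which helps confirm whether memory pressure is the cause.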
Oh! OK, thank you for your answer!
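Could this be fixed to some extent, or at least raise an error, or notify the main process? I ran into a similar problem in the opencompass project, and it is very hard to catch.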
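Until the hang is reported from inside the library, one workaround on the caller's side is to run the model-loading step in a child process and treat a timeout as a failure. The following is a minimal sketch using only the Python standard library; `run_with_timeout` is a hypothetical helper, not part of lmdeploy or opencompass, and the timeout value has to be chosen by the caller.

```python
import subprocess
import sys


def run_with_timeout(cmd, timeout_s):
    """Run `cmd` in a child process and report whether it finished in time.

    Returns ("ok", stdout) on success, ("failed", stdout) on a non-zero exit
    code, and ("timeout", "") if the child had to be killed after `timeout_s`.
    """
    try:
        proc = subprocess.run(
            cmd, capture_output=True, text=True, timeout=timeout_s
        )
    except subprocess.TimeoutExpired:
        # subprocess.run kills the child before raising, so the parent
        # regains control instead of hanging together with the child.
        return "timeout", ""
    status = "ok" if proc.returncode == 0 else "failed"
    return status, proc.stdout


if __name__ == "__main__":
    # Stand-in for a script that loads the model; here it just sleeps.
    status, _ = run_with_timeout(
        [sys.executable, "-c", "import time; time.sleep(30)"], timeout_s=1
    )
    print(status)
```

The parent process can then log the timeout, retry with smaller memory settings, or skip the model, rather than blocking indefinitely.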
In
Checklist
Describe the bug
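Loading the deepseek-r1-distill-qwen-7b-gptq-int4 model with pipeline gets stuck, but deploying it from the command line works fine.
I wrote "hang" in the title because it really does hang: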
Reproduction
This is the code that triggers the problem:
Whereas command-line deployment works fine:
Environment
Error traceback