1. I have searched related issues but cannot get the expected help.
2. The bug has not been fixed in the latest version.
3. Please note that if the bug-related issue you submitted lacks corresponding environment info and a minimal reproducible demo, it will be challenging for us to reproduce and resolve the issue, reducing the likelihood of receiving feedback.
---Original Message---
From: "Lyu ***@***.***>
Sent: Sunday, February 2, 2025, 9:43 PM
To: ***@***.***>;
Cc: ***@***.******@***.***>;
Subject: Re: [InternLM/lmdeploy] [Bug] Inference of Qwen/Qwen2.5-32B-Instruct-AWQ with the turbomind backend hangs indefinitely in a Kaggle Notebook (Issue #3108)
Which version are you using?
Checklist
Describe the bug
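When I test inference of Qwen/Qwen2.5-32B-Instruct-AWQ in a Kaggle notebook with T4 x2 GPUs and the turbomind backend, the code cell hangs indefinitely.
The lmdeploy version is the 0.7.0.post2+cu118 release from GitHub.
If I specify the pytorch backend manually, the problem does not occur.
Deploying gradio with the turbomind backend also hangs indefinitely.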
Reproduction
!lmdeploy serve api_server Qwen/Qwen2.5-32B-Instruct-AWQ --tp 2
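The reproduction command leaves the backend implicit, so it may help to state it explicitly when comparing the hanging case with the working one. The sketch below only builds and prints the two command lines; it does not launch the server. It assumes that `lmdeploy serve api_server` accepts a `--backend` option with the values `turbomind` and `pytorch` in this release; the helper name `build_serve_cmd` is hypothetical and used only for illustration.

```python
import shlex
from typing import List

MODEL = "Qwen/Qwen2.5-32B-Instruct-AWQ"


def build_serve_cmd(backend: str, tp: int = 2) -> List[str]:
    """Build the api_server command line with the backend stated explicitly.

    Assumption: the CLI accepts `--backend turbomind|pytorch`.
    """
    if backend not in ("turbomind", "pytorch"):
        raise ValueError(f"unknown backend: {backend}")
    return [
        "lmdeploy", "serve", "api_server", MODEL,
        "--tp", str(tp),
        "--backend", backend,
    ]


# Print both variants: turbomind is the case reported to hang,
# pytorch is the case reported to work.
for backend in ("turbomind", "pytorch"):
    print(shlex.join(build_serve_cmd(backend)))
```

In a notebook cell, each printed line can be run by prefixing it with `!`, as in the reproduction command above.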
Environment
Error traceback