InternLM / lmdeploy Public

Notifications You must be signed in to change notification settings
Fork 485
Star 5.6k

Code
Issues 360
Pull requests 30
Discussions
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Security
Insights

Issues: InternLM/lmdeploy

[Docs] inference DeepSeek-V3 with LMDeploy

#2960 opened Dec 26, 2024 by haswelliris

Open 39

Labels 34 Milestones 0

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

360 Open 1,336 Closed

Author

Filter by author

Label

Filter by label

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Milestones

Filter by milestone

Assignee

Filter by who’s assigned

Assigned to nobody

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Issues list

OOM when use InternVL2_5-1B-MPO

#3143 opened Feb 14, 2025 by BobHo5474

[Feature] old version of outlines 0.0.46, break other packages

#3141 opened Feb 13, 2025 by devops724

[Bug] Dockerfile_aarch64_ascend cannot work for docker 18.09

#3140 opened Feb 13, 2025 by bingps

3 tasks done

[Bug] ERROR - engine.py:904 - Task <MainLoopBackground> failed

#3138 opened Feb 12, 2025 by xiezhipeng-git

3 tasks done

internvl2.5-2B量化后推理速度无明显提升

#3135 opened Feb 12, 2025 by nzomi

Qwen2.5-VL-72B-Instruct can be supported now?

#3132 opened Feb 12, 2025 by lijingcheng2021

[Feature] 目前无法有效的获得停止词stop_words，需要获得停止词功能

#3131 opened Feb 11, 2025 by xiezhipeng-git

[Bug] lmdeploy Turbomind 推理导致jupyter 崩溃

#3128 opened Feb 10, 2025 by xiezhipeng-git

3 tasks done

LMDeploy目前支持MiniCPM-o模型吗

#3127 opened Feb 10, 2025 by daihuidai

[Bug] 在昇腾310P上加载Qwen AWQ模型时出现参数传递错误

#3124 opened Feb 9, 2025 by cccccya

3 tasks

[Bug] cogvlm-chat-hf picture understanding ability is bad. awaiting response

#3121 opened Feb 8, 2025 by zhulinJulia24

3 tasks

[Bug] structed_output cannot be used in cu118 with the lated docker images

#3120 opened Feb 8, 2025 by zhulinJulia24

3 tasks

[Feature] ascend 310p上执行图模式

#3119 opened Feb 7, 2025 by yezekun

310P疑似不支持dist.broadcast操作，如何支持多卡

#3118 opened Feb 7, 2025 by yezekun

关于InternVL TurbomindEngine 的疑惑

#3111 opened Feb 6, 2025 by chenzhengda

[Bug] 在Kaggle Notebook中使用turbomind backend推理Qwen/Qwen2.5-32B-Instruct-AWQ会无限期卡死

#3108 opened Feb 2, 2025 by zzc0721

3 tasks done

[Bug] pipeline 加载模型时无限期挂起而命令行部署正常

#3107 opened Jan 31, 2025 by NB-Group

3 tasks done

[Bug] 如何提前终止流式推理pipe.stream_infer awaiting response

#3106 opened Jan 31, 2025 by youyc22

3 tasks done

Closing PyTorchEngine gracefully

#3104 opened Jan 30, 2025 by AvisP

[Bug] CUDA error with Qwen/Qwen2-VL-7B-Instruct

#3101 opened Jan 29, 2025 by andoorve

3 tasks done

[Bug] LogitsWarper deprecated in transformers? (trying to run Qwen/Qwen2.5-VL-72B-Instruct) awaiting response

#3100 opened Jan 29, 2025 by josephrocca

[Feature] 建议官方在kaggle上使用lmdeploy完整安装资源开放一个公共笔记本用来展示lmdeploy的推理性能

#3098 opened Jan 29, 2025 by xiezhipeng-git

[Feature] Automatically Free pipeline of Prompts awaiting response

#3095 opened Jan 27, 2025 by richardjonker2000

[Feature] Support for sparse attention

#3093 opened Jan 27, 2025 by youyc22

[Bug] tp>1时，进行多视频batch推理没有结果输出

#3092 opened Jan 26, 2025 by qingchunlizhi

3 tasks done

Previous 1 2 3 4 5 … 14 15 Next

Previous Next

ProTip! What’s not been updated in a month: updated:<2025-01-14.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly