Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

FEAT: support deepseek-v2 and 2.5 #2292

Merged
merged 5 commits into from
Sep 13, 2024
Merged

Conversation

amumu96
Copy link
Contributor

@amumu96 amumu96 commented Sep 12, 2024

测试环境: NVIDIA GeForce RTX 3090 Ti * 2
测试模型:DeepSeek-V2-Lite-Chat
推理引擎:vllm==0.5.5
测试参数:--num-prompt 100 --concurrency 50
image
推理引擎:transformers==4.39.3
测试参数:--num-prompt 50 --concurrency 5
image
推理引擎:sglang
模型参数:mem_fraction_static=0.8
测试参数:--num-prompt 100 --concurrency 50

@XprobeBot XprobeBot added this to the v0.15 milestone Sep 12, 2024
wuzhaoxin added 2 commits September 12, 2024 11:26
# Conflicts:
#	xinference/model/llm/llm_family.json
#	xinference/model/llm/llm_family_modelscope.json
#	xinference/model/llm/transformers/core.py
@qinxuye qinxuye changed the title FEAT: support deepseek-v2 FEAT: support deepseek-v2 and 2.5 Sep 13, 2024
@ChengjieLi28 ChengjieLi28 merged commit 56de933 into xorbitsai:main Sep 13, 2024
13 checks passed
Vanocore added a commit to Vanocore/xinference that referenced this pull request Sep 22, 2024
* 'main' of https://github.com/xorbitsai/inference:
  FEAT: support qwen2.5-coder-instruct and qwen2.5 sglang (xorbitsai#2332)
  DOC: update models for doc and readme (xorbitsai#2330)
  BUG: fix stable diffusion from dify tool (xorbitsai#2336)
  BUG: support old register llm format (xorbitsai#2335)
  FEAT: Support Qwen 2.5 (xorbitsai#2325)
  BUG: Fix CosyVoice missing output (xorbitsai#2320)
  BUG: [UI] Fix registration page bug. (xorbitsai#2315)
  Bug: modify vllm image version (xorbitsai#2312)
  BUG: modify vllm image version (xorbitsai#2311)
  FEAT: qwen2 audio (xorbitsai#2271)
  BUG: fix sampler_name for img2img (xorbitsai#2301)
  FEAT: Support yi-coder-chat (xorbitsai#2302)
  FEAT: support flux.1 image2image and inpainting (xorbitsai#2296)
  FEAT: support sdapi/img2img (xorbitsai#2293)
  ENH: Support fish speech 1.4 (xorbitsai#2295)
  FEAT: Update Qwen2-VL-Model to support flash_attention_2 implementation (xorbitsai#2289)
  FEAT: support deepseek-v2 and 2.5 (xorbitsai#2292)

# Conflicts:
#	xinference/model/audio/cosyvoice.py
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants