
Conversation

@Xu-Wenqing (Contributor) commented May 7, 2025

Support DeepSeek-V3-0324 function calling.

usage:

vllm serve ... --enable-auto-tool-choice --tool-call-parser deepseek_v3 --chat-template examples/tool_chat_template_deepseekv3.jinja
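
For example, with the published checkpoint (model name illustrative; substitute your own path):

vllm serve deepseek-ai/DeepSeek-V3-0324 --enable-auto-tool-choice --tool-call-parser deepseek_v3 --chat-template examples/tool_chat_template_deepseekv3.jinja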

test script (no streaming):

from openai import OpenAI

# Point these at the running vLLM server, e.g. "http://localhost:8000".
# vLLM only checks the API key if the server was started with --api-key.
openai_api_base = ""
openai_api_key = ""

client = OpenAI(
    api_key=openai_api_key,
    base_url=openai_api_base + "/v1",
)


tools = [
    {
        "type": "function",
        "function": {
            "name": "get_current_temperature",
            "description": "Get current temperature at a location.",
            "parameters": {
                "type": "object",
                "properties": {
                    "location": {
                        "type": "string",
                        "description": 'The location to get the temperature for, in the format "City, State, Country".',
                    },
                    "unit": {
                        "type": "string",
                        "enum": ["celsius", "fahrenheit"],
                        "description": 'The unit to return the temperature in. Defaults to "celsius".',
                    },
                },
                "required": ["location"],
            },
        },
    },
    {
        "type": "function",
        "function": {
            "name": "get_temperature_date",
            "description": "Get temperature at a location and date.",
            "parameters": {
                "type": "object",
                "properties": {
                    "location": {
                        "type": "string",
                        "description": 'The location to get the temperature for, in the format "City, State, Country".',
                    },
                    "date": {
                        "type": "string",
                        "description": 'The date to get the temperature for, in the format "Year-Month-Day".',
                    },
                    "unit": {
                        "type": "string",
                        "enum": ["celsius", "fahrenheit"],
                        "description": 'The unit to return the temperature in. Defaults to "celsius".',
                    },
                },
                "required": ["location", "date"],
            },
        },
    },
]


response = client.chat.completions.create(
    # vLLM serves a single model here; take the first entry from /v1/models.
    model=client.models.list().data[0].id,
    messages=[
        {
            "role": "system",
            "content": "You are a helpful assistant.\n\nCurrent Date: 2024-09-30",
        },
        {
            "role": "user",
            "content": "What's the temperature in San Francisco now? How about tomorrow?",
        },
    ],
    tools=tools,
    tool_choice="auto",
    stream=False,
)

print(response.choices[0].message.content)

tool_calls = response.choices[0].message.tool_calls
for c in tool_calls:
    print(c.function.name, c.function.arguments)

Output:

None
get_current_temperature {"location": "San Francisco, CA, USA", "unit": "celsius"}
get_temperature_date {"location": "San Francisco, CA, USA", "date": "2024-10-01", "unit": "celsius"}
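
To close the loop after the model emits tool calls (not covered by this PR's tests), append the tool results as role "tool" messages and call the model again. A minimal sketch, assuming the request's message list is bound to a messages variable and using stub implementations of the two tools:

import json

def get_current_temperature(location, unit="celsius"):
    # Stub: a real implementation would query a weather service.
    return {"location": location, "temperature": 18, "unit": unit}

def get_temperature_date(location, date, unit="celsius"):
    # Stub: a real implementation would query a forecast service.
    return {"location": location, "date": date, "temperature": 17, "unit": unit}

available_tools = {
    "get_current_temperature": get_current_temperature,
    "get_temperature_date": get_temperature_date,
}

messages.append(response.choices[0].message)  # assistant turn carrying the tool_calls
for call in tool_calls:
    result = available_tools[call.function.name](**json.loads(call.function.arguments))
    messages.append({
        "role": "tool",
        "tool_call_id": call.id,
        "content": json.dumps(result),
    })

followup = client.chat.completions.create(
    model=client.models.list().data[0].id,
    messages=messages,
    tools=tools,
)
print(followup.choices[0].message.content)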

test script (streaming):

from openai import OpenAI

openai_api_base = ""
openai_api_key = ""

client = OpenAI(
    api_key=openai_api_key,
    base_url=openai_api_base + "/v1",
)

# ANSI escape codes for colored terminal output.
class bcolors:
    HEADER = '\033[95m'
    OKBLUE = '\033[94m'
    OKCYAN = '\033[96m'
    OKGREEN = '\033[92m'
    WARNING = '\033[93m'
    FAIL = '\033[91m'
    ENDC = '\033[0m'
    BOLD = '\033[1m'
    UNDERLINE = '\033[4m'


tools = [
    {
        "type": "function",
        "function": {
            "name": "get_current_temperature",
            "description": "Get current temperature at a location.",
            "parameters": {
                "type": "object",
                "properties": {
                    "location": {
                        "type": "string",
                        "description": 'The location to get the temperature for, in the format "City, State, Country".',
                    },
                    "unit": {
                        "type": "string",
                        "enum": ["celsius", "fahrenheit"],
                        "description": 'The unit to return the temperature in. Defaults to "celsius".',
                    },
                },
                "required": ["location"],
            },
        },
    },
    {
        "type": "function",
        "function": {
            "name": "get_temperature_date",
            "description": "Get temperature at a location and date.",
            "parameters": {
                "type": "object",
                "properties": {
                    "location": {
                        "type": "string",
                        "description": 'The location to get the temperature for, in the format "City, State, Country".',
                    },
                    "date": {
                        "type": "string",
                        "description": 'The date to get the temperature for, in the format "Year-Month-Day".',
                    },
                    "unit": {
                        "type": "string",
                        "enum": ["celsius", "fahrenheit"],
                        "description": 'The unit to return the temperature in. Defaults to "celsius".',
                    },
                },
                "required": ["location", "date"],
            },
        },
    },
]


tool_calls_stream = client.chat.completions.create(
    model=client.models.list().data[0].id,
    messages=[
        {
            "role": "system",
            "content": "You are a helpful assistant.\n\nCurrent Date: 2024-09-30",
        },
        {
            "role": "user",
            "content": "What's the temperature in San Francisco now? How about tomorrow?",
        },
    ],
    tools=tools,
    tool_choice="auto",
    stream=True,
    extra_body={"chat_template_kwargs": {"enable_thinking": True}},
)

print("reasoning content(Blue) and content(Green):")
chunks = []
for chunk in tool_calls_stream:
    chunks.append(chunk)
    delta = chunk.choices[0].delta
    # reasoning_content and content are separate fields and not mutually
    # exclusive, so check both rather than chaining elif.
    if getattr(delta, "reasoning_content", None):
        print(bcolors.OKBLUE + delta.reasoning_content, end="", flush=True)
    if getattr(delta, "content", None):
        print(bcolors.OKGREEN + delta.content, end="", flush=True)

print(bcolors.ENDC + "\n### end of reasoning content and content. ###\n")

arguments = []
tool_call_idx = -1
for chunk in chunks:
    if chunk.choices[0].delta.tool_calls:
        tool_call = chunk.choices[0].delta.tool_calls[0]

        # A new index marks the start of the next tool call; flush the
        # previous call's accumulated arguments first.
        if tool_call.index != tool_call_idx:
            if tool_call_idx >= 0:
                print(f"streamed tool call arguments: {arguments[tool_call_idx]}")
            tool_call_idx = tool_call.index
            arguments.append("")
        if tool_call.id:
            print(f"streamed tool call id: {tool_call.id} ")

        if tool_call.function:
            if tool_call.function.name:
                print(f"streamed tool call name: {tool_call.function.name}")

            # Argument text streams in fragments; concatenate them per call index.
            if tool_call.function.arguments:
                arguments[tool_call_idx] += tool_call.function.arguments

if arguments:
    print(f"streamed tool call arguments: {arguments[-1]}")

Output:

reasoning content(Blue) and content(Green):


### end of reasoning content and content. ###

streamed tool call id: chatcmpl-tool-e2c23f26c3fa4a45a5eb029babe8ba12 
streamed tool call name: get_current_temperature
streamed tool call arguments: {"location": "San Francisco, USA", "unit": "celsius"}
streamed tool call id: chatcmpl-tool-c4945b99fcbd4134971134c302eaf6a9 
streamed tool call name: get_temperature_date
streamed tool call arguments: {"location": "San Francisco, USA", "date": "2024-10-01", "unit": "celsius"}

@github-actions bot commented May 7, 2025

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs do not trigger a full CI run by default; only the fastcheck CI runs, covering a small, essential subset of tests to catch errors quickly. You can run additional CI tests on top of those by going to your fastcheck build in the Buildkite UI (linked in the PR checks section) and unblocking them. If you do not have permission to unblock, ping simon-mo or khluu to add you to our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either: Add ready label to the PR or enable auto-merge.

🚀

@mergify bot added the documentation, frontend, and tool-calling labels May 7, 2025
@Xu-Wenqing Xu-Wenqing marked this pull request as ready for review May 7, 2025 11:20
@Xu-Wenqing Xu-Wenqing changed the title [Feature] Support DeepSeekV3 Function Call [WIP][Feature] Support DeepSeekV3 Function Call May 7, 2025
@Xu-Wenqing Xu-Wenqing marked this pull request as draft May 7, 2025 11:22
@Xu-Wenqing Xu-Wenqing changed the title [WIP][Feature] Support DeepSeekV3 Function Call [Feature] Support DeepSeekV3 Function Call May 7, 2025
@Xu-Wenqing Xu-Wenqing marked this pull request as ready for review May 7, 2025 11:42
@Xu-Wenqing (Contributor Author) commented:

#14745

@houseroad (Collaborator) commented:

Could you lint and attach a test plan?

@aarnphm (Collaborator) left a comment:

qq. The parser looks good for merging for now, but do we need the tool chat template?

iirc this is already included in the default chat templates from the model repo?

@Xu-Wenqing (Contributor Author) commented May 8, 2025

qq. The parser looks good for merging for now, but do we need the tool chat template?

iirc this is already included in the default chat templates from the model repo?

@aarnphm yeah, the DeepSeek-V3-0324 default chat template doesn't work with vLLM, so I fixed some issues in it, e.g. tool['function']['arguments'] was changed to tool['function']['arguments']|tojson: https://github.com/Xu-Wenqing/vllm/blob/dev/dpsk_r1_tool_parser/examples/tool_chat_template_deepseekv3.jinja#L56-L62
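
The |tojson filter matters because Jinja's default stringification of a dict is Python's repr, which is not valid JSON. A standalone jinja2 illustration (not the actual template):

from jinja2 import Environment

env = Environment()
args = {"location": "San Francisco, CA, USA", "unit": "celsius"}

# Without |tojson: single-quoted Python repr, which json.loads rejects.
print(env.from_string("{{ arguments }}").render(arguments=args))
# {'location': 'San Francisco, CA, USA', 'unit': 'celsius'}

# With |tojson: valid JSON the tool-call parser can load.
print(env.from_string("{{ arguments | tojson }}").render(arguments=args))
# {"location": "San Francisco, CA, USA", "unit": "celsius"}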

I updated the code, need review again, thanks.

@Xu-Wenqing (Contributor Author) commented May 8, 2025

Could you lint and attach a test plan?

@houseroad updated the description, including some test cases.

@Xu-Wenqing Xu-Wenqing requested a review from aarnphm May 8, 2025 07:00
logger = init_logger(__name__)


@ToolParserManager.register_module("deepseekv3")
(Collaborator) Suggested change:
- @ToolParserManager.register_module("deepseekv3")
+ @ToolParserManager.register_module("deepseek_v3")

minor s/deepseekv3/deepseek_v3

@Xu-Wenqing (Contributor Author) replied:
Done.

@aarnphm (Collaborator) left a comment:

Tiny

@Xu-Wenqing (Contributor Author) replied:

Tiny

@aarnphm done.

@Xu-Wenqing (Contributor Author) commented May 10, 2025

@DarkLight1337 @simon-mo @houseroad @russellb this needs a review from someone with write access, thanks.

@DarkLight1337 DarkLight1337 enabled auto-merge (squash) May 12, 2025 05:54
@github-actions bot added the ready label (ONLY add when PR is ready to merge/full CI is needed) May 12, 2025
@vllm-bot vllm-bot merged commit 3a5ea75 into vllm-project:main May 12, 2025
11 of 14 checks passed
RichardoMrMu pushed a commit to RichardoMrMu/vllm that referenced this pull request May 12, 2025
Signed-off-by: 许文卿 <[email protected]>
Signed-off-by: Xu Wenqing <[email protected]>
Signed-off-by: Mu Huai <[email protected]>
mawong-amd pushed a commit to ROCm/vllm that referenced this pull request May 14, 2025
zzzyq pushed a commit to zzzyq/vllm that referenced this pull request May 24, 2025
Signed-off-by: 许文卿 <[email protected]>
Signed-off-by: Xu Wenqing <[email protected]>
Signed-off-by: Yuqi Zhang <[email protected]>
@WangErXiao (Contributor) commented:

@Xu-Wenqing Can this support deepseek_r1 tool calling?

@Xu-Wenqing (Contributor Author) replied:

@Xu-Wenqing Can this support deepseek_r1 tool calling?

@WangErXiao Yes. DeepSeek just released the DeepSeek-R1-0528 model, which supports function calling. We can use the "deepseek_v3" tool call parser with it, but the chat template tool_chat_template_deepseekv3.jinja here doesn't support DeepSeek-R1-0528, so I created PR #18874 to add a DeepSeek-R1-0528 chat template.

@WangErXiao (Contributor) commented:

LGTM 👍 @Xu-Wenqing

@xueshuai0922 commented:

@Xu-Wenqing According to the official documentation (https://docs.vllm.ai/en/stable/features/tool_calling.html?h=function+call#deepseek-v3-models-deepseek_v3), vLLM versions above 0.8.3 already support function calling for DeepSeek-V3 models. What is the difference between that and this update?

huiqiwa pushed a commit to huiqiwa/vllm-fork that referenced this pull request Oct 21, 2025
huiqiwa pushed a commit to huiqiwa/vllm-fork that referenced this pull request Oct 22, 2025

Labels: documentation, frontend, ready, tool-calling