
[Bugfix] fix the default reasoning mode in Reasoner Grammar. #10831

Open
jeremyzhang866 wants to merge 22 commits into sgl-project:main from jeremyzhang866:jeremy_reasoner_grammer_fix

Conversation

@jeremyzhang866 commented Sep 24, 2025

Motivation

When we start a hybrid reasoning model (such as DeepSeek V3.1 or Qwen3-14B) on the server side with both the reasoning parser and speculative decoding enabled, and on the client side disable "thinking mode" via chat_template_kwargs while also enabling JSON-constrained decoding, SGLang produces abnormal results.
#10789

Modifications

We found that this is likely related to the ReasonerGrammarBackend, where ReasonerGrammarObject defaults to is_in_reasoning = True, so grammar constraints are only applied after a think-end token has been seen. This PR adjusts that initialization when "thinking mode" is disabled in chat_template_kwargs.
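The idea behind the fix can be sketched in a few lines. This is a hypothetical, heavily simplified stand-in, not SGLang's actual code: ToyGrammar and the accept_token flow are invented for illustration; only the name ReasonerGrammarObject and the think_end_id / is_in_reasoning fields come from the PR discussion.

```python
class ToyGrammar:
    """Minimal grammar stub that records which tokens it was asked to check."""

    def __init__(self):
        self.seen = []

    def accept_token(self, token_id):
        self.seen.append(token_id)
        return True


class ReasonerGrammarObject:
    """Simplified sketch of a grammar wrapper for hybrid reasoning models."""

    def __init__(self, grammar, think_end_id, is_in_reasoning=True):
        self.grammar = grammar
        self.think_end_id = think_end_id
        # The reported bug: this was hard-coded to True, so when the client
        # disabled thinking mode the model never emitted a think-end token,
        # the grammar was never consulted, and constrained decoding failed.
        # Initializing it from the request's thinking setting fixes that.
        self.is_in_reasoning = is_in_reasoning

    def accept_token(self, token_id):
        # While "reasoning", tokens bypass the grammar; the grammar only
        # activates once the think-end token has been observed.
        if self.is_in_reasoning:
            if token_id == self.think_end_id:
                self.is_in_reasoning = False
            return True
        return self.grammar.accept_token(token_id)


# With thinking disabled, the grammar must be active from the first token.
grammar = ToyGrammar()
obj = ReasonerGrammarObject(grammar, think_end_id=99, is_in_reasoning=False)
obj.accept_token(7)
print(grammar.seen)  # → [7]: the grammar saw the very first token
```

With the old default (is_in_reasoning=True) the same call would bypass the grammar entirely, which matches the runaway output shown in the "before" section below.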

Reproduction script

import json
from openai import OpenAI

def main(model_name="Qwen3-14B", thinking=False, use_json_schema=None):

    niogpt_base_url = "http://0.0.0.0:30000/v1"
    niogpt_api_key = "sk-no-api-key-needed"
    client = OpenAI(
        api_key=niogpt_api_key,
        base_url=niogpt_base_url
    )
    json_schema = {
        "type": "object",
        "properties": {
            "population": {"type": "integer"},
            "name": {"type": "string", "pattern": "^[\\w]+$"},
        },
        "required": ["name", "population"],
    }
    
    chat_kwargs = {"enable_thinking": thinking}

    # Base request parameters
    request_params = dict(
        model=model_name,
        messages=[
            {
                "role": "user",
                "content": "show me the information of the capital of China in the JSON format.",
            }
        ],
        temperature=0,
        max_tokens=512,
        extra_body={"chat_template_kwargs": chat_kwargs}
    )
    if use_json_schema is True:
        request_params["response_format"] = {
            "type": "json_schema",
            "json_schema": {"name": "foo", "schema": json_schema}
        }

    response = client.chat.completions.create(**request_params)

    print("========== completion_tokens ==========")
    print(response.usage.completion_tokens)

    print("========== content ==========")
    for choice in response.choices:
        print(choice.message.content if choice.message.content is not None else "None")

    print("========== reasoning_content ==========")
    for choice in response.choices:
        print(choice.message.reasoning_content if choice.message.reasoning_content is not None else "None")
        print("=" * 50 + "\n")


if __name__ == "__main__":
    main(model_name="Qwen3-14B", thinking=False, use_json_schema=True)

  • Before
========== completion_tokens ==========
512
========== content ==========
```!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!
========== reasoning_content ==========
None
==================================================
  • After PR
========== completion_tokens ==========
22
========== content ==========
{"population": 215423946, "name": "Beijing"}
========== reasoning_content ==========
None
==================================================

Benchmarking and Profiling

Checklist

@gemini-code-assist (Contributor)

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

@jeremyzhang866 (Author)

@minleminzui please have a look

@jeremyzhang866 (Author)

@hnyls2002 I reopened the PR — please review when convenient. Thanks!

@jeremyzhang866 (Author)

@hnyls2002 have a look. Thanks!

@jeremyzhang866 (Author)

@zhyncs Can you help review this?

@minleminzui (Collaborator)

@jeremyzhang866 Could you please verify that sglang/test/srt/test_reasoning_parser.py still passes with your change?

@jeremyzhang866 (Author) commented Sep 25, 2025

> @jeremyzhang866 Could you please verify that sglang/test/srt/test_reasoning_parser.py still passes with your change?

@minleminzui I tested it and found no issues. Do you have any suggestions? Thanks.

zjm.zhang@chronic-daddy-founder-hzh-zjm-zhang-master-0:~/zjm_workspace/sglang/test/srt$ python3 test_reasoning_parser.py
................................................
----------------------------------------------------------------------
Ran 48 tests in 0.002s

@jeremyzhang866 (Author)

@hnyls2002 @zhyncs Could you help review it when you have time?

@jeremyzhang866 (Author)

@xiezhq-hermann Could you help review it when you have time?

@JustinTong0323 (Collaborator)

/gemini review

@gemini-code-assist bot (Contributor) left a comment


Code Review

This pull request addresses a bug where grammar constraints were not correctly applied for reasoning models when 'thinking mode' was disabled. The fix involves introducing a may_can_reasoning flag that is passed from the scheduler to the ReasonerGrammarObject to correctly initialize its reasoning state. The overall approach is sound and effectively resolves the issue. My review includes a few suggestions to improve code clarity and maintainability by restoring type hints that were removed in base_grammar_backend.py and correcting an inconsistent type hint for the cache.

@JustinTong0323 (Collaborator) left a comment


Thanks for the contribution! Overall LGTM, could you help resolve gemini's comments?

@jeremyzhang866 (Author) commented Sep 26, 2025

> Thanks for the contribution! Overall LGTM, could you help resolve gemini's comments?

@JustinTong0323 I have fixed this; please review it again. Thanks.

@jeremyzhang866 (Author)

@JustinTong0323 Could you help review it again? Thanks.

@jeremyzhang866 (Author)

@JustinTong0323 Are there any other comments, or can it be merged? Thanks :)

@JustinTong0323 (Collaborator)

> @JustinTong0323 are there any other comments, or can it be merged? thanks :)

We need to wait for the "required" CI to go green, but there seems to be an issue on the Hugging Face side blocking the PR. I will keep an eye on this PR. Thanks for your effort!

@jeremyzhang866 (Author)

> We need to wait for the "required" CI to go green, but there seems to be an issue on the Hugging Face side blocking the PR. I will keep an eye on this PR. Thanks for your effort!

Thank you for your reply.

@jeremyzhang866 (Author) commented Oct 24, 2025

@hnyls2002 @xiezhq-hermann please have a look. thanks

@JustinTong0323 (Collaborator) commented Oct 28, 2025

> @hnyls2002 @xiezhq-hermann please have a look. thanks

Sorry for the late reply. I just got a similar issue, and it would be better to add a test for this corner case. (Also, with lots of PRs waiting to merge, I would really appreciate it if you could ping me on Slack to accelerate the process of any PR.)

    super().__init__()
    self.grammar = grammar
    self.think_end_id = think_end_id
    self.is_in_reasoning = True
Collaborator:

Why don't we simply set this field to False?

@jeremyzhang866 (Author)

> Sorry for the late reply. I just got a similar issue, and it would be better to add a test for this corner case. (Also, I would really appreciate it if you could ping me on Slack to accelerate the process.)

@JustinTong0323 Sorry, could you please clarify what I should do? I thought the example above already showed the issue. Thanks.

@JustinTong0323 (Collaborator)

> @JustinTong0323 Sorry, could you please clarify what I should do? I thought the example above already showed the issue. Thanks.

Could you check the PR I just mentioned? Maybe you could discuss and find out how to combine your PRs. (Sorry, I am OOO this week, so replies may be late.)

@jeremyzhang866 (Author)

> Could you check the PR I just mentioned? Maybe you could discuss and find out how to combine your PRs. (Sorry, I am OOO this week, so replies may be late.)

Thanks for your reply. I just checked that PR — the motivation behind it is the same as mine.

That PR might have better extensibility, but my changes are a bit simpler.

