feat: Add 'none' reasoning effort to ChatCompletionRequest by Javtor · Pull Request #20556 · sgl-project/sglang

Javtor · 2026-03-13T23:34:44Z

Motivation

The OpenAI Python SDK's ReasoningEffort type includes "none" (source), and vLLM recently added support for it (vllm#36238). SGLang currently only accepts "low", "medium", "high".

This is useful for models like Qwen3.5 that have built-in reasoning, so callers can disable thinking per-request via reasoning_effort: "none" without requiring engine-level configuration like --default-chat-template-kwargs '{"enable_thinking": false}'.

Modifications

protocol.py:

Add "none" to reasoning_effort Literal type and normalize_reasoning_inputs validator
When reasoning_effort="none", default thinking and enable_thinking to False in chat_template_kwargs via setdefault (user-provided values take precedence)

serving_chat.py:

Reject reasoning_effort="none" in the harmony (GPT-OSS) path with ValueError

test_protocol.py:

Add tests for reasoning_effort="none" via top-level param and nested reasoning dict

Checklist

Format your code according to the Format code with pre-commit.
Add unit tests according to the Run and add unit tests.
Follow the SGLang code style guidance.

Review Process

Ping Merge Oncalls to start the PR flow. See the PR Merge Process.
Get approvals from CODEOWNERS and other reviewers.
Trigger CI tests with comments or contact authorized users to do so.
- /tag-run-ci-label, /rerun-failed-ci, /tag-and-rerun-ci
After green CI and required approvals, ask Merge Oncalls to merge.

gemini-code-assist · 2026-03-13T23:34:49Z

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

chatgpt-codex-connector · 2026-03-13T23:34:49Z

You have reached your Codex usage limits for code reviews. You can see your limits in the Codex usage dashboard.
To continue using code reviews, you can upgrade your account or add credits to your account and enable them for code reviews in your settings.

Add support for reasoning_effort='none' in the chat completions API, matching the OpenAI Python SDK's ReasoningEffort type which includes 'none' (see openai-python: types/shared/reasoning_effort.py). When reasoning_effort='none': - Defaults thinking and enable_thinking to false in chat_template_kwargs (respects explicit user overrides via setdefault) - Harmony (GPT-OSS) path raises ValueError (not supported) This is useful for models like Qwen3.5 that have built-in reasoning but where callers want direct output without thinking tokens.

hnyls2002

LGTM.

hnyls2002 · 2026-03-16T03:11:47Z

/tag-and-rerun-ci

hnyls2002 · 2026-03-16T03:25:44Z

All hit tests pass locally.

…ct#20556)

Javtor requested review from CatherineSue, JustinTong0323, ispobock, merrymercy and slin1237 as code owners March 13, 2026 23:34

Javtor force-pushed the add-none-reasoning-effort branch from b35dbb2 to 3e344d0 Compare March 13, 2026 23:36

Javtor force-pushed the add-none-reasoning-effort branch from 3e344d0 to ec2a77f Compare March 13, 2026 23:36

Javtor changed the title ~~Add none reasoning effort~~ feat: Add 'none' reasoning effort to ChatCompletionRequest Mar 13, 2026

hnyls2002 approved these changes Mar 16, 2026

View reviewed changes

github-actions bot added the run-ci label Mar 16, 2026

hnyls2002 merged commit afc71ba into sgl-project:main Mar 16, 2026
95 of 142 checks passed

Wangzheee pushed a commit to Wangzheee/sglang that referenced this pull request Mar 21, 2026

feat: Add 'none' reasoning effort to ChatCompletionRequest (sgl-proje…

4009cef

…ct#20556)

0-693 pushed a commit to 0-693/sglang that referenced this pull request Mar 25, 2026

feat: Add 'none' reasoning effort to ChatCompletionRequest (sgl-proje…

c2b92a6

…ct#20556)

JustinTong0323 pushed a commit to JustinTong0323/sglang that referenced this pull request Apr 7, 2026

feat: Add 'none' reasoning effort to ChatCompletionRequest (sgl-proje…

6fb70b7

…ct#20556)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Add 'none' reasoning effort to ChatCompletionRequest#20556

feat: Add 'none' reasoning effort to ChatCompletionRequest#20556
hnyls2002 merged 1 commit intosgl-project:mainfrom
Javtor:add-none-reasoning-effort

Javtor commented Mar 13, 2026 •

edited

Loading

Uh oh!

gemini-code-assist bot commented Mar 13, 2026

Uh oh!

chatgpt-codex-connector bot commented Mar 13, 2026

Uh oh!

hnyls2002 left a comment

Uh oh!

hnyls2002 commented Mar 16, 2026

Uh oh!

hnyls2002 commented Mar 16, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Javtor commented Mar 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Motivation

Modifications

Checklist

Review Process

Uh oh!

gemini-code-assist bot commented Mar 13, 2026

Uh oh!

chatgpt-codex-connector bot commented Mar 13, 2026

Uh oh!

hnyls2002 left a comment

Choose a reason for hiding this comment

Uh oh!

hnyls2002 commented Mar 16, 2026

Uh oh!

hnyls2002 commented Mar 16, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Javtor commented Mar 13, 2026 •

edited

Loading