Skip to content

[Feature] Add 'none' reasoning effort to ChatCompletionRequest#20552

Closed
Javtor wants to merge 1 commit intosgl-project:mainfrom
Javtor:add-none-reasoning-effort
Closed

[Feature] Add 'none' reasoning effort to ChatCompletionRequest#20552
Javtor wants to merge 1 commit intosgl-project:mainfrom
Javtor:add-none-reasoning-effort

Conversation

@Javtor
Copy link
Copy Markdown
Contributor

@Javtor Javtor commented Mar 13, 2026

Motivation

The OpenAI Python SDK's ReasoningEffort type includes 'none' (source), and vLLM recently added support for it (vllm#36238). SGLang currently only accepts 'low', 'medium', 'high'.

Modifications

python/sglang/srt/entrypoints/openai/protocol.py:

  • Add "none" to reasoning_effort Literal type
  • Add "none" to the normalize_reasoning_inputs validator's accepted values
  • Add disable_thinking_for_none_effort model validator that sets chat_template_kwargs["thinking"] = False and separate_reasoning = False when reasoning_effort == "none"

python/sglang/srt/entrypoints/harmony_utils.py:

  • Add "none" to get_system_message type hint
  • Raise ValueError when reasoning_effort == "none" in harmony path (not supported, same approach as vLLM)

Checklist

  • Format: pre-commit run --all-files (no changes to formatted files)
  • Matches vLLM's approach in #36238
  • Harmony path explicitly rejects "none" with clear error

Add support for reasoning_effort='none' in the chat completions API,
matching the OpenAI Python SDK's ReasoningEffort type which includes
'none' (see openai-python: types/shared/reasoning_effort.py).

When reasoning_effort='none':
- Sets chat_template_kwargs["thinking"] = False to disable reasoning
- Sets separate_reasoning = False
- Harmony path raises ValueError (same as vllm#36238)

This is useful for models like Qwen3.5 that have built-in reasoning
but where callers want direct output without thinking tokens.
@gemini-code-assist
Copy link
Copy Markdown
Contributor

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

@chatgpt-codex-connector
Copy link
Copy Markdown

You have reached your Codex usage limits for code reviews. You can see your limits in the Codex usage dashboard.
To continue using code reviews, you can upgrade your account or add credits to your account and enable them for code reviews in your settings.

@Javtor Javtor closed this Mar 13, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant