[Frontend] Support GLM-4.5 / GLM-4.7 with enable_thinking: false #31788

Merged
DarkLight1337 merged 3 commits into vllm-project:main from chaunceyjiang:glm7_enable_thinking on Jan 6, 2026

Conversation

@chaunceyjiang
Collaborator

@chaunceyjiang chaunceyjiang commented Jan 6, 2026

Purpose

[Frontend] Support GLM-4.5 / GLM-4.7 with enable_thinking: false

FIX #31319
FIX #31449 (comment)

Test Plan

Test Result


Essential Elements of an Effective PR Description Checklist
  • The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
  • The test plan, such as providing test command.
  • The test results, such as pasting the results comparison before and after, or e2e results
  • (Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
  • (Optional) Release notes update. If your change is user facing, please update the release notes draft in the Google Doc.

Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>
@chaunceyjiang chaunceyjiang requested a review from aarnphm as a code owner January 6, 2026 07:51
@mergify mergify bot added the deepseek Related to DeepSeek models label Jan 6, 2026
Contributor

@gemini-code-assist gemini-code-assist bot left a comment


Code Review

This pull request aims to add support for GLM-4.5/GLM-4.7 models, particularly handling the enable_thinking: false parameter. The changes involve updating reasoning parsers.

My review found a critical logic issue in holo2_reasoning_parser.py where the or operator is used incorrectly, preventing the enable_thinking: false flag from disabling the thinking feature as intended. I've suggested using and instead. Additionally, I've pointed out an outdated docstring in glm4_moe_reasoning_parser.py that needs to be updated to reflect the new class inheritance.
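The `or`-vs-`and` issue the bot describes can be illustrated with a minimal sketch. The function names and the two-flag model below are assumptions for illustration, not the actual vLLM parser code: the point is only that combining a default of "thinking on" with the request's flag via `or` makes `enable_thinking: false` unreachable.

```python
# Illustrative sketch (names are hypothetical, not the vLLM internals):
# when the parser's default is "thinking enabled", `or` can never let
# a request turn thinking off, while `and` respects the request flag.

def thinking_enabled_buggy(default_enabled: bool, request_flag: bool) -> bool:
    # `or` short-circuits: with default_enabled=True, the request's
    # enable_thinking: false is silently ignored.
    return default_enabled or request_flag

def thinking_enabled_fixed(default_enabled: bool, request_flag: bool) -> bool:
    # `and` lets either side disable thinking, which is the intended
    # semantics for enable_thinking: false.
    return default_enabled and request_flag

print(thinking_enabled_buggy(True, False))  # True  -- flag ignored
print(thinking_enabled_fixed(True, False))  # False -- flag respected
```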

Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>
@chaunceyjiang
Collaborator Author

/cc @hhd52859 @zhangsongqing @athenacykes Could you help test this?

Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>
@hhd52859

hhd52859 commented Jan 6, 2026

/cc @hhd52859 @zhangsongqing @athenacykes Could you help test this?

LGTM, thanks!

@chaunceyjiang
Collaborator Author

/cc @zRzRzRzRzRzRzR PTAL

@JaredforReal
Contributor

@chaunceyjiang it's not working well; extra_body is ignored
[screenshot]

@JaredforReal
Contributor

@chaunceyjiang could you try working on protocol.py? Appreciate that.

@cjackal
Contributor

cjackal commented Jan 6, 2026

So in effect the glm45 reasoning parser is --reasoning-parser deepseek_v3 --default-chat-template-kwargs '{"enable_thinking": true}'. After #31581, reasoning parsers handle default chat template kwargs well, FYI.

@chaunceyjiang
Collaborator Author

So in effect the glm45 reasoning parser is --reasoning-parser deepseek_v3 --default-chat-template-kwargs '{"enable_thinking": true}'. After #31581, reasoning parsers handle default chat template kwargs well, FYI.

@cjackal Yes. This is more of a workaround. The root cause is still an incorrect implementation of the GLM45 reasoning parser.
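The workaround described above relies on per-request kwargs taking precedence over the server-side default. A minimal sketch of that interaction, with names chosen for illustration rather than taken from the vLLM internals:

```python
# Illustrative sketch of how --default-chat-template-kwargs interacts
# with a request's chat_template_kwargs (not the exact vLLM code):
# the per-request values override the server default.

server_defaults = {"enable_thinking": True}   # --default-chat-template-kwargs
request_kwargs = {"enable_thinking": False}   # from the client's request body

# Dict merge: later keys win, so the request overrides the default.
merged = {**server_defaults, **request_kwargs}
print(merged)  # {'enable_thinking': False}
```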

@chaunceyjiang
Collaborator Author

@chaunceyjiang it's not working well; extra_body is ignored

@JaredforReal, could you share your client code? I think the issue is with the client-side code.
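For reference, a common client-side pitfall is building the request body so that chat_template_kwargs never reaches the server. A minimal sketch of a correct body follows; the model name is illustrative, and with the official `openai` SDK this dict would be passed through the `extra_body` parameter of `client.chat.completions.create(...)`:

```python
import json

# Sketch of the request body the server expects: chat_template_kwargs
# must be a top-level key of the JSON body (via extra_body in the
# openai SDK), not nested somewhere the server won't look.

payload = {
    "model": "zai-org/GLM-4.7",  # illustrative model name
    "messages": [{"role": "user", "content": "Hello"}],
    "chat_template_kwargs": {"enable_thinking": False},
}

print(json.dumps(payload["chat_template_kwargs"]))
```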

@JaredforReal
Contributor

@chaunceyjiang Thanks!

@chaunceyjiang chaunceyjiang added the ready ONLY add when PR is ready to merge/full CI is needed label Jan 6, 2026
@chaunceyjiang
Collaborator Author

cc @DarkLight1337 PTAL

@DarkLight1337 DarkLight1337 enabled auto-merge (squash) January 6, 2026 12:11
@DarkLight1337 DarkLight1337 merged commit 0202971 into vllm-project:main Jan 6, 2026
46 checks passed
Anexdeus pushed a commit to Anexdeus/vllm that referenced this pull request Jan 6, 2026
LucasWilkinson pushed a commit to neuralmagic/vllm that referenced this pull request Jan 6, 2026
yugong333 pushed a commit to yugong333/vllm that referenced this pull request Jan 9, 2026
@mballesterosc

mballesterosc commented Jan 9, 2026

In streaming mode, when {"chat_template_kwargs": {"enable_thinking": False}} is set, the output is encapsulated in <think></think> tags. Is this intended?

Thanks

@RocketRider

In streaming mode, when {"chat_template_kwargs": {"enable_thinking": False}} is set, the output is encapsulated in <think></think> tags. Is this intended?

Thanks

Did you test it with the latest nightly?
For me it is fixed with the latest version.

@mballesterosc

In streaming mode, when {"chat_template_kwargs": {"enable_thinking": False}} is set, the output is encapsulated in <think></think> tags. Is this intended?
Thanks

Did you test it with the latest nightly? For me it is fixed with the latest version.

I'm sorry, I tried with the nightly build from the day the pull request was closed, and it's OK now. Thank you.
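The symptom discussed above is the kind of split a reasoning parser performs on `<think>...</think>` output. A small illustrative helper (not vLLM code) that separates the reasoning block from the final content; with the fix merged, enable_thinking: false should produce no `<think>` block at all, making such post-processing unnecessary:

```python
# Hypothetical helper for illustration only: split a completion into
# (reasoning, content) when it starts with a <think>...</think> block.

def split_reasoning(text: str) -> tuple[str, str]:
    start, end = "<think>", "</think>"
    if text.startswith(start) and end in text:
        close = text.index(end)
        # Reasoning is everything between the tags; content follows.
        return text[len(start):close], text[close + len(end):]
    return "", text

print(split_reasoning("<think>weigh options</think>Final answer"))
# -> ('weigh options', 'Final answer')
```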

akh64bit pushed a commit to akh64bit/vllm that referenced this pull request Jan 16, 2026
dsuhinin pushed a commit to dsuhinin/vllm that referenced this pull request Jan 21, 2026
ItzDEXX pushed a commit to ItzDEXX/vllm that referenced this pull request Feb 19, 2026

Labels

deepseek Related to DeepSeek models ready ONLY add when PR is ready to merge/full CI is needed

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[Bug]: GLM-4.7-FP8 missing beginning <think> tag

8 participants