[Reasoning] Add glm47 reasoning parser for GLM-4.7 models by QwertyJack · Pull Request #33349 · vllm-project/vllm

QwertyJack · 2026-01-29T15:49:50Z

Summary

GLM-4.7 models have a different chat template than GLM-4.5/4.6 models:

GLM-4.5/4.6: <think> is NOT included in the generation prompt
GLM-4.7: <think> IS included in the generation prompt

This means GLM-4.7 should use DeepSeekR1ReasoningParser instead of DeepSeekV3ReasoningWithThinkingParser used by GLM-4.5/4.6.

Changes

Add glm47 reasoning parser registration
Add tests for glm47 reasoning parser
Update documentation to include GLM-4.7 series

Test plan

All 24 new tests pass for glm47 reasoning parser
Existing glm45 tests still pass

Fixes #33348

GLM-4.7 models have a different chat template than GLM-4.5/4.6 models: - GLM-4.5/4.6: <think> is NOT in the generation prompt - GLM-4.7: <think> IS included in the generation prompt This means GLM-4.7 should use DeepSeekR1ReasoningParser instead of DeepSeekV3ReasoningWithThinkingParser used by GLM-4.5/4.6. Changes: - Add glm47 reasoning parser registration - Add tests for glm47 reasoning parser - Update documentation Fixes vllm-project#33348 Signed-off-by: QwertyJack <7554089+QwertyJack@users.noreply.github.com>

mergify · 2026-01-29T15:50:32Z

Documentation preview: https://vllm--33349.org.readthedocs.build/en/33349/

gemini-code-assist

Code Review

This pull request adds support for GLM-4.7 reasoning models by aliasing the glm47 parser to the existing DeepSeekR1ReasoningParser. The changes are well-structured, including updates to documentation, registration of the new parser, and a comprehensive new test suite. The approach of reusing an existing parser is sound given the model's behavior.

My main feedback is regarding the new test file, which introduces significant code duplication. While the tests themselves are thorough, refactoring them to be more reusable would greatly improve the long-term maintainability of the test suite. I've left a specific comment with a suggestion on how to achieve this.

tests/reasoning/test_glm47_reasoning_parser.py

QwertyJack · 2026-01-29T16:04:28Z

@zRzRzRzRzRzRzR PTAL

chaunceyjiang

Have you tested --reasoning-parser=glm45?

QwertyJack · 2026-01-30T04:32:18Z

After further investigation, I found that:

v0.13.0 had a bug - the old Glm4MoeModelReasoningParser had issues that caused thinking to be mixed with content for GLM-4.7
This was fixed in recent PRs:
- PR fix no think of GLM-4.5 / GLM-4.7 #31449 simplified Glm4MoeModelReasoningParser to inherit from DeepSeekR1ReasoningParser
- PR [Frontend] Support GLM-4.5 / GLM-4.7 with enable_thinking: false #31788 changed it to inherit from Holo2ReasoningParser for enable_thinking: false support
- PR [Misc] Provide a DeepSeek ReasoningParser with thinking enabled by default #33221 (by @chaunceyjiang, merged 2 days ago) further simplified by removing Glm4MoeModelReasoningParser entirely and having glm45 use DeepSeekV3ReasoningWithThinkingParser directly
Current state on main: glm45 now uses DeepSeekV3ReasoningWithThinkingParser which defaults to thinking=True and delegates to DeepSeekR1ReasoningParser - this should work for both GLM-4.5/4.6 and GLM-4.7

Given that the recent refactoring has unified the parser, I can see the argument that a separate glm47 parser may not be strictly necessary anymore.

However, having glm47 as an explicit alias still provides:

Clear semantic mapping for users (GLM-4.7 → glm47)
Consistency with the existing glm47 tool parser

@chaunceyjiang What do you think? Should we keep the glm47 alias, or is the current glm45 sufficient for all GLM-4.x models?

chaunceyjiang · 2026-01-30T05:39:14Z

current glm45 sufficient for all GLM-4.x models

+1

QwertyJack · 2026-01-30T08:40:28Z

Closing this PR based on feedback from @chaunceyjiang.

After investigation, we found that:

The issue in v0.13.0 has been fixed in recent PRs (fix no think of GLM-4.5 / GLM-4.7 #31449, [Frontend] Support GLM-4.5 / GLM-4.7 with enable_thinking: false #31788, [Misc] Provide a DeepSeek ReasoningParser with thinking enabled by default #33221)
Current main branch: glm45 now uses DeepSeekV3ReasoningWithThinkingParser which works correctly for all GLM-4.x models (GLM-4.5, GLM-4.6, and GLM-4.7)

A separate glm47 parser is not needed - users should use --reasoning-parser=glm45 for all GLM-4.x models.

Thanks for the review!

QwertyJack requested review from aarnphm and chaunceyjiang as code owners January 29, 2026 15:49

mergify bot added the documentation Improvements or additions to documentation label Jan 29, 2026

gemini-code-assist bot reviewed Jan 29, 2026

View reviewed changes

tests/reasoning/test_glm47_reasoning_parser.py Show resolved Hide resolved

chaunceyjiang reviewed Jan 30, 2026

View reviewed changes

QwertyJack closed this Jan 30, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Reasoning] Add glm47 reasoning parser for GLM-4.7 models#33349

[Reasoning] Add glm47 reasoning parser for GLM-4.7 models#33349
QwertyJack wants to merge 1 commit intovllm-project:mainfrom
QwertyJack:fix/glm47-reasoning

QwertyJack commented Jan 29, 2026

Uh oh!

mergify bot commented Jan 29, 2026

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

QwertyJack commented Jan 29, 2026

Uh oh!

chaunceyjiang left a comment

Uh oh!

QwertyJack commented Jan 30, 2026

Uh oh!

chaunceyjiang commented Jan 30, 2026

Uh oh!

QwertyJack commented Jan 30, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

QwertyJack commented Jan 29, 2026

Summary

Changes

Test plan

Uh oh!

mergify bot commented Jan 29, 2026

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

QwertyJack commented Jan 29, 2026

Uh oh!

chaunceyjiang left a comment

Choose a reason for hiding this comment

Uh oh!

QwertyJack commented Jan 30, 2026

Uh oh!

chaunceyjiang commented Jan 30, 2026

Uh oh!

QwertyJack commented Jan 30, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants