[Frontend] Support GLM-4.5 / GLM-4.7 with enable_thinking: false#31788
[Frontend] Support GLM-4.5 / GLM-4.7 with enable_thinking: false#31788DarkLight1337 merged 3 commits intovllm-project:mainfrom
Conversation
Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>
There was a problem hiding this comment.
Code Review
This pull request aims to add support for GLM-4.5/GLM-4.7 models, particularly handling the enable_thinking: false parameter. The changes involve updating reasoning parsers.
My review found a critical logic issue in holo2_reasoning_parser.py where the or operator is used incorrectly, preventing the enable_thinking: false flag from disabling the thinking feature as intended. I've suggested using and instead. Additionally, I've pointed out an outdated docstring in glm4_moe_reasoning_parser.py that needs to be updated to reflect the new class inheritance.
Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>
|
/cc @hhd52859 @zhangsongqing @athenacykes Could you help test this? |
Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>
LGTM, thanks! |
|
/cc @zRzRzRzRzRzRzR PTAL |
|
@chaunceyjiang not working well, |
|
@chaunceyjiang try working on |
|
So in effect |
@cjackal Yes. This is more of a workaround. The root cause is still an incorrect implementation of the GLM45 reasoning parser. |
@JaredforReal, could you share your client code? I think the issue is with the client-side code. |
|
@chaunceyjiang Thanks! |
|
cc @DarkLight1337 PTAL |
…m-project#31788) Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com> Signed-off-by: Anexdeus <5142168@mail.ru>
…m-project#31788) Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>
…m-project#31788) Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>
|
In streaming mode, when Thanks |
Did you test it with the latest nightly? |
I'm sorry, I tried with the nightly build from the day the pull request was closed, and it's OK now. Thank you. |
…m-project#31788) Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>
…m-project#31788) Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com> Signed-off-by: dsuhinin <suhinin.dmitriy@gmail.com>
…m-project#31788) Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>

Purpose
[Frontend] Support GLM-4.5 / GLM-4.7 with enable_thinking: false
FIX #31319
FIX #31449 (comment)
Test Plan
Test Result
Essential Elements of an Effective PR Description Checklist
supported_models.mdandexamplesfor a new model.