Skip to content

[GLM-OCR] GLM-OCR with MTP Support#33005

Merged
vllm-bot merged 25 commits intovllm-project:mainfrom
zRzRzRzRzRzRzR:glm
Jan 26, 2026
Merged

[GLM-OCR] GLM-OCR with MTP Support#33005
vllm-bot merged 25 commits intovllm-project:mainfrom
zRzRzRzRzRzRzR:glm

Conversation

@zRzRzRzRzRzRzR
Copy link
Copy Markdown
Contributor

@zRzRzRzRzRzRzR zRzRzRzRzRzRzR commented Jan 24, 2026

A dense model using the GLM-4-0414 architecture with bias, featuring a completely new VIT structure and MTP implementation.

Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>
Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>
Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>
Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>
Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>
Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>
Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>
Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>
Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>
Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>
Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>
Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>
Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>
Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>
Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>
Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>
Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>
@mergify
Copy link
Copy Markdown
Contributor

mergify Bot commented Jan 24, 2026

Documentation preview: https://vllm--33005.org.readthedocs.build/en/33005/

@mergify mergify Bot added documentation Improvements or additions to documentation multi-modality Related to multi-modality (#4194) new-model Requests to new models speculative-decoding v1 labels Jan 24, 2026
Copy link
Copy Markdown
Contributor

@gemini-code-assist gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request introduces support for the GLM-OCR model and its Multi-Token Prediction (MTP) variant. The changes are comprehensive, including documentation updates, new model implementation files (glm_ocr.py, glm_ocr_mtp.py), example usage, and extensive test configurations. The implementation correctly follows the patterns for adding new models within the vLLM framework, inheriting and adapting from the existing GLM-4 series models where appropriate. The MTP support is also consistent with existing implementations. After a thorough review, I found no critical or high-severity issues. The code appears to be well-structured and correct.

Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>
Copy link
Copy Markdown

@cursor cursor Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Cursor Bugbot has reviewed your changes and found 1 potential issue.

Bugbot Autofix is OFF. To automatically fix reported issues with Cloud Agents, enable Autofix in the Cursor dashboard.

Comment @cursor review or bugbot run to trigger another review on this PR

Comment thread tests/models/registry.py
Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>
Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>
Comment thread vllm/model_executor/models/glm4_1v.py Outdated
Comment thread vllm/model_executor/models/glm_ocr.py Outdated
Comment thread vllm/model_executor/models/glm_ocr.py
Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>
Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>
@zRzRzRzRzRzRzR
Copy link
Copy Markdown
Contributor Author

update, please check.

Copy link
Copy Markdown
Member

@Isotr0py Isotr0py left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM now

@Isotr0py Isotr0py added the ready ONLY add when PR is ready to merge/full CI is needed label Jan 26, 2026
Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
@Isotr0py Isotr0py enabled auto-merge (squash) January 26, 2026 12:28
@vllm-bot vllm-bot merged commit bb17e8f into vllm-project:main Jan 26, 2026
55 of 57 checks passed
apd10 pushed a commit to apd10/vllm that referenced this pull request Jan 31, 2026
Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>
Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

documentation Improvements or additions to documentation multi-modality Related to multi-modality (#4194) new-model Requests to new models ready ONLY add when PR is ready to merge/full CI is needed speculative-decoding v1

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants