[GLM-OCR] GLM-OCR with MTP Support#33005
Conversation
Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>
Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>
Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>
Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>
Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>
Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>
Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>
Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>
Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>
Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>
Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>
Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>
Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>
|
Documentation preview: https://vllm--33005.org.readthedocs.build/en/33005/ |
There was a problem hiding this comment.
Code Review
This pull request introduces support for the GLM-OCR model and its Multi-Token Prediction (MTP) variant. The changes are comprehensive, including documentation updates, new model implementation files (glm_ocr.py, glm_ocr_mtp.py), example usage, and extensive test configurations. The implementation correctly follows the patterns for adding new models within the vLLM framework, inheriting and adapting from the existing GLM-4 series models where appropriate. The MTP support is also consistent with existing implementations. After a thorough review, I found no critical or high-severity issues. The code appears to be well-structured and correct.
There was a problem hiding this comment.
Cursor Bugbot has reviewed your changes and found 1 potential issue.
Bugbot Autofix is OFF. To automatically fix reported issues with Cloud Agents, enable Autofix in the Cursor dashboard.
Comment @cursor review or bugbot run to trigger another review on this PR
Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>
Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>
|
update, please check. |
Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com> Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn> Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn>
A dense model using the GLM-4-0414 architecture with bias, featuring a completely new VIT structure and MTP implementation.