[Bugfix] Fix GLM-4.6V vision regression in glm4v_moe and glm_ocr by JustinTong0323 · Pull Request #20463 · sgl-project/sglang

JustinTong0323 · 2026-03-12T16:46:23Z

Motivation

PR #20033 replaced Conv3d with Linear in Glm4vVisionPatchEmbed and added copy_conv3d_weight_to_linear() at the end of glm4v.py's load_weights(). However, glm4v_moe.py and glm_ocr.py have their own load_weights() overrides and the call was not added there. This left the Linear layer with random weights, causing the vision encoder to produce garbage embeddings — the model outputs text completely unrelated to the image content.

Fix

Add the missing self.visual.patch_embed.copy_conv3d_weight_to_linear() call at the end of load_weights() in both glm4v_moe.py and glm_ocr.py.

Test Plan

Verified with GLM-4.6V-FP8 (glm4v_moe) on B200 TP=4: vision responses now correctly describe image content
Root cause confirmed via git bisect over 625 commits (v0.5.9 → 7a1ca53)

Fixes #20462

PR sgl-project#20033 replaced Conv3d with Linear in Glm4vVisionPatchEmbed and added copy_conv3d_weight_to_linear() to glm4v.py's load_weights, but missed adding it to glm4v_moe.py and glm_ocr.py. This left the linear layer with random weights, causing the vision encoder to produce garbage embeddings — the model outputs text unrelated to the image. Fixes sgl-project#20462

gemini-code-assist · 2026-03-12T16:46:28Z

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

JustinTong0323 · 2026-03-12T16:51:51Z

/tag-and-rerun-ci

JustinTong0323 · 2026-03-12T17:05:07Z

/rerun-failed-ci

JustinTong0323 · 2026-03-12T17:56:27Z

/rerun-failed-ci

JustinTong0323 · 2026-03-12T18:57:41Z

/rerun-failed-ci

yuan-luo · 2026-03-13T02:21:31Z

/rerun-failed-ci

zRzRzRzRzRzRzR · 2026-03-13T10:00:39Z

Need to be modified to

if not is_nextn:
            self.visual.patch_embed.copy_conv3d_weight_to_linear()

Otherwise, loading MTP will fail to find visual and report an error

This modification is applicable to these two models, as well as GLM-4V

MTP loading calls load_weights with is_nextn=True, where self.visual does not exist. Wrap the call with `if not is_nextn` to avoid AttributeError.

JustinTong0323 · 2026-03-14T00:15:35Z

Thanks @zRzRzRzRzRzRzR, good catch! Fixed in 13aed3e — added if not is_nextn guard for both glm4v_moe.py and glm_ocr.py.

yuan-luo · 2026-03-14T13:16:55Z

Need to be modified to
if not is_nextn:
            self.visual.patch_embed.copy_conv3d_weight_to_linear()
Otherwise, loading MTP will fail to find visual and report an error

This modification is applicable to these two models, as well as GLM-4V

We may need to add test cases to cover this case.

…-project#20463)

github-actions bot added the run-ci label Mar 12, 2026

JustinTong0323 assigned yuan-luo Mar 12, 2026

yuan-luo approved these changes Mar 13, 2026

View reviewed changes

Guard copy_conv3d_weight_to_linear with is_nextn check

13aed3e

MTP loading calls load_weights with is_nextn=True, where self.visual does not exist. Wrap the call with `if not is_nextn` to avoid AttributeError.

Fridge003 merged commit c330b68 into sgl-project:main Mar 14, 2026
113 of 133 checks passed

yhyang201 pushed a commit to yhyang201/sglang that referenced this pull request Mar 15, 2026

[Bugfix] Fix GLM-4.6V vision regression in glm4v_moe and glm_ocr (sgl…

ab2c343

…-project#20463)

mickqian mentioned this pull request Mar 17, 2026

Revert "[Bugfix] Fix GLM-4.6V vision regression in glm4v_moe and glm_ocr" #20740

Merged

Wangzheee pushed a commit to Wangzheee/sglang that referenced this pull request Mar 21, 2026

[Bugfix] Fix GLM-4.6V vision regression in glm4v_moe and glm_ocr (sgl…

60fea17

…-project#20463)

0-693 pushed a commit to 0-693/sglang that referenced this pull request Mar 25, 2026

[Bugfix] Fix GLM-4.6V vision regression in glm4v_moe and glm_ocr (sgl…

bae6240

…-project#20463)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Bugfix] Fix GLM-4.6V vision regression in glm4v_moe and glm_ocr#20463

[Bugfix] Fix GLM-4.6V vision regression in glm4v_moe and glm_ocr#20463
Fridge003 merged 2 commits intosgl-project:mainfrom
JustinTong0323:fix/glm4v-moe-ocr-vision-regression

JustinTong0323 commented Mar 12, 2026

Uh oh!

gemini-code-assist bot commented Mar 12, 2026

Uh oh!

JustinTong0323 commented Mar 12, 2026

Uh oh!

JustinTong0323 commented Mar 12, 2026

Uh oh!

JustinTong0323 commented Mar 12, 2026

Uh oh!

JustinTong0323 commented Mar 12, 2026

Uh oh!

yuan-luo commented Mar 13, 2026

Uh oh!

zRzRzRzRzRzRzR commented Mar 13, 2026

Uh oh!

JustinTong0323 commented Mar 14, 2026

Uh oh!

Uh oh!

yuan-luo commented Mar 14, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

JustinTong0323 commented Mar 12, 2026

Motivation

Fix

Test Plan

Uh oh!

gemini-code-assist bot commented Mar 12, 2026

Uh oh!

JustinTong0323 commented Mar 12, 2026

Uh oh!

JustinTong0323 commented Mar 12, 2026

Uh oh!

JustinTong0323 commented Mar 12, 2026

Uh oh!

JustinTong0323 commented Mar 12, 2026

Uh oh!

yuan-luo commented Mar 13, 2026

Uh oh!

zRzRzRzRzRzRzR commented Mar 13, 2026

Uh oh!

JustinTong0323 commented Mar 14, 2026

Uh oh!

Uh oh!

yuan-luo commented Mar 14, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants