
[Model] Register Qwen3_5ForCausalLM and Qwen3_5MoeForCausalLM for text-only checkpoints#36289

Closed
aminsamir45 wants to merge 4 commits into vllm-project:main from aminsamir45:fix/register-qwen3-5-causal-lm

Conversation

@aminsamir45

Summary

Qwen3_5ForCausalLM and Qwen3_5MoeForCausalLM classes already exist in qwen3_5.py but are not registered in _TEXT_GENERATION_MODELS in the model registry. This means vLLM cannot load text-only Qwen3.5 checkpoints (e.g. Qwen/Qwen3.5-4B).

When a text-only checkpoint specifies architectures: ["Qwen3_5ForCausalLM"], the registry lookup fails and vLLM falls through to the VLM class Qwen3_5ForConditionalGeneration, which expects weights under language_model.model.layers.* instead of model.layers.*, causing a hard weight mismatch.
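The failure mode can be illustrated with a minimal, self-contained sketch (hypothetical key names, not vLLM's actual weight loader):

```python
# Minimal illustration of the prefix mismatch described above.
# A hypothetical subset of keys from a text-only Qwen3.5 checkpoint:
checkpoint_keys = {
    "model.layers.0.self_attn.q_proj.weight",
    "model.layers.0.mlp.gate_proj.weight",
}

# The VLM class Qwen3_5ForConditionalGeneration expects the language-model
# weights nested under a `language_model.` prefix instead:
expected_vlm_keys = {f"language_model.{k}" for k in checkpoint_keys}

# No checkpoint key matches any expected key, so loading fails outright.
missing = expected_vlm_keys - checkpoint_keys
print(f"{len(missing)} expected keys missing -> hard weight mismatch")
```

Because every expected key carries the extra prefix, the overlap between the two key sets is empty, which is why the mismatch is "hard" rather than partial.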

This completely blocks using vLLM with TRL's GRPOTrainer (colocate mode) for RL training on text-only Qwen3.5 models.

Fix

Two lines added to _TEXT_GENERATION_MODELS in registry.py:

"Qwen3_5ForCausalLM": ("qwen3_5", "Qwen3_5ForCausalLM"),
"Qwen3_5MoeForCausalLM": ("qwen3_5", "Qwen3_5MoeForCausalLM"),

The implementation classes are already fully functional — they just weren't wired into the registry.
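The registry mechanism can be sketched roughly as follows (a simplified stand-in for vLLM's actual registry, shown only to illustrate why the two missing entries cause the lookup to fail):

```python
# Simplified stand-in for vLLM's architecture registry: it maps an
# architecture name (from the checkpoint's config.json `architectures`
# field) to the (module, class name) pair that implements it.
_TEXT_GENERATION_MODELS = {
    "Qwen3_5ForCausalLM": ("qwen3_5", "Qwen3_5ForCausalLM"),
    "Qwen3_5MoeForCausalLM": ("qwen3_5", "Qwen3_5MoeForCausalLM"),
}

def resolve(architectures: list[str]) -> tuple[str, str]:
    """Return the first registered (module, class) for the given list."""
    for arch in architectures:
        if arch in _TEXT_GENERATION_MODELS:
            return _TEXT_GENERATION_MODELS[arch]
    raise ValueError(f"No registered model class for {architectures}")

print(resolve(["Qwen3_5ForCausalLM"]))
```

Without the two entries above, `resolve(["Qwen3_5ForCausalLM"])` finds nothing in this table, which is the point where vLLM falls through to the VLM class.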

Fixes #36275
Related: #36236

Test plan

  • Load a text-only Qwen3.5 checkpoint (e.g. Qwen/Qwen3.5-4B) and verify it resolves to Qwen3_5ForCausalLM instead of Qwen3_5ForConditionalGeneration
  • Verify weights load correctly at model.layers.* without the language_model. prefix mismatch
  • Verify existing VLM usage of Qwen3_5ForConditionalGeneration is unaffected

Made with Cursor

@mergify mergify bot added new-model Requests to new models qwen Related to Qwen models labels Mar 6, 2026
@aminsamir45 aminsamir45 force-pushed the fix/register-qwen3-5-causal-lm branch from d02d404 to 69c39ca Compare March 6, 2026 22:43
Contributor

@gemini-code-assist gemini-code-assist bot left a comment


Code Review

This pull request correctly registers Qwen3_5ForCausalLM and Qwen3_5MoeForCausalLM to enable loading text-only Qwen3.5 checkpoints. The change is straightforward and addresses the described issue. However, it appears to be missing a corresponding update to the test registry, which is a required step for adding new model architectures to ensure proper test coverage.

@github-actions

github-actions bot commented Mar 6, 2026

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs do not trigger a full CI run by default. Instead, only fastcheck CI runs, which starts a small, essential subset of CI tests to quickly catch errors.

You can ask your reviewers to trigger select CI tests on top of fastcheck CI.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either: Add ready label to the PR or enable auto-merge.

If you have any questions, please reach out to us on Slack at https://slack.vllm.ai.

🚀

…l registry

The Qwen3_5ForCausalLM and Qwen3_5MoeForCausalLM classes already exist
in qwen3_5.py but are not registered in the model registry, so vLLM
cannot load text-only Qwen3.5 checkpoints. When a text-only checkpoint
(e.g. Qwen3.5-4B) specifies architectures: ["Qwen3_5ForCausalLM"],
the registry lookup fails and vLLM falls through to the VLM class
Qwen3_5ForConditionalGeneration, which expects weights under
language_model.model.layers.* instead of model.layers.*, causing a
hard weight mismatch.

Changes:
- Add Qwen3_5ForCausalLM and Qwen3_5MoeForCausalLM to
  _TEXT_GENERATION_MODELS in registry.py
- Add IsHybrid mixin and get_mamba_state_dtype_from_config,
  get_mamba_state_shape_from_config, get_mamba_state_copy_func to
  Qwen3_5ForCausalLMBase so standalone text-only usage correctly
  configures the Gated DeltaNet state cache
- Add both architectures to MODELS_CONFIG_MAP in config.py so
  mamba_ssm_cache_dtype is auto-configured from the HF config
- Add test registry entries for CI coverage

Fixes vllm-project#36275

Signed-off-by: Samir Amin <aminsamir45@gmail.com>
Made-with: Cursor
@aminsamir45 aminsamir45 force-pushed the fix/register-qwen3-5-causal-lm branch from 69c39ca to efb2d60 Compare March 6, 2026 23:04
@ywang96
Member

ywang96 commented Mar 7, 2026

This means vLLM cannot load text-only Qwen3.5 checkpoints (e.g. Qwen/Qwen3.5-4B).

This is not true - afaik all Qwen3.5 checkpoints are natively multimodal: https://huggingface.co/Qwen/Qwen3.5-4B/blob/main/config.json#L3

Do you have a text-only checkpoint that's available on Hugging Face? Adding them to the list of model architectures will also require public checkpoints with Qwen3_5ForCausalLM and Qwen3_5MoeForCausalLM in their architectures field so that CI can test model loading.

Comment on lines +482 to +489
"Qwen3_5ForCausalLM": _HfExamplesInfo(
"Qwen/Qwen3.5-0.8B",
max_model_len=4096,
),
"Qwen3_5MoeForCausalLM": _HfExamplesInfo(
"Qwen/Qwen3.5-35B-A3B",
max_model_len=4096,
),
Member


Cursor is just wrong here... both models are registered as XXXForConditionalGeneration

Author


Tests have been removed

aminsamir45 and others added 2 commits March 8, 2026 14:36
The example models (Qwen/Qwen3.5-0.8B, Qwen/Qwen3.5-35B-A3B) are VLMs
that use Qwen3_5ForConditionalGeneration, not Qwen3_5ForCausalLM. The
test entries never exercised the text-only code path.

For text-only checkpoints produced by fine-tuning a Qwen3.5 VLM with
AutoModelForCausalLM, the officially supported path is to load via the
VLM class with language_model_only=True, which skips the vision pipeline
and loads only the LM backbone.

Signed-off-by: Samir Amin <aminsamir45@gmail.com>
@ywang96
Member

ywang96 commented Mar 8, 2026

There are no changes from this PR - do you still intend to keep it?

If the purpose is to provide guidance for people who run into the issue when they cannot initialize the model as a text-only model, we already have the --language-model-only flag, and it'd be great if you could enhance our documentation about it! Thanks!


Labels

new-model Requests to new models qwen Related to Qwen models


Development

Successfully merging this pull request may close these issues.

[Bug]: Qwen3.5 4b incompatibility

2 participants