Skip to content

[Qwen3 VL MoE] Turn off gate quantization#25923

Closed
dsikka wants to merge 1 commit intovllm-project:mainfrom
neuralmagic:qwen3_vl_moe_gate
Closed

[Qwen3 VL MoE] Turn off gate quantization#25923
dsikka wants to merge 1 commit intovllm-project:mainfrom
neuralmagic:qwen3_vl_moe_gate

Conversation

@dsikka
Copy link
Copy Markdown
Contributor

@dsikka dsikka commented Sep 30, 2025

Summary

  • Dont quantize gate layers for Qwen3 MoE

@mergify mergify bot added the qwen Related to Qwen models label Sep 30, 2025
@jeejeelee
Copy link
Copy Markdown
Collaborator

Maybe you can refer to: #25455

@dsikka dsikka closed this Sep 30, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

qwen Related to Qwen models

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants