[Qwen3.5] Fix missing quant_config in Qwen3VL #19291

Merged
Kangyan-Zhou merged 4 commits into sgl-project:main from mmangkad-dev:qwen35-kv-fix
Mar 2, 2026
@mmangkad
Contributor

Motivation

Fix a missing quant_config in Qwen3VL, which caused the NVFP4 versions of Qwen3.5 to use a bf16 KV cache instead of fp8.
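
For readers unfamiliar with this class of bug, here is a minimal sketch of how a dropped quant_config can silently change the KV cache dtype. It is not the diff from this PR and does not use sglang's API: `ToyQuantConfig`, `WrapperWithoutConfig`, `WrapperWithConfig`, and `resolve_kv_cache_dtype` are hypothetical names, and the sketch assumes a runtime that picks the KV cache dtype by inspecting a `quant_config` attribute on the model and falls back to a default when none is found.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ToyQuantConfig:
    """Hypothetical stand-in for a checkpoint's quantization settings."""
    kv_cache_dtype: Optional[str] = None  # e.g. "fp8"


class WrapperWithoutConfig:
    """Hypothetical model wrapper that accepts quant_config but never stores it."""

    def __init__(self, quant_config: Optional[ToyQuantConfig] = None):
        pass  # quant_config is dropped here, so the runtime cannot see it


class WrapperWithConfig:
    """Same hypothetical wrapper, but it keeps quant_config on the instance."""

    def __init__(self, quant_config: Optional[ToyQuantConfig] = None):
        self.quant_config = quant_config


def resolve_kv_cache_dtype(model: object, default: str = "bf16") -> str:
    """Assumed lookup: use the dtype from model.quant_config, else the default."""
    quant_config = getattr(model, "quant_config", None)
    if quant_config is not None and quant_config.kv_cache_dtype is not None:
        return quant_config.kv_cache_dtype
    return default


cfg = ToyQuantConfig(kv_cache_dtype="fp8")
print(resolve_kv_cache_dtype(WrapperWithoutConfig(cfg)))  # falls back to the default
print(resolve_kv_cache_dtype(WrapperWithConfig(cfg)))     # picks up the configured dtype
```

In this toy setup no error is raised when the config is dropped, which is why a regression of this kind tends to show up as higher memory use rather than as a crash.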

Modifications

Accuracy Tests

Benchmarking and Profiling

Checklist

Review Process

  1. Ping Merge Oncalls to start the PR flow. See the PR Merge Process.
  2. Get approvals from CODEOWNERS and other reviewers.
  3. Trigger CI tests with comments or contact authorized users to do so.
    • /tag-run-ci-label, /rerun-failed-ci, /tag-and-rerun-ci
  4. After green CI and required approvals, ask Merge Oncalls to merge.

@gemini-code-assist
Contributor

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

@mmangkad
Contributor Author

cc @mickqian

@Kangyan-Zhou Kangyan-Zhou merged commit 3f9fc8b into sgl-project:main Mar 2, 2026
53 of 61 checks passed
@mmangkad mmangkad deleted the qwen35-kv-fix branch March 3, 2026 03:18
Kangyan-Zhou pushed a commit to Kangyan-Zhou/sglang that referenced this pull request Mar 4, 2026
magicYang1573 pushed a commit to magicYang1573/sglang that referenced this pull request Mar 9, 2026
Wangzheee pushed a commit to Wangzheee/sglang that referenced this pull request Mar 21, 2026
JustinTong0323 pushed a commit to JustinTong0323/sglang that referenced this pull request Apr 7, 2026