Adding Support for Qwen3.5 & Qwen3.5 MoE (Vision) by johnmai-dev · Pull Request #120 · ml-explore/mlx-swift-lm

johnmai-dev · 2026-02-25T14:01:04Z

Proposed changes

Add Qwen3.5 and Qwen3.5 MoE (Vision)
Skip sanitize when metadata.format == "mlx" to avoid reapplying HF-only transforms (e.g., norm + 1) that cause gibberish output. https://github.com/Blaizzy/mlx-vlm/blob/a673abf2e91c0afbb0fd53d8cf6cd7eda9bd487f/mlx_vlm/utils.py#L215

Ported

Checklist

Put an x in the boxes that apply.

I have read the CONTRIBUTING document
I have run pre-commit run --all-files to format my code / installed pre-commit prior to committing changes
I have added tests that prove my fix is effective or that my feature works
I have updated the necessary documentation (if needed)

Libraries/MLXLMCommon/Load.swift

johnmai-dev · 2026-02-25T14:03:53Z

mlx-community/Qwen3.5-27B-4bit

mlx-community/Qwen3.5-35B-A3B-4bit

tpae · 2026-02-27T13:38:48Z

Would love this to be merged soon @davidkoski 🙏

Libraries/MLXLMCommon/Load.swift

davidkoski

The new models look good but see my question on the weight loading and let me know what you think.

johnmai-dev · 2026-03-02T14:11:15Z

Qwen/Qwen3.5-0.8B

johnmai-dev · 2026-03-02T14:14:42Z

The new models look good but see my question on the weight loading and let me know what you think.

Previously, I referred to the implementation at Blaizzy/mlx-vlm -> utils.py/#L215, and at first I thought it was necessary to keep the logic consistent with mlx-vlm.

After reading your suggestion, I’ve already used loadArraysAndMetadata.

johnmai-dev · 2026-03-02T16:20:52Z

performance optimization

Before

After

ronaldmannak · 2026-03-02T16:57:56Z

Am I the only one checking the status of this PR every few hours? Can't wait @johnmai-dev :)

johnmai-dev · 2026-03-02T17:01:56Z

Am I the only one checking the status of this PR every few hours? Can't wait @johnmai-dev :)

haha,I can't wait too!This PR is ready.❤️

davidkoski

Looks great, thank you!

* Add Qwen3.5 and Qwen3.5 MoE (Vision) * Use `loadArraysAndMetadata` * performance optimization

Add Qwen3.5 and Qwen3.5 MoE (Vision)

37fee31

johnmai-dev commented Feb 25, 2026

View reviewed changes

Libraries/MLXLMCommon/Load.swift Outdated Show resolved Hide resolved

johnmai-dev changed the title ~~Add Qwen3.5 and Qwen3.5 MoE (Vision)~~ Adding Support for Qwen3.5 & Qwen3.5 MoE (Vision) Feb 25, 2026

tpae mentioned this pull request Feb 27, 2026

Support for qwen3.5_moe model type osaurus-ai/osaurus#535

Closed

2 tasks

davidkoski reviewed Feb 27, 2026

View reviewed changes

Libraries/MLXLMCommon/Load.swift Outdated Show resolved Hide resolved

davidkoski requested changes Feb 27, 2026

View reviewed changes

johnmai-dev force-pushed the qwen3_5_vision branch from 775074d to 37fee31 Compare March 2, 2026 14:36

johnmai-dev added 2 commits March 2, 2026 23:16

Use loadArraysAndMetadata

fb3408a

Merge branch 'main' into qwen3_5_vision

9a4cb85

johnmai-dev requested a review from davidkoski March 2, 2026 15:24

performance optimization

7116be2

format

15f8ceb

davidkoski approved these changes Mar 2, 2026

View reviewed changes

davidkoski merged commit 7da3344 into ml-explore:main Mar 2, 2026
2 checks passed

davidkoski mentioned this pull request Mar 2, 2026

MoE models 7.3x slower than Python mlx-lm (per-token sync + evalLock) #124

Open

viktike pushed a commit to viktike/mlx-swift-lm that referenced this pull request Mar 7, 2026

Adding Support for Qwen3.5 & Qwen3.5 MoE (Vision) (ml-explore#120)

a1b2f5d

* Add Qwen3.5 and Qwen3.5 MoE (Vision) * Use `loadArraysAndMetadata` * performance optimization

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adding Support for Qwen3.5 & Qwen3.5 MoE (Vision)#120

Adding Support for Qwen3.5 & Qwen3.5 MoE (Vision)#120
davidkoski merged 5 commits intoml-explore:mainfrom
johnmai-dev:qwen3_5_vision

johnmai-dev commented Feb 25, 2026 •

edited

Loading

Uh oh!

Uh oh!

johnmai-dev commented Feb 25, 2026

Uh oh!

tpae commented Feb 27, 2026

Uh oh!

Uh oh!

davidkoski left a comment

Uh oh!

johnmai-dev commented Mar 2, 2026

Uh oh!

johnmai-dev commented Mar 2, 2026

Uh oh!

johnmai-dev commented Mar 2, 2026

Uh oh!

ronaldmannak commented Mar 2, 2026

Uh oh!

johnmai-dev commented Mar 2, 2026

Uh oh!

davidkoski left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

johnmai-dev commented Feb 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Proposed changes

Checklist

Uh oh!

Uh oh!

johnmai-dev commented Feb 25, 2026

mlx-community/Qwen3.5-27B-4bit

mlx-community/Qwen3.5-35B-A3B-4bit

Uh oh!

tpae commented Feb 27, 2026

Uh oh!

Uh oh!

davidkoski left a comment

Choose a reason for hiding this comment

Uh oh!

johnmai-dev commented Mar 2, 2026

Qwen/Qwen3.5-0.8B

Uh oh!

johnmai-dev commented Mar 2, 2026

Uh oh!

johnmai-dev commented Mar 2, 2026

Before

After

Uh oh!

ronaldmannak commented Mar 2, 2026

Uh oh!

johnmai-dev commented Mar 2, 2026

Uh oh!

davidkoski left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

johnmai-dev commented Feb 25, 2026 •

edited

Loading