Skip to content

Add Qwen3.5 MoE#862

Closed
Goekdeniz-Guelmez wants to merge 8 commits intoml-explore:mainfrom
Goekdeniz-Guelmez:add-qwen3.5-moe
Closed

Add Qwen3.5 MoE#862
Goekdeniz-Guelmez wants to merge 8 commits intoml-explore:mainfrom
Goekdeniz-Guelmez:add-qwen3.5-moe

Conversation

@Goekdeniz-Guelmez
Copy link
Contributor

No description provided.

@Goekdeniz-Guelmez
Copy link
Contributor Author

looks to me that its juts Qwen3 Next with vision PR

@johnmai-dev
Copy link
Contributor

looks to me that its juts Qwen3 Next with vision PR

Although they are largely similar, there are still some subtle differences.

Qwen3_5MoeGatedDeltaNet

https://github.com/bozheng-hit/transformers/blob/be63a26077e10243d3fd8eb8cc074ce314feb04f/src/transformers/models/qwen3_5_moe/modeling_qwen3_5_moe.py#L446-L627

Qwen3NextGatedDeltaNet

https://github.com/huggingface/transformers/blob/7769f660935b5d48b73bf6711d0a78b6f8f98739/src/transformers/models/qwen3_next/modeling_qwen3_next.py#L590-L801

@Goekdeniz-Guelmez
Copy link
Contributor Author

thanks for that! will get updated.

@johnmai-dev
Copy link
Contributor

johnmai-dev commented Feb 9, 2026

thanks for that! will get updated.

You might be able to directly reference this: mlx_lm/models/qwen3_5_text.py → Qwen3_5GatedDeltaNet.

https://github.com/johnmai-dev/mlx-lm/blob/728b8713f6bf73099f04c0091ad1b180dd3231bf/mlx_lm/models/qwen3_5_text.py#L56-L159

#861

@awni
Copy link
Member

awni commented Feb 10, 2026

Closing this in favor of #869.

Thanks for adding this @Goekdeniz-Guelmez I will add you as a co-author on the other PR.

@awni awni closed this Feb 10, 2026
@Goekdeniz-Guelmez
Copy link
Contributor Author

@awni totally!

@Goekdeniz-Guelmez Goekdeniz-Guelmez deleted the add-qwen3.5-moe branch February 10, 2026 22:22
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants