[docs] update model cards by stevhliu · Pull Request #45612 · huggingface/transformers

stevhliu · 2026-04-23T19:59:06Z

backfill model cards for PE family (cc @eustlb) and Qwen3.5 (cc @vasqu)

HuggingFaceDocBuilderDev · 2026-04-23T20:18:46Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

vasqu

LGTM, just some smaller details from my side 🤗 @eustlb when you have time to check the PE models

vasqu · 2026-04-30T16:56:35Z

+[Qwen3.5 MoE](https://qwen.ai/blog?id=qwen3.5) is the sparse-expert variant of Qwen3.5. It keeps the same natively multimodal decoder and 3:1 Gated DeltaNet / Gated Attention backbone, but replaces dense FFNs with a 256-expert sparse mixture — 8 routed experts are activated per token, plus 1 shared expert — so total parameters scale well past the dense checkpoints while active compute per token stays much smaller.

-[Qwen3.5 Moe](https://huggingface.co/papers/2502.13923) TODO @shuaibai @bozheng
+This family includes `Qwen/Qwen3.5-35B-A3B` (35B total / 3B active), `Qwen/Qwen3.5-122B-A10B`, and the flagship `Qwen/Qwen3.5-397B-A17B`. The text tower reuses `Qwen3NextSparseMoeBlock` and expert kernels from Qwen3-Next; the vision tower is inherited from Qwen3-VL.


Qwen 3.6 are also included iirc, would be cool if you could cross check

correct, they have the same model_type ! 👍

eustlb

thanks !! LGTM for PE models

* docs * fix * feedback * fix

stevhliu requested review from eustlb and vasqu April 23, 2026 20:21

This was referenced Apr 28, 2026

Cumulative defect fixes from recent Transformers PRs evalstate/transformers#41

Open

Cumulative feature and defect updates from recent Transformers PRs evalstate/transformers#42

Open

stevhliu mentioned this pull request Apr 29, 2026

[skills] model doc #45705

Draft

evalstate mentioned this pull request Apr 29, 2026

Cumulative defect fixes from recent Transformers PRs evalstate/transformers#43

Open

vasqu approved these changes Apr 30, 2026

View reviewed changes

massimilianoviola mentioned this pull request May 11, 2026

Fix/pe audio video bugs #45886

Merged

eustlb approved these changes May 12, 2026

View reviewed changes

stevhliu added 4 commits May 12, 2026 10:37

docs

7c3fee7

fix

a27007b

feedback

be3d72c

fix

2aac299

stevhliu force-pushed the backfill branch from 354403b to 2aac299 Compare May 12, 2026 17:39

stevhliu added this pull request to the merge queue May 12, 2026

Merged via the queue into huggingface:main with commit 7ee56fc May 12, 2026
31 checks passed

stevhliu deleted the backfill branch May 12, 2026 18:09

jp1924 pushed a commit to jp1924/transformers that referenced this pull request May 18, 2026

[docs] update model cards (huggingface#45612)

bd786c0

* docs * fix * feedback * fix

khushali9 pushed a commit to khushali9/transformers that referenced this pull request Jun 8, 2026

[docs] update model cards (huggingface#45612)

eb651e9

* docs * fix * feedback * fix

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[docs] update model cards#45612

[docs] update model cards#45612
stevhliu merged 4 commits into
huggingface:mainfrom
stevhliu:backfill

stevhliu commented Apr 23, 2026

Uh oh!

HuggingFaceDocBuilderDev commented Apr 23, 2026

Uh oh!

vasqu left a comment

Uh oh!

Uh oh!

vasqu Apr 30, 2026

Uh oh!

stevhliu Apr 30, 2026

Uh oh!

Uh oh!

Uh oh!

eustlb left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

stevhliu commented Apr 23, 2026

Uh oh!

HuggingFaceDocBuilderDev commented Apr 23, 2026

Uh oh!

vasqu left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

vasqu Apr 30, 2026

Choose a reason for hiding this comment

Uh oh!

stevhliu Apr 30, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

eustlb left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants