Merged
16 changes: 6 additions & 10 deletions tests/models/registry.py
@@ -1005,24 +1005,20 @@ def check_available_online(
min_transformers_version="4.57",
),
"Qwen3_5ForConditionalGeneration": _HfExamplesInfo(
"Qwen/Qwen3.5-9B-Instruct",
"Qwen/Qwen3.5-0.8B",
max_model_len=4096,
min_transformers_version="5.1.0",
),
"Qwen3_5MoeForConditionalGeneration": _HfExamplesInfo(
"Qwen/Qwen3.5-35B-A3B-Instruct",
"Qwen/Qwen3.5-35B-A3B",
Contributor

critical

The model Qwen/Qwen3.5-35B-A3B does not appear to be a publicly available model on the Hugging Face Hub. This will likely cause CI failures when trying to download the model. The smallest publicly available Qwen3.5 MoE model is Qwen/Qwen3.5-32B-A2.7B. Please consider using this model instead.

Suggested change
"Qwen/Qwen3.5-35B-A3B",
"Qwen/Qwen3.5-32B-A2.7B",

Member

lol what
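
Whether a repo ID resolves on the Hub can be checked directly rather than argued. A minimal sketch, assuming `huggingface_hub` is installed and the network is reachable; `hub_repo_exists` and `find_missing_repos` are hypothetical helper names, not part of this PR or of `tests/models/registry.py`:

```python
from typing import Callable, Iterable


def hub_repo_exists(repo_id: str) -> bool:
    """Ask the Hugging Face Hub whether a model repo is visible to the caller."""
    # Imported lazily so find_missing_repos can be exercised offline with a stub.
    from huggingface_hub import HfApi

    return HfApi().repo_exists(repo_id, repo_type="model")


def find_missing_repos(
    repo_ids: Iterable[str],
    exists: Callable[[str], bool] = hub_repo_exists,
) -> list[str]:
    """Return the repo IDs for which `exists` reports False, in input order."""
    return [repo_id for repo_id in repo_ids if not exists(repo_id)]
```

Calling `find_missing_repos(["Qwen/Qwen3.5-0.8B", "Qwen/Qwen3.5-35B-A3B"])` would query the Hub for each ID. The result depends on what is published and on the caller's access at the time, so a private repo may be reported as missing.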

max_model_len=4096,
min_transformers_version="5.1.0",
),
"Qwen3_5MTP": _HfExamplesInfo(
"Qwen/Qwen3.5-9B-Instruct",
speculative_model="Qwen/Qwen3.5-9B-Instruct",
min_transformers_version="5.1.0",
"Qwen/Qwen3.5-0.8B",
speculative_model="Qwen/Qwen3.5-0.8B",
),
"Qwen3_5MoeMTP": _HfExamplesInfo(
"Qwen/Qwen3.5-35B-A3B-Instruct",
speculative_model="Qwen/Qwen3.5-35B-A3B-Instruct",
min_transformers_version="5.1.0",
"Qwen/Qwen3.5-35B-A3B",
speculative_model="Qwen/Qwen3.5-35B-A3B",
Comment on lines +1020 to +1021
Contributor

critical

As mentioned in the previous comment, Qwen/Qwen3.5-35B-A3B is not a public model. Please replace it with a valid, public model like Qwen/Qwen3.5-32B-A2.7B for both the model and the speculative model.

Suggested change
"Qwen/Qwen3.5-35B-A3B",
speculative_model="Qwen/Qwen3.5-35B-A3B",
"Qwen/Qwen3.5-32B-A2.7B",
speculative_model="Qwen/Qwen3.5-32B-A2.7B",

),
"Qwen3OmniMoeForConditionalGeneration": _HfExamplesInfo(
"Qwen/Qwen3-Omni-30B-A3B-Instruct",