QVAC-18717 feat[mod]: add Gemma4-E2B/E4B/31B, Qwen3.5-0.8B/2B/4B/9B, Qwen3.6-27B/35B-A3B to registry by donriddo · Pull Request #1988 · tetherto/qvac

donriddo · 2026-05-11T19:43:20Z

What problem does this PR solve?

New multimodal models from Google (Gemma 4) and Qwen (3.5 and 3.6 series) are not yet in the registry.

How does it solve it?

Adds 36 entries to packages/registry-server/data/models.prod.json covering:

Gemma4-E2B-it (Q4_K_M, Q6_K, mmproj-bf16, mmproj-f16)
Gemma4-E4B-it (Q4_K_M, Q6_K, mmproj-bf16, mmproj-f16)
Gemma4-31B-it (Q4_K_M, Q6_K, mmproj-f16, mmproj-bf16)
Qwen3.5-0.8B (Q8_0, Q4_K_M, mmproj-F16, mmproj-BF16)
Qwen3.5-2B (Q4_K_M, Q6_K, mmproj-F16, mmproj-BF16)
Qwen3.5-4B (Q4_K_M, Q6_K, mmproj-BF16, mmproj-F16)
Qwen3.5-9B (Q4_K_M, Q6_K, mmproj-F16, mmproj-BF16)
Qwen3.6-27B (Q4_K_XL, Q6_K_XL, mmproj-BF16, mmproj-F16)
Qwen3.6-35B-A3B (Q4_K_M, Q6_K_XL, mmproj-BF16, mmproj-F16)

All Gemma 4 entries use licenseId: "Apache-2.0". Qwen entries use licenseId: "Apache-2.0". All use engine: "@qvac/llm-llamacpp" with HuggingFace source URLs pinned to commit SHAs (fetched via HF API). All tagged ["generation", "multimodal", "<family>", "instruct"] with the model family at tags[2] (gemma4 / qwen3.5 / qwen3.6), matching the existing multimodal convention.

How was it tested?

validate-json CI check passes — 715 total entries, no syntax errors, all HF source URLs pinned to commit hashes.

Breaking changes

None.

…-A3B models to registry

Replace /resolve/main/ with pinned commit hashes for all 36 new model entries — validate-json requires commit-pinned URLs.

Gemma 4 (E2B, E4B, 31B) is released under Apache-2.0, not the custom Gemma Terms of Use used by earlier Gemma generations.

Reorder tags on all 36 new entries so the model family sits at tags[2] and "instruct" at tags[3], matching the established multimodal convention (e.g. Qwen3-VL). The previous order put "instruct" at tags[2], which the SDK naming pipeline treats as the family name and would generate colliding INST_* constants. Also correct params from "35B" to "35B-A3B" for the four Qwen3.6-35B-A3B MoE entries to preserve the variant qualifier.

gianni-cor · 2026-05-12T10:55:22Z

/review

github-actions · 2026-05-12T10:55:57Z

Tier-based Approval Status

**PR Tier:** TIER1

**Current Status:** ✅ APPROVED

**Requirements:**
- 1 Team Member approval ✅ (1/1)
- 1 Team Lead OR Management approval ✅ (2/1)



---
*This comment is automatically updated when reviews change.*

gianni-cor · 2026-05-12T10:57:15Z

/review

gianni-cor · 2026-05-12T11:02:35Z

/review

…Qwen3.6-27B/35B-A3B to registry (#1988) * feat[mod]: add Gemma4-E2B/E4B, Qwen3.5-0.8B/2B/4B/9B, Qwen3.6-27B/35B-A3B models to registry * feat[mod]: add Gemma4-31B-it models to registry * fix: pin HuggingFace source URLs to commit SHAs Replace /resolve/main/ with pinned commit hashes for all 36 new model entries — validate-json requires commit-pinned URLs. * fix[mod]: correct Gemma4 licenseId from gemma to Apache-2.0 Gemma 4 (E2B, E4B, 31B) is released under Apache-2.0, not the custom Gemma Terms of Use used by earlier Gemma generations. * fix[mod]: fix tag order and Qwen3.6-35B-A3B params Reorder tags on all 36 new entries so the model family sits at tags[2] and "instruct" at tags[3], matching the established multimodal convention (e.g. Qwen3-VL). The previous order put "instruct" at tags[2], which the SDK naming pipeline treats as the family name and would generate colliding INST_* constants. Also correct params from "35B" to "35B-A3B" for the four Qwen3.6-35B-A3B MoE entries to preserve the variant qualifier. --------- Co-authored-by: gianni-cor <gianfranco.cordella@tether.io>

feat[mod]: add Gemma4-E2B/E4B, Qwen3.5-0.8B/2B/4B/9B, Qwen3.6-27B/35B…

3b77a71

…-A3B models to registry

donriddo requested review from a team as code owners May 11, 2026 19:43

feat[mod]: add Gemma4-31B-it models to registry

1e6932b

donriddo changed the title ~~QVAC-18717 feat[mod]: add Gemma4-E2B/E4B, Qwen3.5-0.8B/2B/4B/9B, Qwen3.6-27B/35B-A3B to registry~~ QVAC-18717 feat[mod]: add Gemma4-E2B/E4B/31B, Qwen3.5-0.8B/2B/4B/9B, Qwen3.6-27B/35B-A3B to registry May 11, 2026

donriddo marked this pull request as draft May 11, 2026 19:49

fix: pin HuggingFace source URLs to commit SHAs

58fc488

Replace /resolve/main/ with pinned commit hashes for all 36 new model entries — validate-json requires commit-pinned URLs.

donriddo added verify safe-to-test Reviewer has verified safety of packages & github actions in this PR labels May 11, 2026

donriddo marked this pull request as ready for review May 11, 2026 20:14

donriddo added 3 commits May 11, 2026 21:25

fix[mod]: correct Gemma4 licenseId from gemma to Apache-2.0

cb3cd4a

Gemma 4 (E2B, E4B, 31B) is released under Apache-2.0, not the custom Gemma Terms of Use used by earlier Gemma generations.

Merge branch 'main' into feat/registry-gemma4-qwen35-qwen36-models

6f9581d

yuranich approved these changes May 12, 2026

View reviewed changes

gianni-cor approved these changes May 12, 2026

View reviewed changes

Merge branch 'main' into feat/registry-gemma4-qwen35-qwen36-models

9fb45e3

maxim-smotrov approved these changes May 12, 2026

View reviewed changes

yuranich merged commit d0efd56 into tetherto:main May 12, 2026
8 of 9 checks passed

yuranich mentioned this pull request May 13, 2026

fix(registry-server): harden HF downloads against socket drops #2026

Merged

This was referenced May 17, 2026

QVAC-18717 feat[mod]: update SDK registry with Gemma4-E2B/E4B/31B, Qwen3.5-0.8B/2B/4B/9B, Qwen3.6-27B/35B-A3B #2092

Closed

QVAC-18873 feat[api|mod]: expose diffusion_fa, drop flux_flow, sync model registry #2046

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

QVAC-18717 feat[mod]: add Gemma4-E2B/E4B/31B, Qwen3.5-0.8B/2B/4B/9B, Qwen3.6-27B/35B-A3B to registry#1988

QVAC-18717 feat[mod]: add Gemma4-E2B/E4B/31B, Qwen3.5-0.8B/2B/4B/9B, Qwen3.6-27B/35B-A3B to registry#1988
yuranich merged 7 commits into
tetherto:mainfrom
donriddo:feat/registry-gemma4-qwen35-qwen36-models

donriddo commented May 11, 2026 •

edited

Loading

Uh oh!

gianni-cor commented May 12, 2026

Uh oh!

github-actions Bot commented May 12, 2026 •

edited

Loading

Uh oh!

gianni-cor commented May 12, 2026

Uh oh!

gianni-cor commented May 12, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

donriddo commented May 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What problem does this PR solve?

How does it solve it?

How was it tested?

Breaking changes

Uh oh!

gianni-cor commented May 12, 2026

Uh oh!

github-actions Bot commented May 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Tier-based Approval Status

Uh oh!

gianni-cor commented May 12, 2026

Uh oh!

gianni-cor commented May 12, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

donriddo commented May 11, 2026 •

edited

Loading

github-actions Bot commented May 12, 2026 •

edited

Loading