Skip to content

QVAC-18717 feat[mod]: add Gemma4-E2B/E4B/31B, Qwen3.5-0.8B/2B/4B/9B, Qwen3.6-27B/35B-A3B to registry#1988

Merged
yuranich merged 7 commits into
tetherto:mainfrom
donriddo:feat/registry-gemma4-qwen35-qwen36-models
May 12, 2026
Merged

QVAC-18717 feat[mod]: add Gemma4-E2B/E4B/31B, Qwen3.5-0.8B/2B/4B/9B, Qwen3.6-27B/35B-A3B to registry#1988
yuranich merged 7 commits into
tetherto:mainfrom
donriddo:feat/registry-gemma4-qwen35-qwen36-models

Conversation

@donriddo

@donriddo donriddo commented May 11, 2026

Copy link
Copy Markdown
Contributor

What problem does this PR solve?

New multimodal models from Google (Gemma 4) and Qwen (3.5 and 3.6 series) are not yet in the registry.

How does it solve it?

Adds 36 entries to packages/registry-server/data/models.prod.json covering:

  • Gemma4-E2B-it (Q4_K_M, Q6_K, mmproj-bf16, mmproj-f16)
  • Gemma4-E4B-it (Q4_K_M, Q6_K, mmproj-bf16, mmproj-f16)
  • Gemma4-31B-it (Q4_K_M, Q6_K, mmproj-f16, mmproj-bf16)
  • Qwen3.5-0.8B (Q8_0, Q4_K_M, mmproj-F16, mmproj-BF16)
  • Qwen3.5-2B (Q4_K_M, Q6_K, mmproj-F16, mmproj-BF16)
  • Qwen3.5-4B (Q4_K_M, Q6_K, mmproj-BF16, mmproj-F16)
  • Qwen3.5-9B (Q4_K_M, Q6_K, mmproj-F16, mmproj-BF16)
  • Qwen3.6-27B (Q4_K_XL, Q6_K_XL, mmproj-BF16, mmproj-F16)
  • Qwen3.6-35B-A3B (Q4_K_M, Q6_K_XL, mmproj-BF16, mmproj-F16)

All Gemma 4 entries use licenseId: "Apache-2.0". Qwen entries use licenseId: "Apache-2.0". All use engine: "@qvac/llm-llamacpp" with HuggingFace source URLs pinned to commit SHAs (fetched via HF API). All tagged ["generation", "multimodal", "<family>", "instruct"] with the model family at tags[2] (gemma4 / qwen3.5 / qwen3.6), matching the existing multimodal convention.

How was it tested?

  • validate-json CI check passes — 715 total entries, no syntax errors, all HF source URLs pinned to commit hashes.

Breaking changes

None.

@donriddo donriddo requested review from a team as code owners May 11, 2026 19:43
@donriddo donriddo changed the title QVAC-18717 feat[mod]: add Gemma4-E2B/E4B, Qwen3.5-0.8B/2B/4B/9B, Qwen3.6-27B/35B-A3B to registry QVAC-18717 feat[mod]: add Gemma4-E2B/E4B/31B, Qwen3.5-0.8B/2B/4B/9B, Qwen3.6-27B/35B-A3B to registry May 11, 2026
@donriddo donriddo marked this pull request as draft May 11, 2026 19:49
Replace /resolve/main/ with pinned commit hashes for all 36 new
model entries — validate-json requires commit-pinned URLs.
@donriddo donriddo added verify safe-to-test Reviewer has verified safety of packages & github actions in this PR labels May 11, 2026
@donriddo donriddo marked this pull request as ready for review May 11, 2026 20:14
donriddo added 3 commits May 11, 2026 21:25
Gemma 4 (E2B, E4B, 31B) is released under Apache-2.0, not the
custom Gemma Terms of Use used by earlier Gemma generations.
Reorder tags on all 36 new entries so the model family sits at
tags[2] and "instruct" at tags[3], matching the established
multimodal convention (e.g. Qwen3-VL). The previous order put
"instruct" at tags[2], which the SDK naming pipeline treats as
the family name and would generate colliding INST_* constants.

Also correct params from "35B" to "35B-A3B" for the four
Qwen3.6-35B-A3B MoE entries to preserve the variant qualifier.
@gianni-cor

Copy link
Copy Markdown
Contributor

/review

@github-actions

github-actions Bot commented May 12, 2026

Copy link
Copy Markdown
Contributor

Tier-based Approval Status

**PR Tier:** TIER1

**Current Status:** ✅ APPROVED

**Requirements:**
- 1 Team Member approval ✅ (1/1)
- 1 Team Lead OR Management approval ✅ (2/1)



---
*This comment is automatically updated when reviews change.*

@gianni-cor

Copy link
Copy Markdown
Contributor

/review

@gianni-cor

Copy link
Copy Markdown
Contributor

/review

@yuranich yuranich merged commit d0efd56 into tetherto:main May 12, 2026
8 of 9 checks passed
Proletter pushed a commit that referenced this pull request May 24, 2026
…Qwen3.6-27B/35B-A3B to registry (#1988)

* feat[mod]: add Gemma4-E2B/E4B, Qwen3.5-0.8B/2B/4B/9B, Qwen3.6-27B/35B-A3B models to registry

* feat[mod]: add Gemma4-31B-it models to registry

* fix: pin HuggingFace source URLs to commit SHAs

Replace /resolve/main/ with pinned commit hashes for all 36 new
model entries — validate-json requires commit-pinned URLs.

* fix[mod]: correct Gemma4 licenseId from gemma to Apache-2.0

Gemma 4 (E2B, E4B, 31B) is released under Apache-2.0, not the
custom Gemma Terms of Use used by earlier Gemma generations.

* fix[mod]: fix tag order and Qwen3.6-35B-A3B params

Reorder tags on all 36 new entries so the model family sits at
tags[2] and "instruct" at tags[3], matching the established
multimodal convention (e.g. Qwen3-VL). The previous order put
"instruct" at tags[2], which the SDK naming pipeline treats as
the family name and would generate colliding INST_* constants.

Also correct params from "35B" to "35B-A3B" for the four
Qwen3.6-35B-A3B MoE entries to preserve the variant qualifier.

---------

Co-authored-by: gianni-cor <gianfranco.cordella@tether.io>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

safe-to-test Reviewer has verified safety of packages & github actions in this PR verify

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants