Skip to content

QVAC-14576 chore: remove Opus model entries from registry server#1680

Merged
yuranich merged 5 commits into
mainfrom
feat/QVAC-14576-deprecate-opus-registry
Apr 22, 2026
Merged

QVAC-14576 chore: remove Opus model entries from registry server#1680
yuranich merged 5 commits into
mainfrom
feat/QVAC-14576-deprecate-opus-registry

Conversation

@RamazTs

@RamazTs RamazTs commented Apr 20, 2026

Copy link
Copy Markdown
Contributor

What problem does this PR solve?

The distributed model registry (models.prod.json) still contains 32 Opus/Marian-MT model entries that have been deprecated. NOTICE attribution files reference Opus-MT/Helsinki-NLP models that are no longer served. When sync-models runs,
these entries will be auto-deprecated in the live registry.

How does it solve it?

  • Removed 32 Opus model entries from data/models.prod.json (712 → 680 entries)
    • All entries with tags opus, opus-ggml, or marian
    • All entries with S3 sources containing ggml/marian/ or ggml-opus-
  • Removed Marian attribution from NOTICE (Apache-2.0 compiled derivatives section)
  • Removed CC-BY-4.0 Opus-MT section from NOTICE
  • Removed 22 ggml-opus-* entries from client/NOTICE

Kept: All IndicTrans, Bergamot, LLM, TTS, STT, OCR entries. Also kept mariana-coelho-9 Whisper model (HuggingFace username, not Opus-related).

How was it tested?

  • JSON validated: JSON.parse() succeeds, 680 entries remain
  • Grep verified: zero opus/ggml-opus/ggml/marian references in all 3 files
  • No code changes — purely data/catalog cleanup

Remove 32 Opus/Marian-MT model entries from models.prod.json and
corresponding attribution entries from NOTICE files. This ensures
sync-models will deprecate these models in the live registry.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
@github-actions

github-actions Bot commented Apr 20, 2026

Copy link
Copy Markdown
Contributor

Tier-based Approval Status

**PR Tier:** TIER1

**Current Status:** ✅ APPROVED

**Requirements:**
- 1 Team Member approval ✅ (1/1)
- 1 Team Lead OR Management approval ✅ (1/1)



---
*This comment is automatically updated when reviews change.*

@yuranich

Copy link
Copy Markdown
Contributor

/review

@yuranich yuranich merged commit 048065a into main Apr 22, 2026
23 of 24 checks passed
@yuranich yuranich deleted the feat/QVAC-14576-deprecate-opus-registry branch April 22, 2026 08:41
opaninakuffo added a commit that referenced this pull request May 5, 2026
…eprecated marian opus (#1903)

- Bumps BERGAMOT_EN_IT and BERGAMOT_ES_EN to the base-memory variant at
  bergamot-{enit,esen}/2026-04-28/. Fixes leading "- " hallucinations on
  short inputs and en->it quality regression (registry seed updated in
  #1785, synced to DHT 2026-05-05).
- Drops 32 deprecated Marian Opus entries (NMT_Q0F16*, NMT_Q4_0*)
  auto-deprecated by #1680.
- Auto-generated by `bun update-models`.
opaninakuffo added a commit to opaninakuffo/qvac that referenced this pull request May 5, 2026
…eprecated marian opus (tetherto#1903)

- Bumps BERGAMOT_EN_IT and BERGAMOT_ES_EN to the base-memory variant at
  bergamot-{enit,esen}/2026-04-28/. Fixes leading "- " hallucinations on
  short inputs and en->it quality regression (registry seed updated in
  tetherto#1785, synced to DHT 2026-05-05).
- Drops 32 deprecated Marian Opus entries (NMT_Q0F16*, NMT_Q4_0*)
  auto-deprecated by tetherto#1680.
- Auto-generated by `bun update-models`.
tamer-hassan-tether pushed a commit that referenced this pull request May 6, 2026
…eprecated marian opus (#1903)

- Bumps BERGAMOT_EN_IT and BERGAMOT_ES_EN to the base-memory variant at
  bergamot-{enit,esen}/2026-04-28/. Fixes leading "- " hallucinations on
  short inputs and en->it quality regression (registry seed updated in
  #1785, synced to DHT 2026-05-05).
- Drops 32 deprecated Marian Opus entries (NMT_Q0F16*, NMT_Q4_0*)
  auto-deprecated by #1680.
- Auto-generated by `bun update-models`.
tamer-hassan-tether pushed a commit that referenced this pull request May 6, 2026
…eprecated marian opus (#1903)

- Bumps BERGAMOT_EN_IT and BERGAMOT_ES_EN to the base-memory variant at
  bergamot-{enit,esen}/2026-04-28/. Fixes leading "- " hallucinations on
  short inputs and en->it quality regression (registry seed updated in
  #1785, synced to DHT 2026-05-05).
- Drops 32 deprecated Marian Opus entries (NMT_Q0F16*, NMT_Q4_0*)
  auto-deprecated by #1680.
- Auto-generated by `bun update-models`.
opaninakuffo added a commit that referenced this pull request May 6, 2026
* QVAC-18420 fix[mod]: Bergamot vocab re-downloaded on every loadModel for shared-vocab pairs (#1892)

* QVAC-18420 fix[mod]: Bergamot vocab re-downloaded on every loadModel for shared-vocab pairs

- Dedup preserves registry entries referenced as companion files in any
  companion set, so shared-vocab blobs (identical sha256 under two paths) keep
  their standalone RegistryItem. This adds back 7 previously-dropped
  BERGAMOT_<pair>_VOCAB entries (FR_EN, EN_DE, EN_CS, ET_EN, FI_EN, PL_EN, PT_EN)
  and restores correct expectedSize/sha lookups for the Bergamot plugin's
  separate vocab resolve.
- For registry:// Bergamot loads with auto-derived vocabs (non-pivot and pivot),
  skip the separate per-vocab resolveModelPath entirely. The companion-set
  download already colocates vocabs under sets/<setKey>/ and createModel
  derives those paths via deriveColocatedBergamotVocabPaths — no redundant
  flat-cache download, no dedup-hole exposure.

* QVAC-18420 test: cover Bergamot vocab resolver + extract pure helpers

Reviewer ask on PR #1892: lock the Fix-C registry-source vocab optimization
behind unit tests so the contract doesn't drift.

- Extract resolveBergamotVocab and its derivation helpers into
  resolve-vocab.ts. The plugin module imports @qvac/translation-nmtcpp at
  module-load time (Bare-only native addon), so the helpers were not
  importable from a bun-driven test runner. The new module has zero native
  deps; plugin.ts re-imports from it.
- Add test/unit/nmtcpp-resolve-vocab.test.ts covering:
  - registry:// + auto-derived vocabs (non-pivot) → ctx.resolveModelPath
    is never called and no vocab artifacts are emitted
  - registry:// + auto-derived vocabs (pivot) → exactly one resolveModelPath
    call (the pivot model itself), no vocab artifacts
  - registry:// + user-supplied srcVocabSrc OR dstVocabSrc → optimization
    is skipped and full per-vocab resolution runs (sanity: keeps the
    optimization narrow)
  - pear:// source still resolves vocabs explicitly (companion-set
    semantics don't apply)
  - mixed registry/pear pivot triggers full resolution path
  - unsupported sources raise ModelLoadFailedError
  - pure derivation helpers (buildBergamotVocabSources,
    deriveBergamotVocabSources, deriveBergamotRegistryVocabSources) for
    both shared-vocab and CJK split-vocab pairs

---------

Co-authored-by: Opanin Akuffo <46673050+opaninakuffo@users.noreply.github.com>

* QVAC-17324 feat[api]: add harmony tool-call dialect (gpt-oss) (#1878)

* chore: bump @qvac/llm-llamacpp to 0.17.2 for harmony EOG fix
- Picks up #1812 which stops the addon from suppressing
  the `<|call|>` end-of-generation token, unblocking harmony tool-call
  parsing on the SDK side.

* feat: expose toolDialect override and add harmony to the dialect enum

* feat: add harmony tool-call parser for gpt-oss models

* feat: stream harmony frames and decouple dialect from tool-call activation

* test: add harmony unit coverage for parser and normalizer

* chore: consolidate tool examples under examples/tools/

* doc: clarify emitThinking capture-gating contract

* chore: remove conflict markers

* feat[mod]: sync sdk model registry to bergamot base-memory and drop deprecated marian opus (#1903)

- Bumps BERGAMOT_EN_IT and BERGAMOT_ES_EN to the base-memory variant at
  bergamot-{enit,esen}/2026-04-28/. Fixes leading "- " hallucinations on
  short inputs and en->it quality regression (registry seed updated in
  #1785, synced to DHT 2026-05-05).
- Drops 32 deprecated Marian Opus entries (NMT_Q0F16*, NMT_Q4_0*)
  auto-deprecated by #1680.
- Auto-generated by `bun update-models`.

* chore[skiplog]: release sdk 0.10.1

---------

Co-authored-by: Simon Iribarren <simon.ig13@gmail.com>
Proletter pushed a commit that referenced this pull request May 24, 2026
Remove 32 Opus/Marian-MT model entries from models.prod.json and
corresponding attribution entries from NOTICE files. This ensures
sync-models will deprecate these models in the live registry.

Co-authored-by: Ramaz Tskhadadze <bubu@Ramazs-MacBook-Pro-2.local>
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Co-authored-by: alsrivas <40749307+Alok-Ranjan23@users.noreply.github.com>
Co-authored-by: Yury Samarin <yuri.a.samarin@gmail.com>
Proletter pushed a commit that referenced this pull request May 24, 2026
…eprecated marian opus (#1903)

- Bumps BERGAMOT_EN_IT and BERGAMOT_ES_EN to the base-memory variant at
  bergamot-{enit,esen}/2026-04-28/. Fixes leading "- " hallucinations on
  short inputs and en->it quality regression (registry seed updated in
  #1785, synced to DHT 2026-05-05).
- Drops 32 deprecated Marian Opus entries (NMT_Q0F16*, NMT_Q4_0*)
  auto-deprecated by #1680.
- Auto-generated by `bun update-models`.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants