feat(tts-ggml): Android dynamic ggml backends by GustavoA1604 · Pull Request #2168 · tetherto/qvac

GustavoA1604 · 2026-05-20T20:44:53Z

What problem does this PR solve?

No dynamic-backend packaging in @qvac/tts-ggml: Android builds need libqvac-speech-ggml-{vulkan,opencl,cpu-android_armv*_*.so} next to the .bare module so ggml_backend_load_all_from_path() can discover backends at runtime. The addon had no BACKENDS_SUBDIR staging, no backendsDir / openclCacheDir JS surface, and no forwarding into tts_cpp::EngineOptions — unlike @qvac/transcription-parakeet and @qvac/llm-llamacpp.
Upstream tts-cpp not consumable yet: Registry-based backend selection landed in qvac-ext-lib-whisper.cpp#29 (EngineOptions::backends_dir, Adreno tier policy, init_gpu_backend()). This package must pin tts-cpp >= 2026-05-20 and depend on qvac-registry-vcpkg#159 (ggml-speech#4 + tts-cpp port bump).
Chatterbox useGPU: true default OOMs on Android: GPU backends mirror large f16 S3Gen weights (~1 GB) on top of the mmap'd CPU copy; 8 GB devices hit lmkd SIGKILL. Default should be CPU with explicit opt-in on capable hosts.
Mobile / CI test friction: iOS bundled reference wav paths were not readable from native code; integration tests relied on local HF→GGUF conversion in CI instead of the QVAC model registry; Chatterbox mobile tests used oversized f16 GGUFs.

How does it solve it?

CMake / prebuilds (`packages/tts-ggml/CMakeLists.txt`)

find_package(ggml) for GGML_AVAILABLE_BACKENDS + loose .so glob from vcpkg lib/.
bare_target + bare_module_target → BACKENDS_SUBDIR (android-arm64/qvac__tts-ggml, …).
add_bare_module(... EXPORTS INSTALL TARGET ggml::<backend>) + install(FILES libqvac-speech-ggml-*.so) for Vulkan/OpenCL MODULE backends not exposed as IMPORTED targets.
Android 16 KB page-size link flags (-Wl,-z,max-page-size=16384) — same as parakeet (Pixel 9 class devices).
Apple compiler-rt force_load for @available → __isPlatformVersionAtLeast (iOS Metal stability on unload/reload).

JS + C++ wiring

backendsDir / openclCacheDir on constructor options and TTSGgmlRuntimeConfig; default backendsDir → path.join(__dirname, 'prebuilds').
ChatterboxModel / SupertonicModel compose backendsDir / BACKENDS_SUBDIR into opts.backends_dir; forward opencl_cache_dir.
JSAdapter reads both fields for Chatterbox and Supertonic configs.

GPU policy

useGPU defaults to false for Chatterbox (was true in 0.1.1). Opt in with config: { useGPU: true } on Metal / Vulkan / OpenCL hosts.
#ifdef __ANDROID__ in loadLocked(): forces n_gpu_layers = 0 and logs a warning if the host requested GPU — Vulkan (Mali) and OpenCL (Adreno) paths for Chatterbox/Supertonic graphs are not validated yet. Dynamic backend .so staging still matters on Android for per-arch CPU dlopen even while GPU stays off.

Tests & CI

@qvac/registry-client devDependency + downloadModel.js registry fetch (q4_0 Chatterbox T3, f16 S3Gen, Supertonic q4_0) with min/max size bands to reject stale caches.
resolveRefWavPath() — mobile global.assetPaths / Library/Caches/jfk.wav before in-bundle path (fixes iOS ModelFileNotFound).
Remove iOS workflow HF→GGUF conversion step; models fetched via registry in mobile/integration runs.
GPU smoke tests skip Android; lifecycle tests use CPU defaults.

Docs & version

README updated: CPU-by-default, useGPU table, backendsDir / openclCacheDir knobs (mirrors parakeet).
CHANGELOG 0.1.2, package.json 0.1.2.
vcpkg.json: tts-cpp >= 2026-05-20.

Breaking changes

Change	Migration
Chatterbox `useGPU` default `true` → `false`	Pass `config: { useGPU: true }` where you previously relied on the implicit GPU default (macOS Metal, CUDA desktop, etc.).
Android GPU requests ignored at addon boundary	Expected until Vulkan/Mali + OpenCL/Adreno validation completes; CPU + dynamic CPU backends still work.

Supertonic remains CPU-only at construction time when useGPU: true is passed.

How was it tested

Manual CI run here

…tion tests

github-actions · 2026-05-21T11:26:44Z

Tier-based Approval Status

**PR Tier:** TIER1

**Current Status:** ✅ APPROVED

**Requirements:**
- 1 Team Member approval ✅ (1/1)
- 1 Team Lead OR Management approval ✅ (1/1)



---
*This comment is automatically updated when reviews change.*

GustavoA1604 · 2026-05-21T11:29:42Z

/review

* Add dynamic backend loading for android and model download in integration tests * Remove gguf bundling from mobile integration test * Add missing registry-client dependency * Remove non-working GPUs * Fix failing test * Point to tetherto repo * Remove redundant comments * Update readme

GustavoA1604 added 4 commits May 20, 2026 15:18

Add dynamic backend loading for android and model download in integra…

e569060

…tion tests

Remove gguf bundling from mobile integration test

a900fe5

Add missing registry-client dependency

a4653cc

Remove non-working GPUs

587a63d

GustavoA1604 requested review from a team as code owners May 20, 2026 20:44

GustavoA1604 added 3 commits May 20, 2026 18:09

Fix failing test

5dbd4d6

Point to tetherto repo

0b83c89

Remove redundant comments

0b7eedb

GustavoA1604 had a problem deploying to release May 20, 2026 21:17 — with GitHub Actions Failure

GustavoA1604 temporarily deployed to release May 20, 2026 21:17 — with GitHub Actions Inactive

GustavoA1604 had a problem deploying to release May 20, 2026 21:17 — with GitHub Actions Error

GustavoA1604 temporarily deployed to release May 20, 2026 21:17 — with GitHub Actions Inactive

GustavoA1604 added the tier1 label May 20, 2026

Update readme

7768ada

GustavoA1604 changed the title ~~Feat/tts ggml dynamic backend~~ feat(tts-ggml): Android dynamic ggml backends May 20, 2026

Merge 'main' into 'feat/tts-ggml-dynamic-backend'

31d8c53

olyasir approved these changes May 21, 2026

View reviewed changes

pratiknarola-t approved these changes May 21, 2026

View reviewed changes

GustavoA1604 merged commit ec70978 into main May 21, 2026
20 of 22 checks passed

GustavoA1604 deleted the feat/tts-ggml-dynamic-backend branch May 21, 2026 11:30

freddy311082 mentioned this pull request May 27, 2026

QVAC-19266 infra: migrate tts-ggml desktop CI to registry-based model download #2290

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(tts-ggml): Android dynamic ggml backends#2168

feat(tts-ggml): Android dynamic ggml backends#2168
GustavoA1604 merged 9 commits into
mainfrom
feat/tts-ggml-dynamic-backend

GustavoA1604 commented May 20, 2026 •

edited

Loading

Uh oh!

github-actions Bot commented May 21, 2026 •

edited

Loading

Uh oh!

GustavoA1604 commented May 21, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

GustavoA1604 commented May 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What problem does this PR solve?

How does it solve it?

CMake / prebuilds (packages/tts-ggml/CMakeLists.txt)

JS + C++ wiring

GPU policy

Tests & CI

Docs & version

Breaking changes

How was it tested

Uh oh!

github-actions Bot commented May 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Tier-based Approval Status

Uh oh!

GustavoA1604 commented May 21, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

GustavoA1604 commented May 20, 2026 •

edited

Loading

CMake / prebuilds (`packages/tts-ggml/CMakeLists.txt`)

github-actions Bot commented May 21, 2026 •

edited

Loading