Skip to content

test-qvac-lib-infer-nmtcpp#43

Closed
andretetherio wants to merge 250 commits into
tetherto:mainfrom
andretetherio:andre/test-qvac-lib-infer-nmtcpp
Closed

test-qvac-lib-infer-nmtcpp#43
andretetherio wants to merge 250 commits into
tetherto:mainfrom
andretetherio:andre/test-qvac-lib-infer-nmtcpp

Conversation

@andretetherio

Copy link
Copy Markdown

No description provided.

Proletter and others added 30 commits January 6, 2026 06:00
chore(qvac-cli): testing pr-request-trigger
added pullreques:write permission to qvac-cli workflow
added qvac-lib-dl-hyperdrive trigger-reusable-lb workflow
@andretetherio andretetherio changed the title Test test-qvac-lib-infer-nmtcpp Jan 22, 2026
@github-actions

Copy link
Copy Markdown
Contributor

🧪 C++ Test Coverage Report

Coverage:

📊 Detailed Coverage
Filename                      Regions    Missed Regions     Cover   Functions  Missed Functions  Executed       Lines      Missed Lines     Cover    Branches   Missed Branches     Cover
-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
TranslationModel.cpp              142                21    85.21%          24                 3    87.50%         305                31    89.84%          56                17    69.64%
TranslationModel.hpp                1                 0   100.00%           1                 0   100.00%           1                 0   100.00%           0                 0         -
nmt.cpp                            86                23    73.26%           9                 1    88.89%         153                35    77.12%          52                11    78.85%
nmt.hpp                            51                 1    98.04%          11                 1    90.91%          55                 3    94.55%          28                 0   100.00%
nmt_beam_search.cpp               122                22    81.97%          10                 2    80.00%         260                30    88.46%          78                15    80.77%
nmt_graph_decoder.cpp             228                11    95.18%          15                 0   100.00%         620                66    89.35%         166                37    77.71%
nmt_graph_encoder.cpp              87                16    81.61%           3                 0   100.00%         332                43    87.05%          64                23    64.06%
nmt_loader.cpp                    330                73    77.88%          14                 0   100.00%         848               104    87.74%         186                66    64.52%
nmt_state_backend.cpp             199                45    77.39%          20                 0   100.00%         433                73    83.14%         122                49    59.84%
nmt_tokenization.cpp              130                39    70.00%          10                 0   100.00%         185                57    69.19%          90                44    51.11%
nmt_utils.cpp                      27                 6    77.78%           4                 0   100.00%          39                 6    84.62%          20                 9    55.00%
-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
TOTAL                            1403               257    81.68%         121                 7    94.21%        3231               448    86.13%         862               271    68.56%

@github-actions

Copy link
Copy Markdown
Contributor

❌ E2E Mobile Test Results - Android

Overall Status: FAILED
Device Farm Result: UNKNOWN
Platform: Android
Addon: @qvac/translation-nmtcpp
PR: #43
Commit: 26de2d2

Test Summary

Metric Count
Total Tests 0
✅ Passed 0
❌ Failed 0
⏭️ Skipped 0

Links


Automated E2E mobile testing powered by AWS Device Farm
Tests located in: test/mobile/

@github-actions

Copy link
Copy Markdown
Contributor

❌ E2E Mobile Test Results - iOS

Overall Status: FAILED
Device Farm Result: UNKNOWN
Platform: iOS
Addon: @qvac/translation-nmtcpp
PR: #43
Commit: 26de2d2

Test Summary

Metric Count
Total Tests 0
✅ Passed 0
❌ Failed 0
⏭️ Skipped 0

Links


Automated E2E mobile testing powered by AWS Device Farm
Tests located in: test/mobile/

@Proletter Proletter closed this Feb 1, 2026
ogad-tether added a commit to ogad-tether/qvac that referenced this pull request Jun 12, 2026
…x default 4096

Builds on the upstream kv_cache_type support
(qvac-ext-lib-whisper.cpp#43): the T3 KV cache is allocated up-front
at nCtx, and q8_0 stores it at ~27% of f32 — so the new defaults
(nCtx=4096 + kvCacheType="q8_0", ~210 MB of KV for ~160 s of audio
per synthesize() call) use HALF the memory of the previous
f32@2048 plan while doubling the usable context.

- New `kvCacheType` constructor option ('f32'|'f16'|'q8_0'), plumbed
  JS -> JSAdapter -> ChatterboxConfig -> EngineOptions.  Unknown
  values are rejected at construction (tts-cpp's own fallback would
  silently revert to f32 and change the memory profile the caller
  asked for).  kvCacheType:"f32" restores bit-exact pre-quantisation
  behaviour.
- nCtx default 2048 -> 4096 (cheaper than the old default AND longer,
  per the review suggestion to raise ctx alongside q8 KV).
- vcpkg tts-cpp pin -> 2026-06-12.  This pin is Android-safe: the
  revision removes the last direct ggml_backend_is_cpu /
  ggml_get_type_traits_cpu references from tts-cpp (the
  unresolvable-UND dlopen crash behind the 0.2.2 revert), routing
  them through the backend registry + ggml_quantize_chunk (ggml-base).

Upstream validation on real GGUFs (see tetherto#43): Turbo greedy token
sequences byte-identical across f32/f16/q8_0 on CPU and Metal; MTL
CFG can flip a near-tie argmax (same class of variation as a seed
change; whisper transcribes the q8_0 output to the exact input
text); Metal decode 20-30% faster from the KV bandwidth saving.

Tests: gtest covers the q8_0 default, explicit forwarding, the f32
escape hatch, and unknown-value rejection (42/42 against tts-cpp
2026-06-12); JS unit suite 63/63; lint clean.

Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants