QVAC-18717 feat[api]: add Qwen3.5, Gemma4 tool-call dialects and reasoning_budget param by donriddo · Pull Request #1974 · tetherto/qvac

donriddo · 2026-05-11T13:55:18Z

🎯 What problem does this PR solve?

The SDK had no support for Qwen3.5/Qwen3.6 or Gemma4 tool-call output formats, so calling tools with those models produced no parsed tool calls.
@qvac/llm-llamacpp@0.20.0 (llamacpp 8189+) broke all model loads: system_prompt from LlmConfig was forwarded to the C++ arg parser as --system-prompt, which was removed in that release.
No way to pass the reasoning_budget parameter introduced in @qvac/llm-llamacpp@0.20.0.

📝 How does it solve it?

Adds Qwen3.5 Pythonic-XML parser: <tool_call><function=NAME><parameter=KEY>VALUE</parameter></function></tool_call>. String values are raw text; arrays/objects are JSON-parsed; integers reject non-integer floats. Errors surface as PARSE_ERROR (matches hermes/pythonic pattern).
Adds Gemma4 native parser: <|tool_call>call:NAME{key:<|"|>val<|"|>,...}<tool_call|>. Splits on <|"|> delimiter, quotes bare keys only in structural parts so , key: patterns inside string values are never misquoted as object keys.
Wires both parsers into the dialect dispatch and the default catch-all chain in parser.ts.
Adds dialect specs to completion-normalizer.ts: qwen35 reuses <tool_call> framing; gemma4 uses asymmetric <|tool_call>/<tool_call|> + thinking channel frames.
Auto-detects qwen35/gemma4 from model name/path in dialect.ts with guards against Q4_K_M/5b quantization/size suffix collision and Qwen3 5B parameter-count collision.
Adds reasoning_budget: -1 | 0 to LlmConfig (load-time) and GenerationParams (per-request). Passes through transformLlmConfig unchanged.
Exposes reasoning_budget as boolean in the CLI SDKGenerationParams interface (true → -1, false → 0); extractGenerationParams parses it from the request body.
Fixes system_prompt being forwarded to the C++ arg parser: system_prompt is JS-only (used by completion-stream.ts to seed conversation history). It is now excluded from transformLlmConfig alongside modelType.
Adds completion-reasoning-budget-disabled and completion-reasoning-budget-unrestricted to tests-qvac.
Adds tool-calling examples for qwen35 and gemma4 under examples/tools/.
Wires toolDialect and resourceKey through ToolsExecutor and createToolsTest so dialect-specific e2e tests can be added once model constants are available.
Bumps @qvac/llm-llamacpp to ^0.20.0.

🧪 How was it tested?

Unit tests: 75/75 pass in tool-parser.test.ts — includes regression tests for integer rejection, array/object PARSE_ERROR propagation, and all dialect negative-case coverage.
Security tests: 7/7 pass.
CLI tests: 112/112 pass in translate.test.ts — includes reasoning_budget boolean extraction tests.
Tests-qvac: completion-reasoning-budget-disabled and completion-reasoning-budget-unrestricted added to e2e suite.
Examples: llamacpp-tools-qwen35.ts and llamacpp-tools-gemma4.ts verified locally with Bare runtime.

🔌 API Changes

Qwen3.5 / Qwen3.6 — dialect auto-detected from model name/path:

import { loadModel, completion } from "@qvac/sdk";

const modelId = await loadModel({
  modelSrc: "/models/Qwen3.5-7B-Instruct-Q4_K_M.gguf",
  modelType: "llm",
  modelConfig: { ctx_size: 4096, tools: true },
});

const run = completion({
  modelId,
  history: [{ role: "user", content: "What's the weather in Paris?" }],
  tools: [weatherTool],
  // toolDialect: "qwen35" — auto-detected; override only if needed
});

Gemma4 — dialect auto-detected from model name/path:

const modelId = await loadModel({
  modelSrc: "/models/gemma-4-9b-it-Q4_K_M.gguf",
  modelType: "llm",
  modelConfig: { ctx_size: 4096, tools: true },
});

const run = completion({
  modelId,
  history: [{ role: "user", content: "What's the weather in Paris?" }],
  tools: [weatherTool],
  // toolDialect: "gemma4" — auto-detected; override only if needed
});

reasoning_budget — load-time default and per-request override:

// -1 = unrestricted thinking, 0 = disabled
const modelId = await loadModel({
  modelSrc: "/models/Qwen3.5-7B-Instruct-Q4_K_M.gguf",
  modelType: "llm",
  modelConfig: { ctx_size: 4096, reasoning_budget: -1 },
});

const run = completion({
  modelId,
  history: [{ role: "user", content: "Think step by step." }],
  generationParams: { reasoning_budget: 0 }, // override per-request
});

…t param - Extend toolDialectSchema with 'qwen35' and 'gemma4' values - Add Qwen3.5 Pythonic-XML parser (qwen35.ts): <tool_call><function=NAME> <parameter=KEY>VALUE</parameter></function></tool_call>; string values are raw text, arrays/objects are JSON; type coercion from tool schema - Add Gemma4 native parser (gemma4native.ts): <|tool_call>call:NAME{...}<tool_call|>; JS-literal args with <|"|> quote tokens, split-then-transliterate approach to safely quote bare keys without corrupting string values containing ', key:' - Wire both parsers into parser.ts dispatch and the default catch-all chain - Add dialect specs to completion-normalizer.ts: qwen35 reuses <tool_call> framing; gemma4 has asymmetric <|tool_call>/<tool_call|> + thinking frames - Auto-detect qwen35/gemma4 from model name/path in dialect.ts with guards against Gemma3+Q4 quant suffix and Qwen3 5B parameter-count collisions - Add reasoning_budget (-1 | 0) to LlmConfig (load-time) and GenerationParams (per-request); passes through transformLlmConfig unchanged (snake_case key bypasses camelCase regex, number-to-string conversion handles the value) - Mirror reasoning_budget in CLI SDKGenerationParams type - Add tests-qvac completion tests for reasoning_budget passthrough - Add tool-calling examples for qwen35 and gemma4 in examples/tools/ - Bump @qvac/llm-llamacpp to ^0.20.0 (adds reasoning_budget and new model support shipped in fabric-8189)

…udget to completion-executor llamacpp 8189+ (in @qvac/llm-llamacpp@0.20.0) removed --system-prompt from its CLI argument parser. The SDK was forwarding system_prompt through transformLlmConfig causing all model loads to fail with 'invalid argument: --system-prompt'. system_prompt is JS-only: completion-stream.ts reads it to seed the conversation history. It has no meaning at the C++ level and must be excluded alongside modelType. Also mirrors reasoning_budget in completion-executor.ts GenerationParams so the new tests-qvac reasoning_budget tests type-check correctly.

…on tests - Drop the over-broad qwen.*3\.5 alternative from the qwen35 regex and tighten the lookahead to (?![a-z0-9]) so qwen3-50b-instruct no longer false-matches as qwen35 - Tighten gemma4 lookahead to (?=[^a-z0-9]|$) so gemma-40b no longer false-matches as gemma4 - Extract transformLlmConfig to transform.ts (no addon imports) so it can be unit-tested without the native addon loading - Add llm-plugin-transform.test.ts pinning that system_prompt and modelType are never forwarded to C++ and that reasoning_budget survives - Add negative test cases for qwen3-50b and gemma-40b to tool-parser.test.ts - Fix stale default-chain comment in parser.ts (was 'Harmony first', actual order is Gemma4 first) - Add inline justification for qwen35/gemma4 fallback asymmetry

donriddo · 2026-05-12T15:03:57Z

/review

gianni-cor · 2026-05-12T16:06:29Z

/review

gianni-cor · 2026-05-12T17:41:35Z

/review

Bump @qvac/cli to 0.4.0 and add the v0.4.0 changelog set. Includes all 5 cli-scoped PRs landed on release-cli-0.4.0 since cli-v0.3.0: - QVAC-18677 feat[api]: qvac verify deps (#1969) - QVAC-18717 feat[api]: Qwen3.5 / Gemma4 tool-call dialects + reasoning_budget (#1974) - QVAC-18678 feat[api]: qvac verify bundle (#1984) - QVAC-18730 feat[api]: POST /v1/images/generations on qvac serve (#2008) - chore: consolidate PR templates and hide style note in HTML comment (#1924) PR #1924's title lacked a ticket or [notask], so the changelog generator's strict validator dropped it. It is added manually under the Chores section to keep the changelog truthful to what shipped on release-cli-0.4.0.

Bump @qvac/cli to 0.4.0 and add the v0.4.0 changelog set. Includes all 5 cli-scoped PRs landed on release-cli-0.4.0 since cli-v0.3.0: - QVAC-18677 feat[api]: qvac verify deps (#1969) - QVAC-18717 feat[api]: Qwen3.5 / Gemma4 tool-call dialects + reasoning_budget (#1974) - QVAC-18678 feat[api]: qvac verify bundle (#1984) - QVAC-18730 feat[api]: POST /v1/images/generations on qvac serve (#2008) - chore: consolidate PR templates and hide style note in HTML comment (#1924) PR #1924's title lacked a ticket or [notask], so the changelog generator's strict validator dropped it. It is added manually under the Chores section to keep the changelog truthful to what shipped on release-cli-0.4.0. (cherry picked from commit 22462c8)

…35/gemma4 dialect E2E tests Examples llamacpp-tools-qwen35 and llamacpp-tools-gemma4 were using raw HuggingFace URLs as fallback defaults because the registry had not yet been seeded with Qwen3.5 and Gemma4 models. Now that those constants exist (QWEN3_5_0_8B_MULTIMODAL_Q8_0, GEMMA4_2B_MULTIMODAL_Q4_K_M), use them directly, matching the pattern of all other SDK examples. Adds tools-qwen35 and tools-gemma4 resources to the desktop consumer and two dialect-specific E2E tests (tools-simple-function-qwen35, tools-simple-function-gemma4). PR tetherto#1974 wired toolDialect and resourceKey through ToolsExecutor and createToolsTest specifically to enable these tests once constants were available.

…odel registry (#2046) * feat(sdk): expose diffusion_fa in sdcppConfigSchema Adds diffusion_fa to sdcppConfigSchema so callers can explicitly control per-transformer flash attention. The addon enables this by default (required for FLUX.2 to avoid materialising the full Q·Kᵀ attention matrix); the field is a no-op escape hatch for backends that don't support ggml_flash_attn_ext. The plugin's ...rest spread already forwards it to the native layer; no plugin changes required. * fix(sdk): remove flux_flow from prediction enum flux_flow (FLUX.1) was never a supported model family — only flux2_flow (FLUX.2) is. Remove the stale enum value so the SDK schema matches the diffusion addon surface. * fix(sdk): simplify diffusion_fa description and add unit test coverage Shorten the describe() string to match the terse style of adjacent boolean fields. Add diffusion_fa to the "accepts valid full config" fixture in sdcpp-plugin.test.ts so the field has schema-parse coverage. * test(sdk): add diffusion_fa E2E test to tests-qvac Adds a dedicated 'diffusion-fa' resource in the desktop consumer loaded with diffusion_fa: true, a matching executor method that calls ensureLoaded('diffusion-fa'), and a test definition 'diffusion-fa-accepted' that generates a 256x256 image through the full SDK -> plugin -> addon path, confirming the field is accepted and forwarded without breaking inference. * test(sdk): remove misleading comment from diffusion-fa resource * test(sdk): add rejection tests for flux_flow and diffusion_fa type; fix E2E test name and remove redundant preload Add two missing schema rejection tests: non-boolean diffusion_fa and the removed flux_flow prediction value. Rename diffusion-fa-accepted to diffusion-fa-loads-and-runs to match what the test actually verifies (load + generate, not FA effect). Remove preLoadUnload from diffusion-fa resource — it reuses the same Flux2 model files as the diffusion resource, so the extra load+unload at bootstrap is redundant cost. * feat[mod](sdk): add Gemma4-E2B/E4B/31B, Qwen3.5-0.8B/2B/4B/9B, Qwen3.6-27B/35B-A3B to SDK registry * fix[notask]: bump @qvac/diffusion-cpp to ^0.8.0 * test(sdk): prove diffusion_fa:false override path end-to-end Unit test verifies sdcppConfigSchema preserves false through parsing (not just rejects non-booleans). E2E adds diffusion-fa-disabled resource with diffusion_fa:false and a matching test so the full SDK→plugin→addon path is exercised for the opt-out case, not just the addon default. * fix(sdk): replace hardcoded HF URLs with registry constants; add qwen35/gemma4 dialect E2E tests Examples llamacpp-tools-qwen35 and llamacpp-tools-gemma4 were using raw HuggingFace URLs as fallback defaults because the registry had not yet been seeded with Qwen3.5 and Gemma4 models. Now that those constants exist (QWEN3_5_0_8B_MULTIMODAL_Q8_0, GEMMA4_2B_MULTIMODAL_Q4_K_M), use them directly, matching the pattern of all other SDK examples. Adds tools-qwen35 and tools-gemma4 resources to the desktop consumer and two dialect-specific E2E tests (tools-simple-function-qwen35, tools-simple-function-gemma4). PR #1974 wired toolDialect and resourceKey through ToolsExecutor and createToolsTest specifically to enable these tests once constants were available. --------- Co-authored-by: gianni-cor <gianfranco.cordella@tether.io>

Bump @qvac/cli to 0.4.0 and add the v0.4.0 changelog set. Includes all 5 cli-scoped PRs landed on release-cli-0.4.0 since cli-v0.3.0: - QVAC-18677 feat[api]: qvac verify deps (#1969) - QVAC-18717 feat[api]: Qwen3.5 / Gemma4 tool-call dialects + reasoning_budget (#1974) - QVAC-18678 feat[api]: qvac verify bundle (#1984) - QVAC-18730 feat[api]: POST /v1/images/generations on qvac serve (#2008) - chore: consolidate PR templates and hide style note in HTML comment (#1924) PR #1924's title lacked a ticket or [notask], so the changelog generator's strict validator dropped it. It is added manually under the Chores section to keep the changelog truthful to what shipped on release-cli-0.4.0.

…oning_budget param (#1974) * feat[api]: add Qwen3.5, Gemma4 tool-call dialects and reasoning_budget param - Extend toolDialectSchema with 'qwen35' and 'gemma4' values - Add Qwen3.5 Pythonic-XML parser (qwen35.ts): <tool_call><function=NAME> <parameter=KEY>VALUE</parameter></function></tool_call>; string values are raw text, arrays/objects are JSON; type coercion from tool schema - Add Gemma4 native parser (gemma4native.ts): <|tool_call>call:NAME{...}<tool_call|>; JS-literal args with <|"|> quote tokens, split-then-transliterate approach to safely quote bare keys without corrupting string values containing ', key:' - Wire both parsers into parser.ts dispatch and the default catch-all chain - Add dialect specs to completion-normalizer.ts: qwen35 reuses <tool_call> framing; gemma4 has asymmetric <|tool_call>/<tool_call|> + thinking frames - Auto-detect qwen35/gemma4 from model name/path in dialect.ts with guards against Gemma3+Q4 quant suffix and Qwen3 5B parameter-count collisions - Add reasoning_budget (-1 | 0) to LlmConfig (load-time) and GenerationParams (per-request); passes through transformLlmConfig unchanged (snake_case key bypasses camelCase regex, number-to-string conversion handles the value) - Mirror reasoning_budget in CLI SDKGenerationParams type - Add tests-qvac completion tests for reasoning_budget passthrough - Add tool-calling examples for qwen35 and gemma4 in examples/tools/ - Bump @qvac/llm-llamacpp to ^0.20.0 (adds reasoning_budget and new model support shipped in fabric-8189) * fix: exclude system_prompt from C++ config transform; add reasoning_budget to completion-executor llamacpp 8189+ (in @qvac/llm-llamacpp@0.20.0) removed --system-prompt from its CLI argument parser. The SDK was forwarding system_prompt through transformLlmConfig causing all model loads to fail with 'invalid argument: --system-prompt'. system_prompt is JS-only: completion-stream.ts reads it to seed the conversation history. It has no meaning at the C++ level and must be excluded alongside modelType. Also mirrors reasoning_budget in completion-executor.ts GenerationParams so the new tests-qvac reasoning_budget tests type-check correctly. * fix: tighten dialect regexes, extract transformLlmConfig, add exclusion tests - Drop the over-broad qwen.*3\.5 alternative from the qwen35 regex and tighten the lookahead to (?![a-z0-9]) so qwen3-50b-instruct no longer false-matches as qwen35 - Tighten gemma4 lookahead to (?=[^a-z0-9]|$) so gemma-40b no longer false-matches as gemma4 - Extract transformLlmConfig to transform.ts (no addon imports) so it can be unit-tested without the native addon loading - Add llm-plugin-transform.test.ts pinning that system_prompt and modelType are never forwarded to C++ and that reasoning_budget survives - Add negative test cases for qwen3-50b and gemma-40b to tool-parser.test.ts - Fix stale default-chain comment in parser.ts (was 'Harmony first', actual order is Gemma4 first) - Add inline justification for qwen35/gemma4 fallback asymmetry * fix: extend qwen35 dialect to Qwen3.6; escape newlines in Gemma4 arg transliterator * fix: update toolDialect docs to list all dialects; add qwen35/gemma4 normalizer tests * fix: harden qwen35 coercion errors and gemma4 control-char escaping - qwen35: boolean coercion now throws on non-"true"/"false" values ("True" from Python models) instead of silently returning false - qwen35: integer/number coercion now throws on NaN values - qwen35: parameter coercion errors caught per-call and surfaced as PARSE_ERROR instead of propagating as uncaught exceptions - gemma4: control-char escape regex corrected to cover full U+0000-U+001F range using \x00-\x1f escape-sequence text - add 19 new unit tests: typed coercions, error cases, multiple calls, unknown-tool and validation errors, hermes-JSON fallback in qwen35 chain, bare numerics/booleans, nested objects/arrays, tab and CR round-trips, malformed-args PARSE_ERROR in gemma4 * fix: align qwen35 coercion error handling with pythonic/hermes pattern Wrap the full parameter extraction block in a single try/catch instead of an inner try/catch inside the while loop. Matches the convention used by parsePythonicFormat and parseHermesFormat. * test: fix incorrect comments in dialect negative-case tests - gemma3 comment: removed misleading "Q4 quantization suffix" framing; the real concern is a Gemma 3 4B model not being detected as Gemma 4 - gemma-40b comment: corrected factually wrong "4 billion params" description; the actual mechanism is the trailing '0' digit blocking the gemma4 lookahead * test: remove confusing dialect negative-case comments 'digit after 5, not a letter' and 'digit after 6, not a letter' were both wrong — the negative lookahead (?![a-z0-9]) blocks any alphanumeric character, not just digits. Remove rather than rephrase. * fix: reject non-integer floats and malformed array/object params in qwen35 parser integer schema type now rejects non-integer floats (e.g. 1.5) via Number.isInteger check. array/object schema types now propagate PARSE_ERROR on JSON.parse failure instead of silently falling back to the raw string. Add regression tests for both cases. * fix: expose reasoning_budget as boolean in CLI, transform to -1|0 for SDK SDKGenerationParams.reasoning_budget changes from -1|0 (SDK-internal representation) to boolean (true = keep reasoning on, false = disable). sdkCompletion now maps true→-1 and false→0 before forwarding to the SDK. extractGenerationParams parses incoming boolean reasoning_budget from the request body. Tests added for both true and false paths. * feat: wire toolDialect and resourceKey through ToolsExecutor and createToolsTest ToolsExecutor.generic now reads toolDialect (forwarded to completion()) and resourceKey (selects which loaded model to use) from test params. The createToolsTest helper accepts both as optional options, so dialect-specific e2e test definitions can be added once the model constants are available from update-models. * fix: reject empty numeric params in qwen35, allow hyphens in gemma4 tool names, add qwen35 to default parser chain - coerceParamValue: reject empty/whitespace-only numeric params before Number() for both number and integer types; Number("") === 0 caused silent semantic corruption - gemma4native callRegex and bare-key quoting regex: broaden [A-Za-z_]\w* to [A-Za-z_][\w-]* so hyphenated tool names (and param keys) are matched instead of returning matched=false and leaking raw frame markers as contentDelta - pickFormatParsers default chain: insert parseQwen35Format ahead of parseHermesFormat so raw Qwen XML payloads are recovered when the model-name heuristic misses - regression tests for all three cases --------- Co-authored-by: gianni-cor <gianfranco.cordella@tether.io>

Bump @qvac/cli to 0.4.0 and add the v0.4.0 changelog set. Includes all 5 cli-scoped PRs landed on release-cli-0.4.0 since cli-v0.3.0: - QVAC-18677 feat[api]: qvac verify deps (#1969) - QVAC-18717 feat[api]: Qwen3.5 / Gemma4 tool-call dialects + reasoning_budget (#1974) - QVAC-18678 feat[api]: qvac verify bundle (#1984) - QVAC-18730 feat[api]: POST /v1/images/generations on qvac serve (#2008) - chore: consolidate PR templates and hide style note in HTML comment (#1924) PR #1924's title lacked a ticket or [notask], so the changelog generator's strict validator dropped it. It is added manually under the Chores section to keep the changelog truthful to what shipped on release-cli-0.4.0. (cherry picked from commit 22462c8)

…odel registry (#2046) * feat(sdk): expose diffusion_fa in sdcppConfigSchema Adds diffusion_fa to sdcppConfigSchema so callers can explicitly control per-transformer flash attention. The addon enables this by default (required for FLUX.2 to avoid materialising the full Q·Kᵀ attention matrix); the field is a no-op escape hatch for backends that don't support ggml_flash_attn_ext. The plugin's ...rest spread already forwards it to the native layer; no plugin changes required. * fix(sdk): remove flux_flow from prediction enum flux_flow (FLUX.1) was never a supported model family — only flux2_flow (FLUX.2) is. Remove the stale enum value so the SDK schema matches the diffusion addon surface. * fix(sdk): simplify diffusion_fa description and add unit test coverage Shorten the describe() string to match the terse style of adjacent boolean fields. Add diffusion_fa to the "accepts valid full config" fixture in sdcpp-plugin.test.ts so the field has schema-parse coverage. * test(sdk): add diffusion_fa E2E test to tests-qvac Adds a dedicated 'diffusion-fa' resource in the desktop consumer loaded with diffusion_fa: true, a matching executor method that calls ensureLoaded('diffusion-fa'), and a test definition 'diffusion-fa-accepted' that generates a 256x256 image through the full SDK -> plugin -> addon path, confirming the field is accepted and forwarded without breaking inference. * test(sdk): remove misleading comment from diffusion-fa resource * test(sdk): add rejection tests for flux_flow and diffusion_fa type; fix E2E test name and remove redundant preload Add two missing schema rejection tests: non-boolean diffusion_fa and the removed flux_flow prediction value. Rename diffusion-fa-accepted to diffusion-fa-loads-and-runs to match what the test actually verifies (load + generate, not FA effect). Remove preLoadUnload from diffusion-fa resource — it reuses the same Flux2 model files as the diffusion resource, so the extra load+unload at bootstrap is redundant cost. * feat[mod](sdk): add Gemma4-E2B/E4B/31B, Qwen3.5-0.8B/2B/4B/9B, Qwen3.6-27B/35B-A3B to SDK registry * fix[notask]: bump @qvac/diffusion-cpp to ^0.8.0 * test(sdk): prove diffusion_fa:false override path end-to-end Unit test verifies sdcppConfigSchema preserves false through parsing (not just rejects non-booleans). E2E adds diffusion-fa-disabled resource with diffusion_fa:false and a matching test so the full SDK→plugin→addon path is exercised for the opt-out case, not just the addon default. * fix(sdk): replace hardcoded HF URLs with registry constants; add qwen35/gemma4 dialect E2E tests Examples llamacpp-tools-qwen35 and llamacpp-tools-gemma4 were using raw HuggingFace URLs as fallback defaults because the registry had not yet been seeded with Qwen3.5 and Gemma4 models. Now that those constants exist (QWEN3_5_0_8B_MULTIMODAL_Q8_0, GEMMA4_2B_MULTIMODAL_Q4_K_M), use them directly, matching the pattern of all other SDK examples. Adds tools-qwen35 and tools-gemma4 resources to the desktop consumer and two dialect-specific E2E tests (tools-simple-function-qwen35, tools-simple-function-gemma4). PR #1974 wired toolDialect and resourceKey through ToolsExecutor and createToolsTest specifically to enable these tests once constants were available. --------- Co-authored-by: gianni-cor <gianfranco.cordella@tether.io>

donriddo added the test-e2e-smoke Triggers smoke e2e test suite [Currently SDK-only] label May 11, 2026

donriddo had a problem deploying to release May 11, 2026 13:55 — with GitHub Actions Failure

This comment has been minimized.

Sign in to view

donriddo added test-e2e-smoke Triggers smoke e2e test suite [Currently SDK-only] and removed test-e2e-smoke Triggers smoke e2e test suite [Currently SDK-only] labels May 11, 2026

donriddo had a problem deploying to release May 11, 2026 14:17 — with GitHub Actions Failure

This comment has been minimized.

Sign in to view

donriddo had a problem deploying to release May 11, 2026 14:18 — with GitHub Actions Failure

donriddo requested a deployment to release May 11, 2026 14:19 — with GitHub Actions Waiting

This comment has been minimized.

Sign in to view

Merge branch 'main' into feat/sdk-qwen35-gemma4-reasoning-budget

85e083b

donriddo had a problem deploying to release May 12, 2026 15:05 — with GitHub Actions Failure

Merge branch 'main' into feat/sdk-qwen35-gemma4-reasoning-budget

32791a3

gianni-cor dismissed stale reviews from simon-iribarren and NamelsKing via 32791a3 May 12, 2026 16:06

gianni-cor had a problem deploying to release May 12, 2026 16:06 — with GitHub Actions Failure

donriddo had a problem deploying to release May 12, 2026 16:06 — with GitHub Actions Error

Merge branch 'main' into feat/sdk-qwen35-gemma4-reasoning-budget

b977f43

donriddo temporarily deployed to release May 12, 2026 16:25 — with GitHub Actions Inactive

opaninakuffo approved these changes May 12, 2026

View reviewed changes

NamelsKing approved these changes May 12, 2026

View reviewed changes

Merge branch 'main' into feat/sdk-qwen35-gemma4-reasoning-budget

52fda71

gianni-cor had a problem deploying to release May 12, 2026 17:41 — with GitHub Actions Failure

gianni-cor merged commit f12b236 into tetherto:main May 12, 2026
13 of 14 checks passed

gianni-cor had a problem deploying to release May 12, 2026 17:42 — with GitHub Actions Failure

opaninakuffo mentioned this pull request May 13, 2026

QVAC-18805 chore[skiplog]: backmerge release-cli-0.4.0 — version bump, changelog, NOTICE #2038

Merged

This was referenced May 16, 2026

QVAC-17940 chore[skiplog]: release sdk 0.11.0 #2090

Merged

QVAC-17940 chore[skiplog]: backmerge release-sdk-0.11.0 — version bump, changelog, NOTICE, tooling #2091

Merged

donriddo mentioned this pull request May 20, 2026

QVAC-18873 feat[api|mod]: expose diffusion_fa, drop flux_flow, sync model registry #2046

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

QVAC-18717 feat[api]: add Qwen3.5, Gemma4 tool-call dialects and reasoning_budget param#1974

QVAC-18717 feat[api]: add Qwen3.5, Gemma4 tool-call dialects and reasoning_budget param#1974
gianni-cor merged 19 commits into
tetherto:mainfrom
donriddo:feat/sdk-qwen35-gemma4-reasoning-budget

donriddo commented May 11, 2026 •

edited

Loading

Uh oh!

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

donriddo commented May 12, 2026

Uh oh!

gianni-cor commented May 12, 2026

Uh oh!

gianni-cor commented May 12, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

donriddo commented May 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🎯 What problem does this PR solve?

📝 How does it solve it?

🧪 How was it tested?

🔌 API Changes

Uh oh!

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

donriddo commented May 12, 2026

Uh oh!

gianni-cor commented May 12, 2026

Uh oh!

gianni-cor commented May 12, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

donriddo commented May 11, 2026 •

edited

Loading