Add NVIDIA provider, and improve declarative provider UX #8798

Merged
jh-block merged 1 commit into main from jhugo/nvidia-provider on Apr 24, 2026
Conversation

@jh-block (Collaborator) commented Apr 24, 2026

Fixes #8505

Summary

  • Add NVIDIA NIM as a declarative provider with NVIDIA-specific docs and setup steps.
  • Allow declarative providers to override model docs and setup metadata, and stop inheriting OpenAI config fields in the settings UI.
  • Fix canonical model limit handling so models with full-context output limits do not request the entire context as output.
  • Refresh provider state immediately after config changes so NVIDIA appears in the model picker without reopening the modal.
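The model-limit fix in the third bullet can be sketched as follows. This is an illustrative stand-in, not the crate's actual types (`CanonicalLimit` and `resolve_max_tokens` are assumed names): some registry entries use an output limit equal to the full context window as a sentinel, and the change keeps an explicit output limit only when it is strictly below the context.

```rust
/// Illustrative stand-in for a registry entry's token limits; the real
/// type lives in the goose crate and may differ (assumption).
struct CanonicalLimit {
    context: u64,
    output: Option<u64>,
}

/// Mirrors the filter added in crates/goose/src/model.rs: keep an explicit
/// output limit only when it is strictly below the context window; otherwise
/// drop it so the provider default applies instead of the full context.
fn resolve_max_tokens(limit: &CanonicalLimit) -> Option<i32> {
    limit
        .output
        .filter(|&output| output < limit.context)
        .map(|output| output as i32)
}

fn main() {
    // Normal case: an explicit 8k output cap on a 128k-context model survives.
    let normal = CanonicalLimit { context: 128_000, output: Some(8_192) };
    assert_eq!(resolve_max_tokens(&normal), Some(8_192));

    // Sentinel case: output == context would have requested the entire
    // context as output (triggering false compaction), so it is dropped.
    let sentinel = CanonicalLimit { context: 128_000, output: Some(128_000) };
    assert_eq!(resolve_max_tokens(&sentinel), None);
}
```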

Testing

  • Added unit tests for NVIDIA declarative provider deserialization and registry wiring.
  • Added coverage for backward-compatible declarative provider config deserialization.
  • Added a regression test for the model limit behavior that triggered false compaction.
  • Ran cargo fmt --check, targeted cargo test -p goose ... provider/model tests, and desktop TypeScript typecheck.

Signed-off-by: jh-block <jhugo@block.xyz>
@jh-block force-pushed the jhugo/nvidia-provider branch from f04b989 to b39ff8c on April 24, 2026 08:49
@jh-block changed the title from "Fix NVIDIA provider configuration and model defaults" to "Add NVIDIA provider, and improve declarative provider UX" on Apr 24, 2026

@chatgpt-codex-connector (Bot) left a comment
💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: f04b989c88


Comment thread: crates/goose/src/model.rs, lines +161 to +165

```rust
self.max_tokens = canonical
    .limit
    .output
    .filter(|&output| output < canonical.limit.context)
    .map(|output| output as i32);
```
P2: Preserve canonical output limits above context

This filter now drops every canonical output limit that is not strictly less than context, so models with output > context in the bundled registry lose their explicit limit and fall back to max_output_tokens() = 4096. In practice that silently shrinks allowed output for affected models (for example entries in canonical_models.json with larger output caps), even though this change was intended to handle the output == context sentinel case. Restricting the skip to equality (or clamping) avoids this unintended 4k cap.
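The reviewer's suggested narrowing could be sketched like this: restrict the skip to the exact `output == context` sentinel and clamp anything larger to the context window. Names are illustrative and this is not the merged code; it only demonstrates the behavior the comment asks for.

```rust
/// Illustrative limits type, mirroring the shape used in the diff above
/// (assumption; the real registry type may differ).
struct CanonicalLimit {
    context: u64,
    output: Option<u64>,
}

/// Skip only the exact output == context sentinel; clamp an output limit
/// larger than the context instead of discarding it, so such registry
/// entries keep an explicit cap rather than falling back to the 4096 default.
fn resolve_max_tokens_clamped(limit: &CanonicalLimit) -> Option<i32> {
    limit
        .output
        .filter(|&output| output != limit.context)
        .map(|output| output.min(limit.context) as i32)
}

fn main() {
    // output > context: clamped to the context window, not dropped to 4096.
    let oversized = CanonicalLimit { context: 8_192, output: Some(16_384) };
    assert_eq!(resolve_max_tokens_clamped(&oversized), Some(8_192));

    // The sentinel is still skipped, as the PR intended.
    let sentinel = CanonicalLimit { context: 8_192, output: Some(8_192) };
    assert_eq!(resolve_max_tokens_clamped(&sentinel), None);
}
```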


@jh-block (Collaborator, Author) replied:
Will fix the canonical data and then we can look at removing this workaround anyway.

@jh-block added this pull request to the merge queue Apr 24, 2026
Merged via the queue into main with commit 4065d44, Apr 24, 2026
21 of 22 checks passed
@jh-block deleted the jhugo/nvidia-provider branch April 24, 2026 14:02
lifeizhou-ap added a commit that referenced this pull request Apr 27, 2026
* main: (29 commits)
  chore(deps): bump winreg from 0.55.0 to 0.56.0 (#8829)
  Fix grammar issue (#8669)
  colorize context window indicator (#8851)
  Refresh canonical model metadata from models.dev (#8838)
  fix(ci): prevent flaky smoke test timeouts from failing the build (#8837)
  updates: release 0.19.0 of the tui/sdk/etc (#8806)
  add a goose2 signed release flow (#8728)
  Port provider tests to typescript (#8237)
  refactor: make ACP server smaller (#8787)
  Add NVIDIA provider, and improve declarative provider UX (#8798)
  fix: removed failed provider test for deprecated providers (#8801)
  fix: only call cleanup when the pr is from same repo (#8799)
  chore: check stale for draft pr (#8803)
  fix: use _meta instead of meta in newSession request (#8796)
  fix: add missing underscore prefix in updateWorkingDir method name (#8743)
  feat: migrate session metadata storage from frontend overlay to backend (#8769)
  Add more info to BUILDING_LINUX (#8789)
  feat(acp): Align to new request patterns of ACP Streamable HTTP/WS transport (#8605)
  Dedupe and organize skills/sources (#8731)
  docs: add skills slash command (#8783)
  ...


Development

Successfully merging this pull request may close these issues.

Add support for NVIDIA API key to use NVIDIA models

2 participants