Swap canonical model from openrouter to models.dev #6625

katzdave · 2026-01-22T01:01:30Z

Summary

Makes the lookups a lot more straightforward since models.dev has data keyed by (provider, model). Gets rid of lots of hacky text parsing. Still leaving in some level of text parsing for 'meta-providers' like databricks and bedrock. Sort by release date (works great!).

Testing

Tested Model filtering, sorting cost with:

[ x ] OpenAI
[ x ] Databricks
[ x ] Anthropic
[ x ] Google
[ x ] xai
[ x ] OpenRouter
[] Tetrate (need key)

Coming in Followup

Use canonical model for context limits. Want to aggressively delete a bunch of code here + verify with some oneoff scripts that we don't lose context data for major providers.

…ovider * 'main' of github.com:block/goose: increase worker threads for ci (#6614) docs: todo tutorial update (#6613) Added goose doc map md file for goose agent to find relevant doc easily. (#6598) add back goose branding to home (#6617) fix: actually set the working dir for extensions from session (#6612) Multi chat (#6428) Lifei/fixed accumulated token count (#6587) Dont show MCP UI/Apps until tool is approved (#6492) docs: max tokens config (#6596) User configurable templates (#6420) docs: http proxy environment variables (#6594) feat: exclude subagent tool from code_execution filtering (#6531)

…ovider * 'main' of github.com:block/goose: PR Code Review (#6043) fix(docs): use dynamic import for globby ESM module (#6636) chore: trigger CI Document tab completion (#6635) Install goose-mcp crate dependencies (#6632) feat(goose): standardize agent-session-id for session correlation (#6626) chore: tweak release docs (#6571) fix(goose): propagate session_id across providers and MCP (#6584)

Copilot

Pull request overview

This PR swaps the canonical model data source from OpenRouter to models.dev. The key change simplifies model lookups by using (provider, model) keyed data instead of parsing model IDs. The PR also implements sorting by release date and updates pricing to be per-million tokens.

Changes:

Switched canonical model API from OpenRouter to models.dev
Refactored registry to use (provider, model) tuple keys instead of single ID strings
Updated pricing fields from prompt/completion to input/output with per-million token rates
Added release date sorting for recommended models
Implemented fetch_supported_models for xAI provider

Reviewed changes

Copilot reviewed 13 out of 15 changed files in this pull request and generated 2 comments.

Show a summary per file

File	Description
ui/desktop/src/api/types.gen.ts	Updated comment to clarify costs are in USD
ui/desktop/openapi.json	Updated OpenAPI spec comments for cost fields
crates/goose/tests/providers.rs	Added openrouter to default truncation test exception
crates/goose/src/providers/xai.rs	Added fetch_supported_models implementation
crates/goose/src/providers/utils.rs	Added "maximum prompt length" to context exceeded detection
crates/goose/src/providers/canonical/registry.rs	Changed registry from HashMap to HashMap<(String, String)>
crates/goose/src/providers/canonical/name_builder.rs	Simplified lookup logic for non-meta providers
crates/goose/src/providers/canonical/model.rs	Restructured model to match models.dev schema
crates/goose/src/providers/canonical/mod.rs	Updated to parse canonical IDs for registry lookup
crates/goose/src/providers/canonical/data/canonical_mapping_report.json	Updated with new mapping data
crates/goose/src/providers/canonical/build_canonical_models.rs	Rewrote to fetch from models.dev instead of OpenRouter
crates/goose/src/providers/base.rs	Added release date sorting for recommended models
crates/goose-server/src/routes/config_management.rs	Converted pricing from per-million to per-token
crates/goose-cli/src/session/output.rs	Converted pricing from per-million to per-token

Copilot · 2026-01-22T17:22:14Z

crates/goose/src/providers/base.rs

+                let is_text_capable = canonical_id
+                    .split_once('/')
+                    .and_then(|(p, m)| registry.get(p, m))
+                    .map(|m| m.modalities.input.contains(&"text".to_string()))
+                    .unwrap_or(false);
+
+                if !is_text_capable {
+                    return None;
+                }
+
+                let release_date = canonical_id
+                    .split_once('/')
+                    .and_then(|(p, m)| registry.get(p, m))
+                    .and_then(|canonical_model| canonical_model.release_date.clone());


The registry is looked up twice for each model in the filter_map - once to check is_text_capable (lines 462-466) and again to get release_date (lines 472-475). This is inefficient. Consider looking up the canonical model once and reusing it.

Suggested change

let is_text_capable = canonical_id

.split_once('/')

.and_then(|(p, m)| registry.get(p, m))

.map(|m| m.modalities.input.contains(&"text".to_string()))

.unwrap_or(false);

if !is_text_capable {

return None;

}

let release_date = canonical_id

.split_once('/')

.and_then(|(p, m)| registry.get(p, m))

.and_then(|canonical_model| canonical_model.release_date.clone());

let (provider, model_name) = match canonical_id.split_once('/') {

Some(parts) => parts,

None => return None,

};

let canonical_model = match registry.get(provider, model_name) {

Some(m) => m,

None => return None,

};

if !canonical_model

.modalities

.input

.contains(&"text".to_string())

{

return None;

}

let release_date = canonical_model.release_date.clone();

Copilot · 2026-01-22T17:22:14Z

crates/goose/src/providers/canonical/mod.rs


 pub fn maybe_get_canonical_model(provider: &str, model: &str) -> Option<CanonicalModel> {
    let registry = CanonicalModelRegistry::bundled().ok()?;
+    map_to_canonical_model(provider, model, registry)?;


This function calls map_to_canonical_model twice with the same arguments. The first call on line 26 is unused - its result is ignored. This appears to be leftover debug code that should be removed.

Suggested change

map_to_canonical_model(provider, model, registry)?;

Copilot

Pull request overview

Copilot reviewed 13 out of 15 changed files in this pull request and generated no new comments.

…ovider * 'main' of github.com:block/goose: fix: Manual compaction does not update context window. (#6682) Removed the Acceptable Usage Policy (#6204) Document spellcheck toggle (#6721) fix: docs workflow cleanup and prevent cancellations (#6713) Docs: file bug directly (#6718) fix: dispatch ADD_ACTIVE_SESSION event before navigating from "View All" (#6679) Speed up Databricks provider init by removing fetch of supported models (#6616) fix: correct typos in documentation and Justfile (#6686) docs: frameDomains and baseUriDomains for mcp apps (#6684)

…ovider * 'main' of github.com:block/goose: fix slash and @ keyboard navigation popover background color (#6550) fix[format/openai]: return error on empty msg. (#6511) Fix: ElevenLabs API Key Not Persisting (#6557) Logging uplift for model training purposes (command injection model) [Small change] (#6330) fix(goose): only send agent-session-id when a session exists (#6657) BERT-based command injection detection in tool calls (#6599) chore: [CONTRIBUTING.md] add Hermit to instructions (#6518) fix: update Gemini context limits (#6536) Document r slash command (#6724) Upgrade GitHub Actions to latest versions (#6700)

Remove session_id parameters from fetch_supported_models and fetch_recommended_models calls to match the updated Provider trait signature that doesn't include session_id. - xai.rs: Remove session_id param, pass None to response_get - build_canonical_models.rs: Remove session_id arg from fetch_recommended_models call

Copilot

Pull request overview

Copilot reviewed 13 out of 15 changed files in this pull request and generated no new comments.

DOsinga

Yeah seems good. My only concern is the whole tokens per millions and where convert this. We should fix this at the beginning or the end

DOsinga · 2026-01-29T17:17:58Z

crates/goose-server/src/routes/config_management.rs

            model: query.model.clone(),
-            input_token_cost: input_cost,
-            output_token_cost: output_cost,
+            // Canonical model costs are per million tokens, convert to per-token


needing this comment suggests that we need to refactor this more either at the beginning or at the end - let's have one unit we pass around

My followup pr deletes the pricing API and replaces it with a full canonical model fetch (or at least just the fields the client needs, but gives us a place to layer on more). will keep as cost / mtokens until we render.

DOsinga · 2026-01-29T17:19:00Z