feat: multi model and multi provider config and auto switching #4035

michaelneale · 2025-08-12T07:58:38Z

This generalises the GOOSE_LEAD/WORKER variable structure into a multi-model approach (i.e. that old code path can be removed).
This is clearly marked with an x- experimental config prefix (as per convention) so the exact format can evolve as people use this.

This is really phase 1. Phase 2 will bring GUI/UX changes to support this, plus more opinionated defaults when providers are configured, but this needs to get out in the wild so we can see how it performs across all the provider permutations we can't test by hand.

For example, in config:

x-advanced-models:
- provider: databricks
  model: goose-gpt-5
  role: reviewer
- provider: anthropic
  model: claude-opus-4-1-20250805
  role: deep-thinker
- provider: anthropic
  model: claude-opus-4-1-20250805
  role: lead

which maps to premade_roles.yaml, which defines rules for those models and when they activate:

roles:
  # Deep reasoning and analysis
  - role: "deep-thinker"
    rules:
      triggers:
        keywords: ["think", "reason", "analyze", "explain why", "how does", "what if"]
        match_type: "any"
        complexity_threshold: "high"
        source: "human"  # Only trigger on human messages
      active_turns: 3
      priority: 10

These can be used as is, or each trigger value/setting can be overridden in the personal config if none of the pre-made set fits. Normally you just specify role + provider + model and let it work the rest out, but you can customise.
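As a rough illustration of the override behaviour described above (not the code in this PR), the Python sketch below merges a pre-made role's rules with per-user overrides. The dictionary layout mirrors the premade_roles.yaml excerpt; the optional `rules` key on a config entry and the `merge_rules` / `resolve_role` helpers are assumptions made for the example, not goose's actual config format or API.

```python
from copy import deepcopy

# Pre-made role definition, mirroring the premade_roles.yaml excerpt above.
PREMADE_ROLES = {
    "deep-thinker": {
        "triggers": {
            "keywords": ["think", "reason", "analyze", "explain why", "how does", "what if"],
            "match_type": "any",
            "complexity_threshold": "high",
            "source": "human",
        },
        "active_turns": 3,
        "priority": 10,
    }
}


def merge_rules(base, overrides):
    """Return a copy of `base` with `overrides` applied key by key.

    Nested dicts (such as `triggers`) are merged recursively; any other
    value in `overrides` replaces the pre-made value outright.
    """
    merged = deepcopy(base)
    for key, value in overrides.items():
        if isinstance(value, dict) and isinstance(merged.get(key), dict):
            merged[key] = merge_rules(merged[key], value)
        else:
            merged[key] = deepcopy(value)
    return merged


def resolve_role(entry):
    """Combine a user config entry with its pre-made role (hypothetical helper).

    `entry` is assumed to look like an x-advanced-models item, with an
    optional `rules` key holding the user's overrides.
    """
    base = PREMADE_ROLES[entry["role"]]
    return merge_rules(base, entry.get("rules", {}))
```

With this shape, an entry that only names role + provider + model gets the pre-made rules unchanged, while an entry carrying `rules: {active_turns: 5}` changes just that one setting.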

By default the main provider/model is used; these models supplement it when their rules are met, and run for a limited number of turns (this already helped me, as the gptoss:120B model was good at spotting short-circuit logic bugs that other models failed to see).
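To make the trigger / active-turns behaviour concrete, here is a toy Python sketch. It is not the logic in crates/goose/src/agents/autopilot.rs: the `Role` and `Autopilot` classes are invented for illustration, keyword matching is a naive substring test, `complexity_threshold` is ignored, and "larger priority number wins" is an assumption.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Role:
    name: str
    keywords: List[str]
    priority: int
    active_turns: int
    source: str = "human"  # which message source may trigger the role


@dataclass
class Autopilot:
    """Toy switcher: falls back to the main model when no role is active."""

    roles: List[Role]
    active: Optional[Role] = None
    turns_left: int = 0

    def pick(self, text: str, source: str = "human") -> str:
        """Return the role name to use for this turn, or "main"."""
        lowered = text.lower()
        # match_type "any": a single keyword hit is enough to trigger.
        hits = [
            r for r in self.roles
            if r.source == source and any(k in lowered for k in r.keywords)
        ]
        if hits:
            # Assumption: a larger priority number wins.
            best = max(hits, key=lambda r: r.priority)
            if self.active is None or best.priority >= self.active.priority:
                self.active, self.turns_left = best, best.active_turns
        if self.active is None:
            return "main"
        name = self.active.name
        self.turns_left -= 1
        if self.turns_left == 0:
            self.active = None  # revert to the main provider/model next turn
        return name
```

A role triggered by "think" with `active_turns: 3` would handle that turn and the next two, after which `pick` returns "main" again until another trigger fires.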

This could also be used to do most of the work in a low cost or zero cost local model as well.

Run with --debug to see it log provider and model switches as it works.

discussion #3980
implements: #4036

joeeuston-dev · 2025-08-27T08:36:23Z

This sounds like an awesome feature. Currently I'm having to switch models when I do more multi modal work as compared to my 'everything' model.

michaelneale · 2025-08-27T10:05:19Z

@joeeuston-dev yeah - I keep finding uses for it - still tuning what defaults I want, but lots of possibilities here and fairly simple in the end!

jamadeo

This is very cool and definitely something Goose needs IMO. I'm worried a bit about the number of knobs to turn with a setup like this though. Would users dive into things like triggering keywords, tool/turn counts and priority? If they would, how do you judge if you're tuning it well? Seems it would take quite a lot of trial and error.

What about an approach that uses a model to judge when to switch roles? If you're already configuring Goose to be multi-model, it could be a reasonable requirement to assign a model to have the role of "routing".

It could also be interesting to combine this with the TO-DO list strategy. Each TODO item gets a role pre-assigned.
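One way to picture the routing-model idea raised above is the Python sketch below. It is purely illustrative: `ask_model` stands in for a call to whichever model would be given the "routing" role, and `build_routing_prompt` / `route` are hypothetical names, not part of goose.

```python
from typing import Callable, Optional, Sequence


def build_routing_prompt(message: str, roles: Sequence[str]) -> str:
    """Ask the routing model to answer with exactly one role name or 'main'."""
    options = ", ".join(list(roles) + ["main"])
    return (
        "Pick the best role for the next message.\n"
        f"Options: {options}\n"
        f"Message: {message}\n"
        "Answer with one option only."
    )


def route(
    message: str,
    roles: Sequence[str],
    ask_model: Callable[[str], str],
) -> Optional[str]:
    """Return the chosen role, or None to stay on the main model.

    `ask_model` takes a prompt and returns the raw reply text. Role names
    are assumed to be lowercase, as in the config example.
    """
    reply = ask_model(build_routing_prompt(message, roles)).strip().lower()
    # Only trust replies that name a configured role; anything else
    # (including "main" or a rambling answer) keeps the main model.
    return reply if reply in roles else None
```

Validating the reply against the configured roles keeps a misbehaving router from selecting a model that was never set up.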

crates/goose/src/agents/agent.rs

crates/goose/src/agents/autopilot.rs

michaelneale · 2025-08-27T23:28:52Z

@jamadeo "This is very cool and definitely something Goose needs IMO. I'm worried a bit about the number of knobs to turn with a setup like this though. Would users dive into things like triggering keywords, tool/turn counts and priority?"

No, I would hope they don't need to; the only time would be if they really went bespoke. It is more about simplifying changes to goose itself as people discover or learn patterns, rather than needing to change code (similar to providers, where ideally we make it as easy as possible to add another one). But absolutely not: they should only need to care about a few roles, we can even have a sensible default setup, and the CLI/GUI can suggest others. Make sense?

michaelneale · 2025-08-28T08:37:09Z

@DOsinga thoughts on whether we should consolidate model config to:

models:

... like this (can have that instead of a variable if we want)? this then can help the cli and GUI have a simpler structure to target when storing a model.

michaelneale · 2025-08-28T08:39:00Z

@jamadeo "What about an approach that uses a model to judge when to switch roles?"

I like that, though I wouldn't want it on each turn. One option is that early on it could use that model to "turn the knobs and dials" (i.e. help select appropriate roles), which then kick in so you don't have to. What I would like to get to is the minimal setup: if you set up N providers and M models, how can we best make use of them, how much do we do automatically, and how much do we let users direct?

jamadeo

Let's do it. I like it as an experiment and we definitely need the model-switching infrastructure. I think we'll change our minds on the switching logic and eventually have preferred models that fulfill each role.

checkpoint

3308eae

michaelneale self-assigned this Aug 12, 2025

michaelneale mentioned this pull request Aug 12, 2025

feat: multi model and multi provider config and auto switching (lead/worker consolidation/simplification) #4036

Closed

michaelneale added 20 commits August 12, 2025 18:22

nicer refactoring

aae9c0b

test coverage

66d5030

WIP - testing out turn logic CONFIRM OR REMOVE

3f1ac4c

simplifying

1066976

printing out debugging

6557b37

Merge branch 'micn/multi-model-multi-provider-autopilot' of github.co…

2f3c594

…m:block/goose into micn/multi-model-multi-provider-autopilot * 'micn/multi-model-multi-provider-autopilot' of github.com:block/goose: printing out debugging simplifying

new version of flexible

7fbc2df

flexible refactoring 2

8e0ab84

second opionion

d527964

clarifying tests

7a262ba

Merge branch 'main' into micn/multi-model-multi-provider-autopilot

ea8b364

fmt

411fbcd

fix bug where it wouldn't stick

05540d7

now working a bit better

aa416a5

refactoring names and rules

2306c7f

checkpoint

0d10b90

checkpoint

1d17e2f

remove slop

653b547

michaelneale marked this pull request as ready for review August 26, 2025 07:50

fmt

9742989

michaelneale requested review from DOsinga and jamadeo August 26, 2025 09:35

michaelneale added 3 commits August 27, 2025 15:34

unneeeded

2bdb004

tidy roles

ebc8a96

tidy test

39b4855

michaelneale added 2 commits August 27, 2025 20:03

add lead support

99cee31

fmt

e3b8e8d

jamadeo reviewed Aug 27, 2025

crates/goose/src/agents/agent.rs (outdated)

crates/goose/src/agents/autopilot.rs (outdated)

crates/goose/src/agents/autopilot.rs (outdated)

michaelneale added 3 commits August 28, 2025 17:37

add a bunch of algorithms to classify complexity

e2a1109

block scope idiomatic

43d45f4

michaelneale requested a review from jamadeo August 28, 2025 23:48

michaelneale added 6 commits August 29, 2025 15:58

tidying up

15d5f46

fmt

f5329e8

adding docs and renaming config

4ddfef3

michaelneale assigned katzdave and jamadeo Sep 4, 2025

jamadeo approved these changes Sep 4, 2025

michaelneale merged commit 439d293 into main Sep 4, 2025
10 checks passed

michaelneale deleted the micn/multi-model-multi-provider-autopilot branch September 4, 2025 01:24

dianed-square pushed a commit that referenced this pull request Sep 4, 2025

feat: multi model and multi provider config and auto switching (#4035)

a704cc2

This was referenced Sep 9, 2025

chore(release): release version 1.8.0 #4572

Closed

release/1.8.0 #4577

Merged

thebristolsound pushed a commit to thebristolsound/goose that referenced this pull request Sep 11, 2025

feat: multi model and multi provider config and auto switching (block…

c676363

…#4035) Signed-off-by: Matt Donovan <mattddonovan@protonmail.com>

HikaruEgashira pushed a commit to HikaruEgashira/goose that referenced this pull request Oct 3, 2025

feat: multi model and multi provider config and auto switching (block…

a99e1f7

…#4035) Signed-off-by: HikaruEgashira <hikaru-egashira@c-fo.com>
