@lifeizhou-ap lifeizhou-ap commented Jul 7, 2025

Adds the ability to run sub-recipes multiple times with parallel execution and real-time progress tracking.

Key Features:

  • Parallel Execution of sub-recipes: Run multiple sub-recipe instances concurrently with configurable worker pools.
    • You can run different sub-recipes in parallel, as long as you ask for it in your prompt. For example: run analyze-pagerduty-incident and analyze-service-issues in parallel.
    • You can also run the same sub-recipe in parallel with different parameters. For example: get the weather for the 3 largest cities in Australia in parallel.
  • Real-time Dashboard: Live progress tracking with task statistics and status updates

Changes:

  • Add executions and sequential_when_repeated to the sub_recipes section

    ```yaml
    sub_recipes:
    - name: weather
      path: "{{recipe_dir}}/weather.yaml"
      values:
        date: "2025-01-01"
      sequential_when_repeated: true
    ```
    
    • values section: the data in this section is always used when running the sub-recipe
    • sequential_when_repeated: whether this sub-recipe should run sequentially when it is executed multiple times with different parameters. Optional field; the default is false. (A rough sketch of how these fields might map onto the recipe schema follows this list.)
      • If sequential_when_repeated is true, the sub-recipe will NOT run in parallel with different params, even if the user requests it in the prompt.
      • If sequential_when_repeated is false or not specified, the sub-recipe runs in parallel by default, unless the user requests sequential execution in the prompt.
  • New notification Event (notification_events.rs) for real-time task updates

  • Enhanced executor engine with parallel worker support and error handling

  • Console UI components for task execution dashboard (task_execution_display/)
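As a rough illustration of the schema change (a sketch only, not the exact struct from this PR; the field types and serde attributes are assumptions), the sub-recipe entry could be modeled like this:

```rust
use std::collections::HashMap;
use serde::{Deserialize, Serialize};

/// Sketch of a sub-recipe entry. Field names follow the PR description;
/// the real struct in the codebase may use different types or attributes.
#[derive(Debug, Clone, Serialize, Deserialize)]
pub struct SubRecipe {
    pub name: String,
    pub path: String,
    /// Values that are always applied when this sub-recipe runs.
    pub values: Option<HashMap<String, String>>,
    /// When true, repeated runs with different parameters stay sequential,
    /// even if the user asks for parallel execution in the prompt.
    #[serde(default)]
    pub sequential_when_repeated: bool,
    pub timeout_in_seconds: Option<u64>,
}
```

With #[serde(default)], omitting sequential_when_repeated in the YAML deserializes to false, which matches the documented default.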

How it works

  1. Input Phase
  • User provides a recipe with multiple sub-recipe executions
  • Recipe parser creates individual tasks from the input
  • Tasks are queued for parallel execution
  2. Execution Phase (a minimal worker-pool sketch follows this list)
  • Executor spawns a configurable number of workers
  • Workers pull tasks from the queue and execute sub-recipes concurrently
  • Each worker runs independently with its own sub-recipe instance
  3. Monitoring Phase
  • Task Execution Tracker monitors all worker progress
  • Real-time events are generated for status changes
  • Statistics are calculated and updated continuously
  4. UI Updates
  • Notification events stream to the dashboard
  • Real-time display shows completed, running, failed, and pending counts
  • Users see live progress without polling
  5. Error Handling
  • Failed tasks are captured with detailed error context
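
The executor internals are not included in this conversation, so the following is only a minimal sketch of the worker-pool pattern described above, written with tokio. The Task type, run_sub_recipe, and the channel-based status stream are placeholders, not this PR's actual API.

```rust
use std::sync::Arc;
use tokio::sync::{mpsc, Semaphore};

// Placeholder task type; the real PR carries richer task and status structures.
#[derive(Debug, Clone)]
struct Task {
    sub_recipe_name: String,
}

// Stand-in for actually spawning a sub-recipe run.
async fn run_sub_recipe(task: &Task) -> Result<String, String> {
    Ok(format!("finished {}", task.sub_recipe_name))
}

/// Run tasks with at most `max_workers` sub-recipes in flight at once,
/// streaming status updates over a channel so a dashboard can render them live.
async fn execute_tasks(tasks: Vec<Task>, max_workers: usize) {
    let (status_tx, mut status_rx) = mpsc::unbounded_channel::<String>();
    let limit = Arc::new(Semaphore::new(max_workers));

    let mut handles = Vec::new();
    for task in tasks {
        let permit_source = Arc::clone(&limit);
        let tx = status_tx.clone();
        handles.push(tokio::spawn(async move {
            // Each worker waits for a free slot, then runs its own sub-recipe instance.
            let _permit = permit_source.acquire_owned().await.expect("semaphore closed");
            let _ = tx.send(format!("running: {}", task.sub_recipe_name));
            match run_sub_recipe(&task).await {
                Ok(msg) => { let _ = tx.send(format!("completed: {msg}")); }
                Err(err) => { let _ = tx.send(format!("failed: {err}")); }
            }
        }));
    }
    drop(status_tx); // close the channel once all workers hold their own senders

    // Dashboard side: consume status events as they arrive, no polling needed.
    while let Some(update) = status_rx.recv().await {
        println!("{update}");
    }
    for handle in handles {
        let _ = handle.await;
    }
}

#[tokio::main]
async fn main() {
    let tasks = (1..=3)
        .map(|i| Task { sub_recipe_name: format!("weather-city-{i}") })
        .collect();
    execute_tasks(tasks, 2).await;
}
```

The semaphore caps concurrency at max_workers, while the channel plays the role of the real-time notification stream that the dashboard consumes.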

@lifeizhou-ap lifeizhou-ap force-pushed the lifei/run-sub-recipe-multiple-times branch from 0a50a86 to cb0e6a5 on July 7, 2025 08:43
* main: (23 commits)
  docs: VS Code MCP video (#3307)
  docs: fixed broken link (#3306)
  Add YouTube video to Netlify MCP documentation (#3302)
  docs: add sub-recipes topic (#3241)
  docs: move topics to tutorials section (#3297)
  site analytics (#3293)
  chore(release): release version 1.0.35 (#3292)
  docs: enhanced code editing topic (#3287)
  fix cu (#3291)
  feat: Add environment variables to override model context limits (#3260)
  chore(release): release version 1.0.34 (#3285)
  fix(devcontainer): install protoc to fix build (#3267)
  Enabling npx command to install on Windows Desktop (#3283)
  Fix: Allow native Cmd+Up/Down cursor movement when user has typed text (#3246)
  chore(release): release version 1.0.33 (#3284)
  fix Windows Env Vars (#3282)
  feat: bedrock image content support (#3266)
  Add support in goose configure for streaming http mcp tools (#3256)
  docs: add Alby MCP tutorial (#3217)
  refactor(tests): make logging test in goose-cli less flaky on macos (#3273)
  ...
@lifeizhou-ap lifeizhou-ap marked this pull request as ready for review July 9, 2025 09:09

lifeizhou-ap commented Jul 11, 2025

I wonder if it would be even more powerful if we could somehow move the recipe path into executions. Currently this PR allows us to run one recipe with different params multiple times in parallel. But what if we had a predefined collection of recipes that we wanted to all run in parallel (effectively as one tool for the agent)? That could be powerful.

Something like:

```yaml
sub_recipes:
- name: code_review
  values:
    severity_level: "high"
  executions:
    parallel: true
    runs:
    - path: "{{recipe_dir}}/lint.yaml"
      values:
        tool: "clippy"
    - path: "{{recipe_dir}}/sql-bugs.yaml"
      values:
        db: "postgres"
```

I feel this is like an inline sub-recipe with a sub-recipe :) (i.e. a nested sub-recipe).

Currently a sub-recipe block maps to one recipe, and its values are only for that sub-recipe:

values:
  severity_level: "high"

This part would be a little confusing with this block. It requires that both the lint.yaml and sql-bugs.yaml recipes take the same severity_level param.

In the main branch, parallel execution of different recipes has already been implemented, as below:

```yaml
sub_recipes:
- name: lint
  path: "{{recipe_dir}}/lint.yaml"
  values:
    severity_level: "high"
    tool: "clippy"
- name: sql-bugs
  path: "{{recipe_dir}}/sql-bugs.yaml"
  values:
    severity_level: "high"
    db: "postgres"
```


lifeizhou-ap commented Jul 11, 2025

There is also another perspective to take on this. You could argue that the agent can already choose to run multiple sub_recipes in parallel (and models are getting better at understanding prompts), so we should not add code complexity and schema rigidity in this PR.

Instead we should rely entirely on the llm/agent to follow prompts and run tools in the correct order/parallelism natively.

There is a danger here that we are getting very close to building a limited version of prescribed workflows (particularly if we go this route). If we do decide we want to be able to program full-on workflows, then we would be better off building dedicated workflow logic outside of recipes and having recipes just as the agent node (specification) building block.

If we go ahead with this functionality for recipes as is, I think we should mark it as 'experimental' so that it's more feasible to remove if we discover it doesn't align well with goose's future recipe directions.

The schema-level parallelism is an optional field for the user to enforce a deterministic flow. I guess your concern is that the field would be defined in the recipe schema now, and later on it would be hard to remove if we decide we don't need it?

Marking this as experimental makes sense if we proceed — it'll give us room to iterate or revert if needed.

I'm on board with marking it experimental if we decide to include it, but I'm also up for talking more about where we see this going long term.

@jsibbison-square
Contributor

Am I understanding correctly that this new executions and runs section of the recipe is more like a 'prompt' influence on the agent's tool use? The only thing that's enforced is the number of params and maybe the number of times the recipe will be called (runs).

If I understand right (I'm new to the sub-agent code 😬), it looks like the agent first calls a tool to 'create sub recipe tasks' and then, with the response, it is supposed to call an 'execute tasks' tool. But ultimately, if the agent decides it wants to run the recipes sequentially instead of in parallel, or run fewer/more sub-recipes, or override the params itself, it totally can, because there is not a lot of enforcement to follow the sub-recipe config. It mostly generates prompt engineering for the agent (apart from the value defaults we provide).

If it is the case that these new schema fields are mostly optional guidance that the agent can potentially ignore, then I think this is a lot of extra complexity, and schema changes are painful to revert.

Sub-recipes can already be run in parallel with different parameters without this PR, so I think we should rely on standard prompt engineering in the prompt/system prompt rather than trying to use this schema as hints for the agent.

I think there are a lot of other useful things in this PR around subtask rendering in the CLI, and probably other refactoring, but I think we should not merge the recipe schema changes 🙅. But happy to chat and hear other opinions 😃.

@michaelneale
Collaborator

Conceptually, would you describe sub-recipes as a way to do "sub-agents"? That seems to be the topic of the day (i.e. many agents working together). Would that be one way to describe this to users not familiar with goose terminology?

@lifeizhou-ap
Collaborator Author

> Conceptually, would you describe sub-recipes as a way to do "sub-agents"? That seems to be the topic of the day (i.e. many agents working together). Would that be one way to describe this to users not familiar with goose terminology?

Examples of use cases:

  • In the mono-repo build, there are 3 services that failed the build. I can find the 3 build URLs and use a sub-recipe (a diagnose-failure sub-recipe), passing the build URL to diagnose these 3 services in parallel.
  • I have a list of document links in a CSV file, and I would like to summarise each document (a summarise-document sub-recipe).

@jamadeo jamadeo left a comment

My main feedback is around the need for all the flexibility of parallel vs sequential execution within a task. IMO the model should not decide how that works -- it's just getting a batch of results anyway. Sub-recipes setting a flag to specify their execution when they get a batch of parameters seems OK.

```rust
let additional_sub_recipe = SubRecipe {
    path: recipe_file_path.to_string_lossy().to_string(),
    name,
    timeout_in_seconds: None,
```
Collaborator

Yeah you could probably leave it out -- the sub recipes themselves will be calling tools, and they'll have their own timeouts.

One thing we definitely don't want here is a default 5-minute timeout. IMO that doesn't make much sense for tool calls in general, but that's a separate issue.

```diff
 progress_bars.log(&formatted_message);
 }
-} else {
+} else if let Some(ref notification_type) = _notification_type {
```
Collaborator

probably should rename this without the underscore since it's used now


```rust
pub const SUB_RECIPE_TASK_TOOL_NAME_PREFIX: &str = "subrecipe__create_task";
const EXECUTION_MODE_PARALLEL: &str = "parallel";
const EXECUTION_MODE_SEQUENTIAL: &str = "sequential";
```
Collaborator

should this be an enum?

Collaborator Author

Good catch. I thought I had refactored this but missed it.
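
For illustration (a sketch, not code from this PR; the serde attributes and the default choice are assumptions), the two constants could be folded into an enum that still serializes to the same lowercase strings:

```rust
use serde::{Deserialize, Serialize};

/// Sketch of an enum replacing the two string constants; `rename_all`
/// keeps the same "parallel"/"sequential" values on the wire.
#[derive(Debug, Clone, Copy, PartialEq, Eq, Serialize, Deserialize, Default)]
#[serde(rename_all = "lowercase")]
pub enum ExecutionMode {
    Parallel,
    // Assumption: sequential is the default when no mode is specified.
    #[default]
    Sequential,
}
```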

```rust
const EXECUTION_MODE_SEQUENTIAL: &str = "sequential";

pub fn create_sub_recipe_task_tool(sub_recipe: &SubRecipe) -> Tool {
    let input_schema = get_input_schema(sub_recipe).unwrap();
```
Collaborator

I know it's not in this PR but this should probably handle an Err some other way than .unwrap()
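
One possible direction, sketched here with placeholder SubRecipe and Tool types rather than the project's real definitions, is to return a Result and propagate the schema error instead of unwrapping:

```rust
use anyhow::Result;

// Placeholder types standing in for the real ones in the codebase.
pub struct SubRecipe { pub name: String }
pub struct Tool { pub name: String, pub input_schema: serde_json::Value }

fn get_input_schema(sub_recipe: &SubRecipe) -> Result<serde_json::Value> {
    Ok(serde_json::json!({ "type": "object", "title": sub_recipe.name.clone() }))
}

// Propagate schema-generation failures to the caller instead of panicking,
// so a bad sub-recipe surfaces as a tool-creation error.
pub fn create_sub_recipe_task_tool(sub_recipe: &SubRecipe) -> Result<Tool> {
    let input_schema = get_input_schema(sub_recipe)?;
    Ok(Tool {
        name: format!("subrecipe__create_task_{}", sub_recipe.name),
        input_schema,
    })
}
```

Callers could then decide whether a failed schema means skipping that tool or surfacing the error to the user.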

Comment on lines 18 to 21
1. PRE-CREATED TASKS: If tasks were created by subrecipe__create_task_* tools, check the execution_mode in the response:
- If execution_mode is 'parallel', use parallel execution
- If execution_mode is 'sequential', use sequential execution
- Always respect the execution_mode from task creation to maintain consistency
Collaborator

IMO the model shouldn't pick how the executor runs a set of tasks. The model will always get the results back together, so it only impacts the implementation details of the tasks themselves. If the model decides it needs the result of one task to run the next, it will have to separate them anyway.

"description": "Execution strategy for multiple tasks. For pre-created tasks, respect the execution_mode from task creation. For user intent, use 'sequential' (default) unless user explicitly requests parallel execution with words like 'parallel', 'simultaneously', 'at the same time', or 'concurrently'."
},
"tasks": {
"type": "array",
Collaborator

from earlier discussion -- consider whether you can just have the model pass the ID and leave it at that, since the task has already been "created"

* main: (54 commits)
  UI update with sidebar and settings tabs (#3288)
  docs: add CLIStreamExtensionInstructions component (#3443)
  chore(release): release version 1.0.36 (#3436)
  [goose-llm] fix image content bug, add optional request_id field (#3439)
  fix: Set include_usage=true for OpenAI streaming (#3441)
  feat: `recipe list` (#2814) (#2815)
  docs: update github mcp config (#3433)
  feat: Implement streaming for OpenAI (#3413)
  fix: improve extension startup error messages with command details (#2694)
  [feat]: improve file search tools to add globsearch / grep tools (#3368)
  docs: typo in guide description (#3429)
  fix: use safe_truncate to truncate charactor (#3263) (#3264)
  fix: convert invalid recipe variable name to raw content (#3420)
  center goose mobile screenshot (#3418)
  docs: model context limit overrides (#3377)
  docs: Subagents (#3402)
  fix: avoid pass encoded empty string to goose run --recipe (#3361)
  ux: alphabetize extensions (#3416)
  fix: message concatenation in server session management (#3412)
  refactor: streamline memory directory management (#3345)
  ...
* main:
  fix: Fixes structured output after streaming change to agent (#3448)
@lifeizhou-ap lifeizhou-ap changed the title from "feat: run sub recipe multiple times in parallel" to "feat: run sub recipe multiple times in parallel (Experimental feature)" on Jul 16, 2025
@jamadeo jamadeo left a comment

It still seems like we're going down a pretty complex road here, moving away from how models would typically call tools, but I think that conclusion should really come from experimentation. As long as we're still calling this experimental, let's get it merged and keep trying things.

```rust
path: recipe_file_path.to_string_lossy().to_string(),
name,
values: None,
sequential_when_repeated: true,
```
Collaborator

This property name is a little confusing IMO, maybe use something like allow_concurrent?

Collaborator Author

Hi @jamadeo

Thank you for the review and your time!

I thought of allow_concurrent, and I feel it raises two concerns:

  1. allow_concurrent: false might confuse people into thinking the sub-recipe cannot run concurrently with other sub-recipes. The ...when_repeated suffix makes it clear that it only prevents parallelism when the same sub-recipe is run repeatedly.

  2. By default the sub-recipe can be run in parallel, but when a boolean attribute is not specified, people usually assume its default is false. If allow_concurrent were used, an unspecified value might be read as allow_concurrent: false, which contradicts the actual default.

Naming is usually hard. Happy to change to a better name.

@lifeizhou-ap lifeizhou-ap merged commit e5a55db into main Jul 16, 2025
8 checks passed
@lifeizhou-ap lifeizhou-ap deleted the lifei/run-sub-recipe-multiple-times branch July 16, 2025 22:39
lifeizhou-ap added a commit that referenced this pull request Jul 17, 2025
* main:
  feat(gcpvertexai): do HTTP 429 like retries for Anthropic API HTTP 529 overloaded status code (#3026)
  Fix a few ui edge cases - refresh occasionally crashing, chat loader over text and chat input height returning to auto (#3469)
  Don't default to main for build-cli (#3467)
  docs: add MongoDB MCP server tutorial (#2660)
  feat: run sub recipe multiple times in parallel (Experimental feature) (#3274)
  chore(release): release version 1.1.0 (#3465)
  chore: implement streaming for anthropic.rs firstparty provider (#3419)
  Fix regression: add back detail to tool-call banners (#3231)
  Document release process and update some just recipes (#3460)
  feat: add download_cli.ps1 file for windows (#3354)
  fix: session_file is optional (#3462)
  Bump more space for goose is working on it so it doesnt overlap incoming agent chat messages (#3453)
  Align chat input action buttons to bottom when large amount of text (#3455)
  docs: add Cloudflare MCP Server tutorial (#3278)
  feat(cli): Clear persisted session file with /clear command (#3145)
@lifeizhou-ap lifeizhou-ap changed the title from "feat: run sub recipe multiple times in parallel (Experimental feature)" to "feat: run sub recipe multiple times in parallel (Experimental feature in CLI)" on Jul 17, 2025
s-soroosh pushed a commit to s-soroosh/goose that referenced this pull request Jul 18, 2025
kwsantiago pushed a commit to kwsantiago/goose that referenced this pull request Jul 19, 2025
cbruyndoncx pushed a commit to cbruyndoncx/goose that referenced this pull request Jul 20, 2025
atarantino pushed a commit to atarantino/goose that referenced this pull request Aug 5, 2025