
Conversation

@cgwalters
Contributor

@cgwalters cgwalters commented Aug 6, 2025

I was working on a previous PR with much larger output sizes, and I think I was seeing the last chunk of output sometimes being lost.

That said, I asked Gemini to try to reproduce the failure with a test case, and I couldn't get it to fail offhand. Looking at the original code here, it looks OK...

Async Rust is hard; tokio::select! in particular can cause subtle bugs. For a lot more on this see
https://blog.yoshuawuyts.com/futures-concurrency-3 and various previous and future blog entries.

Anyway, I think this code is cleaner: it deduplicates the code that sends the notification message.

@michaelneale
Collaborator

michaelneale commented Aug 7, 2025

@cgwalters very interesting - I know from past experience that stderr/stdout handling for shell commands can be tricky across platforms, and when running bundled in the signed Electron app on macOS - are you able to verify it works well in that scenario as well? (I lost a day or two to that once; there are different ways to spawn.) How can we validate that?

edit: seems fine to me - it doesn't change how the streams are started.

@michaelneale michaelneale added p2 Priority 2 - Medium performance Performance related labels Aug 7, 2025
@michaelneale michaelneale self-assigned this Aug 7, 2025
@michaelneale michaelneale requested a review from DOsinga August 7, 2025 04:18
@michaelneale
Collaborator

I do like this and it does look better. I am sure there are latent bugs in the old code here, so I think this one is worth a real close look - nice one @cgwalters. How has it been performing for you?

@michaelneale
Collaborator

michaelneale commented Aug 7, 2025

This seems to work really well. It's hard to know exactly what to look for, but so far so good, and it's much, much better than the old code (and I think the dependency was already pulled in from another crate, so there's no real extra cost). This could wipe out a few tricky bugs - and yes, I think the old way could lose the end of a stream if it ends without a newline. @DOsinga I think this may be worth getting in ASAP.

very nice @cgwalters

edit: if goose is right about this, it's a big deal:

1. Deadlock Scenario with tokio::select!

The old code could hang if one stream closes while the other still has data:

// Old code (simplified)
loop {
    tokio::select! {
        n = stdout_reader.read_until(b'\n', &mut stdout_buf), if !stdout_done => {
            if n? == 0 {
                stdout_done = true; // EOF on stdout; disables this branch
            }
            // ... process stdout_buf
        }

        n = stderr_reader.read_until(b'\n', &mut stderr_buf), if !stderr_done => {
            if n? == 0 {
                stderr_done = true; // EOF on stderr; disables this branch
            }
            // ... process stderr_buf
        }

        // Taken only once both branches are disabled.
        else => break,
    }
}

@michaelneale michaelneale added p0 Priority 0 - Critical/Urgent and removed p2 Priority 2 - Medium labels Aug 7, 2025
@cgwalters cgwalters force-pushed the developer-shell-tokio-cleanup branch from ee7b82c to 032f920 Compare August 7, 2025 13:31
@cgwalters
Contributor Author

and yes I think the old way could lose the end of a stream if there is no newline and it ends.

You're right! I asked AI to generate a test case based on that and indeed it fails with the old code and passes with the new.

Collaborator

@michaelneale michaelneale left a comment


Yeah, I have given this a good run and it seems solid.

@michaelneale michaelneale merged commit 719b569 into block:main Aug 8, 2025
10 checks passed
katzdave added a commit that referenced this pull request Aug 8, 2025
* 'main' of github.com:block/goose:
  remove fallback routing to hub/home for unknown routes (#3954)
  Use cross in linux bundle workflow (#3950)
  fix: disable signing for release branches until we figure out keys for this flow (#3951)
  Sanitize Tags Unicode Block (#3920)
  Add a message about DCO to CONTRIBUTING.md (#3741)
  Move hardcoded LLM prompts to template files (#3934)
  docs: migrate streamable config to consolidated component (#3936)
  feat: streamline list args on cli (#3937)
  mcp/developer: Refactor to use tokio SplitStream (#3894)
  feat: first time automated ollama install experience and openrouter (#3881)
  chore: rmcp 0.5.0 (#3935)
  add gpt-5 to openai provider format (#3924)
  added gpt5 context limit (#3927)
  show status of osx codesigning and increase timeout (#3926)
  Bump auto-compact threshold to 80% (#3925)
  FIX: gemini tool call hanging (#3898)
  feat(deps): upgrade rmcp to 0.4.1 (#3918)
  Fix dark mode rendering of config form and centered providers grid for wider screens. (#3837)
  fix: extension list not refreshing after installing from deeplink (#3878)
ayax79 pushed a commit to ayax79/goose that referenced this pull request Aug 21, 2025