chore: cover code mode with end to end provider tests #6183

michaelneale · 2025-12-19T00:25:44Z

Attempting to cover the new code mode with same tests for providers. Should work the same as stock developer extension

Copilot

Pull request overview

This PR extends the provider test script to support testing the new code execution mode alongside the existing developer extension mode. The test can now be run in two modes: normal mode (direct tool calls) and code execution mode (JS batching), allowing verification that code execution mode works equivalently to the stock developer extension across all providers.

Adds --code-exec flag to enable code execution mode testing
Configures mode-specific settings for builtins and success patterns
Parameterizes test assertions to work with both modes

Copilot · 2025-12-19T00:28:22Z

scripts/test_providers.sh

+  case $arg in
+    --code-exec)
+      CODE_EXEC_MODE=true
+      shift


The shift command has no effect here because the loop iterates over "$@" directly, not positional parameters. Remove this line as it serves no purpose in this context.

Suggested change

shift

Copilot · 2025-12-19T00:28:23Z

scripts/test_providers.sh

+if [ "$CODE_EXEC_MODE" = true ]; then
+  echo "Mode: code_execution (JS batching)"
+  BUILTINS="developer,code_execution"
+  SUCCESS_PATTERN="execute_code | code_execution|read_module | code_execution"


The regex pattern mixes literal pipes (with spaces) and regex OR operator (pipe without spaces), making it unclear. Consider using explicit grouping like "(execute_code | code_execution)|(read_module | code_execution)" to make the intent clearer, or escape the literal pipes if they should be matched literally.

Suggested change

SUCCESS_PATTERN="execute_code | code_execution|read_module | code_execution"

SUCCESS_PATTERN="(execute_code | code_execution)|(read_module | code_execution)"

Copilot

Pull request overview

Copilot reviewed 2 out of 2 changed files in this pull request and generated 1 comment.

Copilot · 2025-12-19T01:05:12Z

scripts/test_providers.sh

+  case $arg in
+    --code-exec)
+      CODE_EXEC_MODE=true
+      ;;


The shift command inside the loop will cause the loop to skip arguments. Since you're iterating over "$@" with "for arg in", the shift modifies the positional parameters but doesn't affect the loop iteration. Remove the shift command as it serves no purpose here and could cause unexpected behavior if additional arguments are added in the future.

Suggested change

;;

codefromthecrypt

ah so the main change here is we are still doing "list the files...", just that we are adding code execution to do it which forces the other extensions inside. Same test

alexhancock · 2025-12-19T13:29:38Z

Nice work @michaelneale !

cover code mode with end to end provider tests

7da64ae

michaelneale requested review from codefromthecrypt, Copilot and tlongwell-block December 19, 2025 00:25

michaelneale assigned alexhancock Dec 19, 2025

michaelneale requested a review from alexhancock December 19, 2025 00:26

Copilot started reviewing on behalf of michaelneale December 19, 2025 00:26 View session

Copilot AI reviewed Dec 19, 2025

View reviewed changes

michaelneale added 2 commits December 19, 2025 11:59

run code mode as well

b52e948

tidy up parallel

4639b35

Copilot AI review requested due to automatic review settings December 19, 2025 01:02

Copilot started reviewing on behalf of michaelneale December 19, 2025 01:03 View session

pattern fixes

d5aac03

Copilot AI reviewed Dec 19, 2025

View reviewed changes

codefromthecrypt approved these changes Dec 19, 2025

View reviewed changes

codefromthecrypt merged commit d481404 into main Dec 19, 2025
20 checks passed

codefromthecrypt deleted the micn/test-code-mode branch December 19, 2025 04:02

github-actions bot mentioned this pull request Dec 19, 2025

chore(release): release version 1.18.0 (minor) #6194

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

chore: cover code mode with end to end provider tests #6183

chore: cover code mode with end to end provider tests #6183

Uh oh!

michaelneale commented Dec 19, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Dec 19, 2025

Uh oh!

Copilot AI Dec 19, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Dec 19, 2025

Uh oh!

codefromthecrypt left a comment

Uh oh!

Uh oh!

alexhancock commented Dec 19, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

	SUCCESS_PATTERN="execute_code \| code_execution\|read_module \| code_execution"
	SUCCESS_PATTERN="(execute_code \| code_execution)\|(read_module \| code_execution)"

chore: cover code mode with end to end provider tests #6183

chore: cover code mode with end to end provider tests #6183

Uh oh!

Conversation

michaelneale commented Dec 19, 2025

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Copilot AI Dec 19, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Dec 19, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Copilot AI Dec 19, 2025

Choose a reason for hiding this comment

Uh oh!

codefromthecrypt left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

alexhancock commented Dec 19, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants