Managed OpenAI Inference + Native Web Search#26251
Conversation
…ve web_search tool (#26212)
…26218) * [macOS] Allow managed inference provider selection beyond Anthropic * fix: capture draftProvider before async Task to prevent race condition Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
…der supports it (#26230) * [macOS] Allow provider-native web search when managed inference provider supports it * fix: gate mode-specific web search invalidation on modeChanging to prevent false-positive alerts Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: generalize alert message to cover both mode and provider changes * fix: scope provider-native invalidation to your-own web search mode --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Review (Worktree + Targeted Validation)Pulled this PR into a dedicated worktree and reviewed all touched files end-to-end (backend + macOS settings + tests). Findings1) [P1] Persisting unpaired
|
…ry corruption The OpenAI Responses provider emitted server_tool_use content blocks for web_search_call items but did not emit matching web_search_tool_result blocks. repairHistory() treats any unpaired server_tool_use as an interrupted search and injects a synthetic web_search_tool_result_error, which corrupts conversation history by making successful searches appear as failures. After each server_tool_use block, also emit a paired web_search_tool_result with empty content (since OpenAI weaves search results into the text output). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
| managed: true, | ||
| proxyPath: "/v1/runtime-proxy/openai", |
There was a problem hiding this comment.
🚩 OpenAI managed proxy requires companion platform-side route support
The new proxy path /v1/runtime-proxy/openai at assistant/src/providers/managed-proxy/constants.ts:32 requires the companion vellum-assistant-platform repo to have a corresponding proxy route that forwards OpenAI Responses API requests. The AGENTS.md mentions checking the sibling repo for compatibility when making HTTP API or container changes. The OpenAI SDK will construct URLs like {baseURL}/responses for the Responses API stream endpoint, so the platform proxy needs to handle this path correctly.
Was this helpful? React with 👍 or 👎 to provide feedback.
… llm.default API The tests were added in #26251 against the old `setInferenceProvider` / `services.inference.provider` API. #26159 merged afterward renamed that API to `setLLMDefaultProvider` and moved the config path to `llm.default.provider`, leaving the tests unable to compile. Rename the calls and update the patch assertions to match the new shape.
… llm.default API (#26287) The tests were added in #26251 against the old `setInferenceProvider` / `services.inference.provider` API. #26159 merged afterward renamed that API to `setLLMDefaultProvider` and moved the config path to `llm.default.provider`, leaving the tests unable to compile. Rename the calls and update the patch assertions to match the new shape.
Summary
Enable OpenAI as a first-class managed inference provider and route
inference-provider-nativeweb search through OpenAI's native Responses web search tool. Remove Anthropic-only assumptions in macOS settings so users can choose OpenAI in managed mode.Self-review result
PASS after 3 rounds of review+remediation (4 fix PRs addressing integration gaps)
PRs merged into feature branch
Fix PRs
Part of plan: managed-openai-native-web-search.md