Refactor Bifrost for plugin support and multi-provider architecture along with tests by danpiths · Pull Request #1 · maximhq/bifrost

danpiths · 2025-03-21T04:20:40Z

No description provided.

danpiths · 2025-03-21T04:20:59Z

Refactor Bifrost for plugin support and multi-provider architecture along with tests #1 : 2 dependent PRs (#2 , #9 ) 👈 (View in Graphite)
main

This stack of pull requests is managed by Graphite. Learn more about stacking.

danpiths

why did we remove omitempty for all optional fields @Pratham-Mishra04 ?

danpiths · 2025-03-22T09:22:44Z

also can we add proper descriptions with each PR that lists the changes made and why did we do it

CodeRabbit round 2: 10. /models query-param fallback now checks for existing client_version before injecting. Preserves caller-supplied params like foo=bar and picks up caller's own client_version when present. Uses simple string split (no net/url allocation) since paths are well-formed. 11. chatGPTOAuthWebSocketHeaders now routes through mergeHeadersCaseInsensitive so caller-supplied headers with different casing (OPENAI-BETA, Openai-Beta, etc.) can't cause duplicate-case headers reaching the wire — OAuth values and the Authorization bearer always win deterministically. Greptile findings: P1 maximhq#1 Non-streaming Responses() path: return a clear BifrostOperationError pointing callers to ResponsesStream when chatgpt_oauth is on. The ChatGPT backend only accepts stream=true (matching the openai-oauth reference proxy behavior), so non-streaming cannot work via Bifrost without server-side SSE aggregation which is out of scope for this PR. In practice Codex always streams. P1 maximhq#2 /models response format: OpenAIListModelsResponse now has a UnmarshalJSON that accepts BOTH the standard OpenAI shape ({"data":[{id,...}]}) and the ChatGPT backend shape ({"models":[{slug}]}). ChatGPT entries are projected into OpenAIModel with ID=slug, Object="model", OwnedBy="chatgpt-oauth". P2 maximhq#3 client_version doc comment: corrected — the fallback only applies when the inbound path reaches chatGPTOAuthPath without a caller-supplied client_version (previously the doc implied forwarding logic that didn't exist; it now does). P2 maximhq#4 store doc comment: corrected from "sets store to false if not already present" to "forces store to false" to match the code and explanation. P2 maximhq#5 Double JWT decode per Responses/ResponsesStream: fixed by passing raw networkConfig.ExtraHeaders to chatGPTOAuthApplyRequest instead of the already-merged effectiveExtraHeaders(key). The merge now happens exactly once per call. Test coverage: 100% on every function in chatgpt_oauth.go. Added tests for query-param preservation, models dual-shape parsing, and the streaming-only sentinel error. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

…output tool Bedrock Converse rejects toolConfig.toolChoice.tool on Meta Llama variants with HTTP 400 ("This model doesn't support the toolConfig.toolChoice.tool field. Remove toolConfig.toolChoice.tool and try again."). The synthetic bf_so_* tool path used for structured output unconditionally pinned the synthetic tool via toolChoice.tool, which broke every with_structured_output caller against Llama 4 Maverick / Scout / Llama 3.x on Bedrock. This patch adds an IsLlamaModel helper next to the existing IsNovaModel / IsAnthropicModel family and gates the forced-tool emission off it in four places (covering both API surfaces and both the synthetic and explicit tool_choice paths on each): 1. core/providers/bedrock/utils.go: ChatCompletions synthetic-tool injection (response_format=json_schema -> bf_so_* tool path). 2. core/providers/bedrock/responses.go: OpenAI Responses API mirror of maximhq#1 (text.format=json_schema -> bf_so_* tool path). 3. core/providers/bedrock/utils.go: convertToolConfigFromFiltered's explicit tool_choice path (defense-in-depth for ChatCompletions callers using bind_tools(tool_choice={"type": "function", "function": {"name": "X"}}) directly). 4. core/providers/bedrock/responses.go: Responses API mirror of maximhq#3 — the explicit tool_choice path runs convertResponsesToolChoice and writes BedrockToolChoice{Tool: ...} straight onto the request, which trips the same Llama 400 without the gate. With one synthetic tool bound and Bedrock's default "auto" behavior, omitting tool_choice yields the same outcome on Llama because there's exactly one tool the model can call. This mirrors the gate langchain-aws ships (supports_tool_choice_values for llama4 / llama3-1 / llama3-3 sets only "auto", causing with_structured_output to emit no tool_choice at all). Tests added (mirroring PR maximhq#3184's TestBedrockAnthropicChatStructuredOutputUsesSyntheticTool naming and structure): - TestBedrockLlamaChatStructuredOutputOmitsForcedToolChoice — synthetic tool present, ToolChoice nil for Llama on the ChatCompletions path. - TestBedrockNonLlamaChatStructuredOutputForcesToolChoice — regression guard: Nova / Anthropic still get the forced tool_choice (gate doesn't over-fire). - TestToBedrockResponsesRequest_LlamaStructuredOutputOmitsForcedToolChoice — same negative+positive on the Responses API synthetic-tool path. - TestBedrockLlamaConvertToolConfigOmitsForcedToolChoice — defense-in-depth coverage for ChatCompletions explicit tool_choice struct callers. - TestToBedrockResponsesRequest_LlamaConvertResponsesToolChoiceOmitsForcedToolChoice — defense-in-depth coverage for Responses API explicit tool_choice (function-typed) struct callers — the gap surfaced in code review. - TestToBedrockResponsesRequest_NonLlamaConvertResponsesToolChoiceForcesToolChoice — Anthropic regression guard for maximhq#5 so the Responses-side Llama gate doesn't over-fire. Refs: - https://docs.aws.amazon.com/bedrock/latest/APIReference/API_runtime_ToolChoice.html - https://github.com/langchain-ai/langchain-aws (bedrock_converse.py supports_tool_choice_values family-detection lines 678-711). - PR maximhq#3184 (Anthropic structured-output fallback to synthetic tool path) — immediate predecessor; this PR completes the per-family gating story.

…output tool (#3196) Bedrock Converse rejects toolConfig.toolChoice.tool on Meta Llama variants with HTTP 400 ("This model doesn't support the toolConfig.toolChoice.tool field. Remove toolConfig.toolChoice.tool and try again."). The synthetic bf_so_* tool path used for structured output unconditionally pinned the synthetic tool via toolChoice.tool, which broke every with_structured_output caller against Llama 4 Maverick / Scout / Llama 3.x on Bedrock. This patch adds an IsLlamaModel helper next to the existing IsNovaModel / IsAnthropicModel family and gates the forced-tool emission off it in four places (covering both API surfaces and both the synthetic and explicit tool_choice paths on each): 1. core/providers/bedrock/utils.go: ChatCompletions synthetic-tool injection (response_format=json_schema -> bf_so_* tool path). 2. core/providers/bedrock/responses.go: OpenAI Responses API mirror of #1 (text.format=json_schema -> bf_so_* tool path). 3. core/providers/bedrock/utils.go: convertToolConfigFromFiltered's explicit tool_choice path (defense-in-depth for ChatCompletions callers using bind_tools(tool_choice={"type": "function", "function": {"name": "X"}}) directly). 4. core/providers/bedrock/responses.go: Responses API mirror of #3 — the explicit tool_choice path runs convertResponsesToolChoice and writes BedrockToolChoice{Tool: ...} straight onto the request, which trips the same Llama 400 without the gate. With one synthetic tool bound and Bedrock's default "auto" behavior, omitting tool_choice yields the same outcome on Llama because there's exactly one tool the model can call. This mirrors the gate langchain-aws ships (supports_tool_choice_values for llama4 / llama3-1 / llama3-3 sets only "auto", causing with_structured_output to emit no tool_choice at all). Tests added (mirroring PR #3184's TestBedrockAnthropicChatStructuredOutputUsesSyntheticTool naming and structure): - TestBedrockLlamaChatStructuredOutputOmitsForcedToolChoice — synthetic tool present, ToolChoice nil for Llama on the ChatCompletions path. - TestBedrockNonLlamaChatStructuredOutputForcesToolChoice — regression guard: Nova / Anthropic still get the forced tool_choice (gate doesn't over-fire). - TestToBedrockResponsesRequest_LlamaStructuredOutputOmitsForcedToolChoice — same negative+positive on the Responses API synthetic-tool path. - TestBedrockLlamaConvertToolConfigOmitsForcedToolChoice — defense-in-depth coverage for ChatCompletions explicit tool_choice struct callers. - TestToBedrockResponsesRequest_LlamaConvertResponsesToolChoiceOmitsForcedToolChoice — defense-in-depth coverage for Responses API explicit tool_choice (function-typed) struct callers — the gap surfaced in code review. - TestToBedrockResponsesRequest_NonLlamaConvertResponsesToolChoiceForcesToolChoice — Anthropic regression guard for #5 so the Responses-side Llama gate doesn't over-fire. Refs: - https://docs.aws.amazon.com/bedrock/latest/APIReference/API_runtime_ToolChoice.html - https://github.com/langchain-ai/langchain-aws (bedrock_converse.py supports_tool_choice_values family-detection lines 678-711). - PR #3184 (Anthropic structured-output fallback to synthetic tool path) — immediate predecessor; this PR completes the per-family gating story.

danpiths requested a review from akshaydeo March 21, 2025 04:20

Pratham-Mishra04 force-pushed the feature/03-21-base_structure_fixes branch from 50ad314 to 40ce794 Compare March 21, 2025 06:28

This was referenced Mar 21, 2025

feat: plugin interface and tests added #2

Closed

feat: plugin context added #3

Closed

danpiths commented Mar 22, 2025

View reviewed changes

Comment thread interfaces/provider.go Outdated

danpiths commented Mar 22, 2025

View reviewed changes

Comment thread interfaces/provider.go Outdated

danpiths commented Mar 22, 2025

View reviewed changes

Comment thread providers/anthropic.go Outdated

Pratham-Mishra04 mentioned this pull request Mar 22, 2025

fix: bifrost context fixes #4

Closed

This was referenced Mar 31, 2025

feat: configs added for providers #5

Closed

feat: bedrock provider added #6

Closed

feat: cohere provider added #7

Closed

feat: provider and interface fixes #8

Closed

Pratham-Mishra04 force-pushed the feature/03-21-base_structure_fixes branch from 40ce794 to a54ac5d Compare March 31, 2025 07:57

Pratham-Mishra04 changed the title ~~base structure fixes~~ Refactor Bifrost for plugin support and multi-provider architecture along with tests Mar 31, 2025

Pratham-Mishra04 mentioned this pull request Mar 31, 2025

feat: image inputs added #9

Closed