feat: plugin context added by Pratham-Mishra04 · Pull Request #3 · maximhq/bifrost

Pratham-Mishra04 · 2025-03-21T19:25:36Z

No description provided.

Pratham-Mishra04 · 2025-03-21T19:26:02Z

Warning

This pull request is not mergeable via GitHub because a downstack PR is open. Once all requirements are satisfied, merge this PR as a stack on Graphite.
Learn more

feat: provider and interface fixes #8
feat: cohere provider added #7
feat: bedrock provider added #6
feat: configs added for providers #5
fix: bifrost context fixes #4
feat: plugin context added #3 👈 (View in Graphite)
feat: plugin interface and tests added #2
Refactor Bifrost for plugin support and multi-provider architecture along with tests #1 : 1 other dependent PR (#9 )
main

This stack of pull requests is managed by Graphite. Learn more about stacking.

danpiths · 2025-03-22T09:28:09Z

 type Plugin interface {
-	PreHook(req *BifrostRequest) (*BifrostRequest, error)
-	PostHook(result *CompletionResult) (*CompletionResult, error)
+	PreHook(ctx *context.Context, req *BifrostRequest) (*BifrostRequest, error)


let us pass in the context object and not a pointer to the context for both the pre and post hook

danpiths · 2025-03-22T09:29:42Z

-
-		req, err = plugin.PreHook(req)
+	// Create a context with timeout same as the provider/request config
+	ctx, cancel := context.WithTimeout(context.Background(), 30*time.Second)


we will not create our own context, we shall ask users to pass in the context from the chat/text completion functions. keep it as the first param there. they would pass in their context to the completion functions that we shall use with in the plugin system

danpiths

the comments

CodeRabbit round 2: 10. /models query-param fallback now checks for existing client_version before injecting. Preserves caller-supplied params like foo=bar and picks up caller's own client_version when present. Uses simple string split (no net/url allocation) since paths are well-formed. 11. chatGPTOAuthWebSocketHeaders now routes through mergeHeadersCaseInsensitive so caller-supplied headers with different casing (OPENAI-BETA, Openai-Beta, etc.) can't cause duplicate-case headers reaching the wire — OAuth values and the Authorization bearer always win deterministically. Greptile findings: P1 maximhq#1 Non-streaming Responses() path: return a clear BifrostOperationError pointing callers to ResponsesStream when chatgpt_oauth is on. The ChatGPT backend only accepts stream=true (matching the openai-oauth reference proxy behavior), so non-streaming cannot work via Bifrost without server-side SSE aggregation which is out of scope for this PR. In practice Codex always streams. P1 maximhq#2 /models response format: OpenAIListModelsResponse now has a UnmarshalJSON that accepts BOTH the standard OpenAI shape ({"data":[{id,...}]}) and the ChatGPT backend shape ({"models":[{slug}]}). ChatGPT entries are projected into OpenAIModel with ID=slug, Object="model", OwnedBy="chatgpt-oauth". P2 maximhq#3 client_version doc comment: corrected — the fallback only applies when the inbound path reaches chatGPTOAuthPath without a caller-supplied client_version (previously the doc implied forwarding logic that didn't exist; it now does). P2 maximhq#4 store doc comment: corrected from "sets store to false if not already present" to "forces store to false" to match the code and explanation. P2 maximhq#5 Double JWT decode per Responses/ResponsesStream: fixed by passing raw networkConfig.ExtraHeaders to chatGPTOAuthApplyRequest instead of the already-merged effectiveExtraHeaders(key). The merge now happens exactly once per call. Test coverage: 100% on every function in chatgpt_oauth.go. Added tests for query-param preservation, models dual-shape parsing, and the streaming-only sentinel error. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

/speckit-clarify session recording 4 integrated decisions. All mechanical consequences applied inline. Q1 — US14 Prompt Library: **Split.** Basic prompt library (templates, variables, folders, versioning) is IN SCOPE, already satisfied by the re-enabled upstream `plugins/prompts/` in Phase 1 T012. Deployment-strategies subset (A/B, canary, production pointer) is DESCOPED — new extension of the prompts plugin, needs its own spec. SR-01 table gets a US14-deployments row in the out-of-scope column. FR-029/030/031 marked ALREADY_SHIPPED; FR-029 note covers the deployment-strategy split. US14 story header and CI whitelist already align. Q2 — Audit actor attribution in v1: **Synthetic "upstream-admin".** No session→user middleware yet. All admin actions reach handlers via the single upstream basic-auth credential, so audit entries carry `actor_type="system"`, `actor_id="upstream-admin"` — self-documenting, matches reality. `handlers/rbac_audit.go` updated; FR-010 annotated. When proper middleware lands in a future spec, `emitAudit` callers need no change. Verified live on :8088 — latest role.delete entry shows the new attribution. Q3 — v1 RBAC server-side enforcement: **UI-only.** `useRbac` hides UI controls; no server-side scope check today. Adding a server enforcement layer is net-new with no upstream precedent (same SR-01 class as admin API keys) and belongs in its own spec. FR-003 and FR-004 annotated; FR-004 resource list corrected from "29" to "24" (matching `framework/tenancy/scopes.go`). Limitation already documented in `docs/enterprise/rbac.mdx` §"Notes & limits". Q4 — Audit store location: **Configstore** (not logstore). Rows are semantically closer to `ent_roles`/`ent_users`; single SQLite file simpler for self-hosted deployments. Plugin doc comment in `plugins/audit/main.go` updated to match shipped reality; `docs/enterprise/audit-logs.mdx` already said "configuration database" (no change). FR-012 annotated, including the explicit note that automatic retention cleanup is out of v1 scope — rows accumulate until cleanup SQL is run or a retention task ships. Session used 4 of 5 questions; remaining potential items are stylistic and don't change architecture or test design. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

/speckit-clarify session recording 4 integrated decisions. All mechanical consequences applied inline. Q1 — US14 Prompt Library: **Split.** Basic prompt library (templates, variables, folders, versioning) is IN SCOPE, already satisfied by the re-enabled upstream `plugins/prompts/` in Phase 1 T012. Deployment-strategies subset (A/B, canary, production pointer) is DESCOPED — new extension of the prompts plugin, needs its own spec. SR-01 table gets a US14-deployments row in the out-of-scope column. FR-029/030/031 marked ALREADY_SHIPPED; FR-029 note covers the deployment-strategy split. US14 story header and CI whitelist already align. Q2 — Audit actor attribution in v1: **Synthetic "upstream-admin".** No session→user middleware yet. All admin actions reach handlers via the single upstream basic-auth credential, so audit entries carry `actor_type="system"`, `actor_id="upstream-admin"` — self-documenting, matches reality. `handlers/rbac_audit.go` updated; FR-010 annotated. When proper middleware lands in a future spec, `emitAudit` callers need no change. Verified live on :8088 — latest role.delete entry shows the new attribution. Q3 — v1 RBAC server-side enforcement: **UI-only.** `useRbac` hides UI controls; no server-side scope check today. Adding a server enforcement layer is net-new with no upstream precedent (same SR-01 class as admin API keys) and belongs in its own spec. FR-003 and FR-004 annotated; FR-004 resource list corrected from "29" to "24" (matching `framework/tenancy/scopes.go`). Limitation already documented in `docs/enterprise/rbac.mdx` §"Notes & limits". Q4 — Audit store location: **Configstore** (not logstore). Rows are semantically closer to `ent_roles`/`ent_users`; single SQLite file simpler for self-hosted deployments. Plugin doc comment in `plugins/audit/main.go` updated to match shipped reality; `docs/enterprise/audit-logs.mdx` already said "configuration database" (no change). FR-012 annotated, including the explicit note that automatic retention cleanup is out of v1 scope — rows accumulate until cleanup SQL is run or a retention task ships. Session used 4 of 5 questions; remaining potential items are stylistic and don't change architecture or test design.

CodeRabbit round 2: 10. /models query-param fallback now checks for existing client_version before injecting. Preserves caller-supplied params like foo=bar and picks up caller's own client_version when present. Uses simple string split (no net/url allocation) since paths are well-formed. 11. chatGPTOAuthWebSocketHeaders now routes through mergeHeadersCaseInsensitive so caller-supplied headers with different casing (OPENAI-BETA, Openai-Beta, etc.) can't cause duplicate-case headers reaching the wire — OAuth values and the Authorization bearer always win deterministically. Greptile findings: P1 maximhq#1 Non-streaming Responses() path: return a clear BifrostOperationError pointing callers to ResponsesStream when chatgpt_oauth is on. The ChatGPT backend only accepts stream=true (matching the openai-oauth reference proxy behavior), so non-streaming cannot work via Bifrost without server-side SSE aggregation which is out of scope for this PR. In practice Codex always streams. P1 maximhq#2 /models response format: OpenAIListModelsResponse now has a UnmarshalJSON that accepts BOTH the standard OpenAI shape ({"data":[{id,...}]}) and the ChatGPT backend shape ({"models":[{slug}]}). ChatGPT entries are projected into OpenAIModel with ID=slug, Object="model", OwnedBy="chatgpt-oauth". P2 maximhq#3 client_version doc comment: corrected — the fallback only applies when the inbound path reaches chatGPTOAuthPath without a caller-supplied client_version (previously the doc implied forwarding logic that didn't exist; it now does). P2 maximhq#4 store doc comment: corrected from "sets store to false if not already present" to "forces store to false" to match the code and explanation. P2 maximhq#5 Double JWT decode per Responses/ResponsesStream: fixed by passing raw networkConfig.ExtraHeaders to chatGPTOAuthApplyRequest instead of the already-merged effectiveExtraHeaders(key). The merge now happens exactly once per call. Test coverage: 100% on every function in chatgpt_oauth.go. Added tests for query-param preservation, models dual-shape parsing, and the streaming-only sentinel error. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

… rules instead of halting (#3082) ## Summary Fixes a bug where a routing rule chain would incorrectly halt when a `chain_rule=true` rule resolved to the same provider/model as the current context (a self-loop). Previously, this was treated as a cycle and stopped chain evaluation immediately. Now, each rule is tracked by its ID rather than by the resulting provider/model state, allowing a self-looping rule to fire once and then be skipped so subsequent rules in the chain can continue to evaluate. ## Changes - Replaced provider/model state-based cycle detection (`visited` map keyed on `provider|model`) with rule ID-based tracking (`visitedRuleIDs` map keyed on `rule.ID`). Each rule may now fire at most once per chain evaluation, regardless of whether its target matches the current state. - A self-looping chain rule (one that resolves to the same provider/model it was invoked with) now fires once and is then skipped, allowing lower-priority rules to match on subsequent chain steps. - Updated termination condition #3 in the `EvaluateRoutingRules` documentation to reflect that the chain stops when all candidate chain-rules have already fired, rather than when a previously visited provider/model state is encountered. - Renamed `TestEvaluateRoutingRules_ConvergenceStopsChain` to `TestEvaluateRoutingRules_SelfLoopContinuesToNextRule` and updated it to assert that a self-looping rule correctly yields to the next matching rule. - Added `TestEvaluateRoutingRules_SelfLoopAloneTerminates` to verify that a self-looping chain rule with no other rules terminates cleanly via the "no remaining rule matches" condition. ## Type of change - [x] Bug fix - [ ] Feature - [ ] Refactor - [ ] Documentation - [ ] Chore/CI ## Affected areas - [ ] Core (Go) - [ ] Transports (HTTP) - [ ] Providers/Integrations - [x] Plugins - [ ] UI (React) - [ ] Docs ## How to test ```sh go test ./plugins/governance/... -run TestEvaluateRoutingRules_SelfLoop go test ./plugins/governance/... ``` Expected: `TestEvaluateRoutingRules_SelfLoopContinuesToNextRule` passes, asserting the chain resolves to `anthropic/claude-3` after the self-looping rule fires once. `TestEvaluateRoutingRules_SelfLoopAloneTerminates` passes, asserting the chain resolves to `openai/gpt-4o` and terminates cleanly. ## Breaking changes - [ ] Yes - [x] No ## Related issues ## Security considerations None. ## Checklist - [ ] I read `docs/contributing/README.md` and followed the guidelines - [x] I added/updated tests where appropriate - [ ] I updated documentation where needed - [x] I verified builds succeed (Go and UI) - [ ] I verified the CI pipeline passes locally if applicable

…output tool Bedrock Converse rejects toolConfig.toolChoice.tool on Meta Llama variants with HTTP 400 ("This model doesn't support the toolConfig.toolChoice.tool field. Remove toolConfig.toolChoice.tool and try again."). The synthetic bf_so_* tool path used for structured output unconditionally pinned the synthetic tool via toolChoice.tool, which broke every with_structured_output caller against Llama 4 Maverick / Scout / Llama 3.x on Bedrock. This patch adds an IsLlamaModel helper next to the existing IsNovaModel / IsAnthropicModel family and gates the forced-tool emission off it in four places (covering both API surfaces and both the synthetic and explicit tool_choice paths on each): 1. core/providers/bedrock/utils.go: ChatCompletions synthetic-tool injection (response_format=json_schema -> bf_so_* tool path). 2. core/providers/bedrock/responses.go: OpenAI Responses API mirror of maximhq#1 (text.format=json_schema -> bf_so_* tool path). 3. core/providers/bedrock/utils.go: convertToolConfigFromFiltered's explicit tool_choice path (defense-in-depth for ChatCompletions callers using bind_tools(tool_choice={"type": "function", "function": {"name": "X"}}) directly). 4. core/providers/bedrock/responses.go: Responses API mirror of maximhq#3 — the explicit tool_choice path runs convertResponsesToolChoice and writes BedrockToolChoice{Tool: ...} straight onto the request, which trips the same Llama 400 without the gate. With one synthetic tool bound and Bedrock's default "auto" behavior, omitting tool_choice yields the same outcome on Llama because there's exactly one tool the model can call. This mirrors the gate langchain-aws ships (supports_tool_choice_values for llama4 / llama3-1 / llama3-3 sets only "auto", causing with_structured_output to emit no tool_choice at all). Tests added (mirroring PR maximhq#3184's TestBedrockAnthropicChatStructuredOutputUsesSyntheticTool naming and structure): - TestBedrockLlamaChatStructuredOutputOmitsForcedToolChoice — synthetic tool present, ToolChoice nil for Llama on the ChatCompletions path. - TestBedrockNonLlamaChatStructuredOutputForcesToolChoice — regression guard: Nova / Anthropic still get the forced tool_choice (gate doesn't over-fire). - TestToBedrockResponsesRequest_LlamaStructuredOutputOmitsForcedToolChoice — same negative+positive on the Responses API synthetic-tool path. - TestBedrockLlamaConvertToolConfigOmitsForcedToolChoice — defense-in-depth coverage for ChatCompletions explicit tool_choice struct callers. - TestToBedrockResponsesRequest_LlamaConvertResponsesToolChoiceOmitsForcedToolChoice — defense-in-depth coverage for Responses API explicit tool_choice (function-typed) struct callers — the gap surfaced in code review. - TestToBedrockResponsesRequest_NonLlamaConvertResponsesToolChoiceForcesToolChoice — Anthropic regression guard for maximhq#5 so the Responses-side Llama gate doesn't over-fire. Refs: - https://docs.aws.amazon.com/bedrock/latest/APIReference/API_runtime_ToolChoice.html - https://github.com/langchain-ai/langchain-aws (bedrock_converse.py supports_tool_choice_values family-detection lines 678-711). - PR maximhq#3184 (Anthropic structured-output fallback to synthetic tool path) — immediate predecessor; this PR completes the per-family gating story.

…output tool (#3196) Bedrock Converse rejects toolConfig.toolChoice.tool on Meta Llama variants with HTTP 400 ("This model doesn't support the toolConfig.toolChoice.tool field. Remove toolConfig.toolChoice.tool and try again."). The synthetic bf_so_* tool path used for structured output unconditionally pinned the synthetic tool via toolChoice.tool, which broke every with_structured_output caller against Llama 4 Maverick / Scout / Llama 3.x on Bedrock. This patch adds an IsLlamaModel helper next to the existing IsNovaModel / IsAnthropicModel family and gates the forced-tool emission off it in four places (covering both API surfaces and both the synthetic and explicit tool_choice paths on each): 1. core/providers/bedrock/utils.go: ChatCompletions synthetic-tool injection (response_format=json_schema -> bf_so_* tool path). 2. core/providers/bedrock/responses.go: OpenAI Responses API mirror of #1 (text.format=json_schema -> bf_so_* tool path). 3. core/providers/bedrock/utils.go: convertToolConfigFromFiltered's explicit tool_choice path (defense-in-depth for ChatCompletions callers using bind_tools(tool_choice={"type": "function", "function": {"name": "X"}}) directly). 4. core/providers/bedrock/responses.go: Responses API mirror of #3 — the explicit tool_choice path runs convertResponsesToolChoice and writes BedrockToolChoice{Tool: ...} straight onto the request, which trips the same Llama 400 without the gate. With one synthetic tool bound and Bedrock's default "auto" behavior, omitting tool_choice yields the same outcome on Llama because there's exactly one tool the model can call. This mirrors the gate langchain-aws ships (supports_tool_choice_values for llama4 / llama3-1 / llama3-3 sets only "auto", causing with_structured_output to emit no tool_choice at all). Tests added (mirroring PR #3184's TestBedrockAnthropicChatStructuredOutputUsesSyntheticTool naming and structure): - TestBedrockLlamaChatStructuredOutputOmitsForcedToolChoice — synthetic tool present, ToolChoice nil for Llama on the ChatCompletions path. - TestBedrockNonLlamaChatStructuredOutputForcesToolChoice — regression guard: Nova / Anthropic still get the forced tool_choice (gate doesn't over-fire). - TestToBedrockResponsesRequest_LlamaStructuredOutputOmitsForcedToolChoice — same negative+positive on the Responses API synthetic-tool path. - TestBedrockLlamaConvertToolConfigOmitsForcedToolChoice — defense-in-depth coverage for ChatCompletions explicit tool_choice struct callers. - TestToBedrockResponsesRequest_LlamaConvertResponsesToolChoiceOmitsForcedToolChoice — defense-in-depth coverage for Responses API explicit tool_choice (function-typed) struct callers — the gap surfaced in code review. - TestToBedrockResponsesRequest_NonLlamaConvertResponsesToolChoiceForcesToolChoice — Anthropic regression guard for #5 so the Responses-side Llama gate doesn't over-fire. Refs: - https://docs.aws.amazon.com/bedrock/latest/APIReference/API_runtime_ToolChoice.html - https://github.com/langchain-ai/langchain-aws (bedrock_converse.py supports_tool_choice_values family-detection lines 678-711). - PR #3184 (Anthropic structured-output fallback to synthetic tool path) — immediate predecessor; this PR completes the per-family gating story.

feat: plugin context added

cfbee13

Pratham-Mishra04 requested a review from danpiths March 21, 2025 19:25

Pratham-Mishra04 mentioned this pull request Mar 21, 2025

Refactor Bifrost for plugin support and multi-provider architecture along with tests #1

Closed

Pratham-Mishra04 mentioned this pull request Mar 21, 2025

feat: plugin interface and tests added #2

Closed

danpiths reviewed Mar 22, 2025

View reviewed changes

danpiths requested changes Mar 22, 2025

View reviewed changes

Pratham-Mishra04 mentioned this pull request Mar 22, 2025

fix: bifrost context fixes #4

Closed

This was referenced Mar 31, 2025

feat: configs added for providers #5

Closed

feat: bedrock provider added #6

Closed

feat: cohere provider added #7

Closed

feat: provider and interface fixes #8

Closed

Pratham-Mishra04 closed this Mar 31, 2025

Pratham-Mishra04 deleted the 03-22-feat_plugin_context_added branch April 30, 2025 07:02

thiscantbeserious mentioned this pull request Apr 23, 2026

[feature] OAuth passthrough for OpenAI provider (Codex CLI subscription support) #2775

Draft

22 tasks

Pratham-Mishra04 mentioned this pull request Apr 27, 2026

fix: allow self-looping chain rules to continue evaluating subsequent rules instead of halting #3082

Merged

18 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: plugin context added#3

feat: plugin context added#3
Pratham-Mishra04 wants to merge 1 commit into03-21-feat_plugin_interface_and_tests_addedfrom
03-22-feat_plugin_context_added

Pratham-Mishra04 commented Mar 21, 2025

Uh oh!

Pratham-Mishra04 commented Mar 21, 2025 •

edited

Loading

Uh oh!

danpiths Mar 22, 2025 •

edited

Loading

Uh oh!

danpiths Mar 22, 2025

Uh oh!

danpiths left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Pratham-Mishra04 commented Mar 21, 2025

Uh oh!

Pratham-Mishra04 commented Mar 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

danpiths Mar 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

danpiths Mar 22, 2025

Choose a reason for hiding this comment

Uh oh!

danpiths left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Pratham-Mishra04 commented Mar 21, 2025 •

edited

Loading

danpiths Mar 22, 2025 •

edited

Loading