## Summary
This PR updates the changelog for v1.5.0-prerelease3, documenting new features, bug fixes, and improvements across all components, including object store log support, Azure passthrough, Mistral OCR, Redis TLS/cluster mode, and various UI enhancements.
## Changes
- Added comprehensive feature documentation including Object Store Log Store, Azure Passthrough, Mistral OCR endpoint support, OpenRouter embeddings, and Redis TLS/cluster mode
- Documented critical bug fixes for streaming post-hook race conditions, async user value propagation, and model alias tracking improvements
- Added new pricing tier support (272k token, flex, and priority tiers)
- Included dashboard CSV/PDF export capabilities and OpenAPI security scheme enhancements
- Added OAuth MCP improvements and v1.4.x compatibility mode for smoother upgrades
## Type of change
- [x] Documentation
## Affected areas
- [x] Core (Go)
- [x] Transports (HTTP)
- [x] Providers/Integrations
- [x] Plugins
- [x] UI (Next.js)
- [x] Docs
## How to test
This is a documentation change for the changelog. Verify the changelog renders correctly:
```sh
# Verify changelog formatting
cd docs
# Check that the MDX file is properly formatted and renders without errors
```
## Screenshots/Recordings
N/A - Documentation change only
## Breaking changes
- [ ] Yes
- [x] No
## Related issues
N/A - Changelog update for prerelease
## Security considerations
Documents CVE fixes and dependency updates that address security vulnerabilities.
## Checklist
- [x] I read `docs/contributing/README.md` and followed the guidelines
- [x] I added/updated tests where appropriate
- [x] I updated documentation where needed
- [x] I verified builds succeed (Go and UI)
- [x] I verified the CI pipeline passes locally if applicable
…#2658)

When converting Anthropic Messages API tool results to Bedrock Converse API format, image content blocks in function_call_output were silently dropped. The existing code only processed text blocks:

```go
for _, block := range output.ResponsesFunctionToolCallOutputBlocks {
    if block.Text != nil {
        resultContent = append(resultContent, ...)
    }
}
```

This caused tools that return image-only results (e.g. screenshot tools in computer-use agents) to produce empty tool results. The model would receive no visual feedback from screenshots, making it unable to determine what is on screen and causing it to loop endlessly retaking screenshots.

The fix adds an else-if branch that converts input_image blocks to BedrockContentBlock with Image data, using the existing convertImageToBedrockSource utility that already handles image conversion for regular message content blocks.

Note: convertBifrostResponsesMessageContentBlocksToBedrockContentBlocks already handles images correctly for regular messages — this gap only affected the function_call_output (tool result) code path.

Made-with: Cursor
## Summary
Briefly explain the purpose of this PR and the problem it solves.
## Changes
- What was changed and why
- Any notable design decisions or trade-offs
## Type of change
- [ ] Bug fix
- [ ] Feature
- [ ] Refactor
- [x] Documentation
- [ ] Chore/CI
## Affected areas
- [ ] Core (Go)
- [ ] Transports (HTTP)
- [ ] Providers/Integrations
- [ ] Plugins
- [ ] UI (Next.js)
- [x] Docs
## How to test
Describe the steps to validate this change. Include commands and expected outcomes.
```sh
# Core/Transports
go version
go test ./...

# UI
cd ui
pnpm i || npm i
pnpm test || npm test
pnpm build || npm run build
```
If adding new configs or environment variables, document them here.
## Screenshots/Recordings
If UI changes, add before/after screenshots or short clips.
## Breaking changes
- [ ] Yes
- [x] No
If yes, describe impact and migration instructions.
## Related issues
Link related issues and discussions. Example: Closes #123
## Security considerations
Note any security implications (auth, secrets, PII, sandboxing, etc.).
## Checklist
- [x] I read `docs/contributing/README.md` and followed the guidelines
- [ ] I added/updated tests where appropriate
- [x] I updated documentation where needed
- [ ] I verified builds succeed (Go and UI)
- [ ] I verified the CI pipeline passes locally if applicable
… any model rather than restrict it on vk (#2694)
## Summary
Updates the LangChain SDK documentation to use the correct model version and parameter names for Google Generative AI integration.
## Changes
- Updated Google model from `gemini-1.5-flash` to `gemini-2.5-flash` in Python and JavaScript examples
- Changed parameter name from `google_api_base` to `base_url` in Python examples to match the correct LangChain API
## Type of change
- [ ] Bug fix
- [ ] Feature
- [ ] Refactor
- [x] Documentation
- [ ] Chore/CI
## Affected areas
- [ ] Core (Go)
- [ ] Transports (HTTP)
- [ ] Providers/Integrations
- [ ] Plugins
- [ ] UI (Next.js)
- [x] Docs
## How to test
Verify the documentation examples work correctly with the updated model and parameter names:
```sh
# Test Python example
pip install langchain-google-genai
python -c "from langchain_google_genai import ChatGoogleGenerativeAI; llm = ChatGoogleGenerativeAI(model='gemini-2.5-flash', base_url='http://localhost:8080/langchain')"
# Test JavaScript example
npm install @langchain/google-genai
node -e "const { ChatGoogleGenerativeAI } = require('@langchain/google-genai'); const llm = new ChatGoogleGenerativeAI({ model: 'gemini-2.5-flash', baseURL: 'http://localhost:8080/langchain' });"
```
## Screenshots/Recordings
N/A
## Breaking changes
- [ ] Yes
- [x] No
## Related issues
N/A
## Security considerations
No security implications - documentation-only changes.
## Checklist
- [x] I read `docs/contributing/README.md` and followed the guidelines
- [ ] I added/updated tests where appropriate
- [x] I updated documentation where needed
- [ ] I verified builds succeed (Go and UI)
- [ ] I verified the CI pipeline passes locally if applicable
…e stream (#2429)

* fix(bedrock): emit message_stop event for Anthropic invoke stream

The Bedrock invoke-with-response-stream endpoint converts Bifrost internal format to Anthropic Messages API SSE events. Previously, the Completed event only emitted message_delta but not message_stop, causing clients like Claude Code to hang indefinitely waiting for the stream termination signal.

This change:
- Renames InvokeModelRawChunk to InvokeModelRawChunks ([][]byte) to support multiple events from a single Bifrost event
- Updates toAnthropicInvokeStreamBytes to return [][]byte
- For Completed events, emits both message_delta and message_stop
- Updates ToEncodedEvents to iterate over multiple chunks
- Adds test verifying both events are emitted

This fixes cross-provider routing (e.g., OpenAI → Bedrock) where the conversion layer is required.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

* fix: address PR review feedback

- Add error handling to mustMarshalJSON and mustMarshalToolParams test helpers
- Update OpenAPI schema: rename invokeModelRawChunk to invokeModelRawChunks (array type) in converse.yaml and openapi.json

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

---------

Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
Co-authored-by: Akshay Deo <akshay@akshaydeo.com>
Confidence Score: 5/5

Safe to merge — all changes are targeted bug fixes with no new P0/P1 findings. All remaining open concerns (insecure default, Zod enum gap) were raised in prior review threads and are P2. No new logic bugs, data-loss paths, or security issues were found in this pass. No files require special attention; the most impactful changes (Bedrock streaming, accumulator error path) are well-guarded and covered by existing tests.
Reviews (6): Last reviewed commit: "removes prerelease tag from helm chart (..."
## Summary
Removed the Patronus AI guardrail provider example from the enterprise guardrails documentation.
## Changes
- Removed the Patronus AI provider configuration example with ID 4 from the JSON response example
- The example included "Hallucination Detection" policy with API key and endpoint configuration
## Type of change
- [ ] Bug fix
- [ ] Feature
- [ ] Refactor
- [x] Documentation
- [ ] Chore/CI
## Affected areas
- [ ] Core (Go)
- [ ] Transports (HTTP)
- [ ] Providers/Integrations
- [ ] Plugins
- [ ] UI (Next.js)
- [x] Docs
## How to test
Verify the documentation renders correctly and the JSON example is valid:
```sh
# Validate the documentation builds without errors
# Check that the remaining JSON structure is properly formatted
```
## Screenshots/Recordings
N/A
## Breaking changes
- [ ] Yes
- [x] No
## Related issues
N/A
## Security considerations
This change removes an example that referenced API keys, which is appropriate for documentation cleanup.
## Checklist
- [x] I read `docs/contributing/README.md` and followed the guidelines
- [ ] I added/updated tests where appropriate
- [x] I updated documentation where needed
- [x] I verified builds succeed (Go and UI)
- [ ] I verified the CI pipeline passes locally if applicable
## 📝 Walkthrough

Summary by CodeRabbit:

This PR introduces extensive provider enhancements (particularly Anthropic/Bedrock/OpenAI/Gemini), adds Anthropic server-tool and eager-input-streaming support, refactors Bifrost's type system to support tool annotations, restructures config schema for SCIM/compat flags, comprehensively updates Helm deployment guides, hardens CI workflows, and downgrades Go toolchain to 1.26.1 while bumping multiple component versions.
### Sequence Diagrams

```mermaid
sequenceDiagram
    participant Client
    participant OpenAI as OpenAI Provider
    participant Bifrost
    participant Anthropic as Anthropic Provider
    Client->>Bifrost: ChatCompletionRequest with Anthropic server tools
    Bifrost->>Bifrost: Validate tools via ValidateChatToolsForProvider
    Bifrost->>Anthropic: ToAnthropicChatRequest (convert server tools)
    Anthropic->>Anthropic: stripUnsupportedAnthropicFields (for provider)
    Anthropic->>Anthropic: Inject beta headers (tool-examples, etc)
    Anthropic-->>Bifrost: AnthropicMessageRequest
    Bifrost->>Anthropic: Send to Anthropic API
    Anthropic-->>Bifrost: Streaming events with tool calls
    Bifrost->>Bifrost: Track WebSearch item IDs (per-request state)
    Bifrost->>Bifrost: Emit content blocks (skip tracked deltas)
    Bifrost->>Bifrost: On output_item.done: generate synthetic deltas
    Bifrost-->>Client: Bifrost stream chunks
```

```mermaid
sequenceDiagram
    participant User
    participant Helm as Helm Chart
    participant K8s as Kubernetes
    participant Bifrost as Bifrost Pod
    User->>Helm: helm install with values (object storage enabled)
    Helm->>K8s: Create ConfigMap (config.json with logs_store.object_storage)
    Helm->>K8s: Create Secret (S3/GCS credentials)
    Helm->>K8s: Create Deployment with env vars
    K8s->>Bifrost: Inject BIFROST_OBJECT_STORAGE_* env vars
    Bifrost->>Bifrost: Parse logs_store.object_storage config
    Bifrost->>Bifrost: Initialize S3/GCS client with env creds
    Bifrost->>Bifrost: Stream logs to object storage
    Bifrost-->>User: Health check OK
```
Estimated code review effort: 🎯 5 (Critical) | ⏱️ ~120+ minutes
Actionable comments posted: 14
Caution
Some comments are outside the diff and can’t be posted inline due to platform limitations.
⚠️ Outside diff range comments (2)
docs/enterprise/setting-up-okta.mdx (1)
188-210: ⚠️ Potential issue | 🟡 Minor — Fallback note references an unrelated step.
Line 209 ties RBAC fallback to skipping Steps 4–7, but Step 7 is API-token creation for bulk sync. This makes RBAC behavior sound dependent on an unrelated action.
✏️ Suggested wording
```diff
-Role claims are available only when you configure custom claims on your authorization server. Ensure you add role claims to your chosen authorization server (for example, `/oauth2/default`) to enable RBAC. If you skipped Steps 4-7, the first user to sign in automatically receives the **Admin** role and can manage RBAC for all subsequent users through the Bifrost dashboard.
+Role claims are available only when you configure custom claims on your authorization server. Ensure you add role claims to your chosen authorization server (for example, `/oauth2/default`) to enable RBAC. If you skip role-claim setup (Steps 3-4), the first user to sign in automatically receives the **Admin** role and can manage RBAC for all subsequent users through the Bifrost dashboard.
```

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@docs/enterprise/setting-up-okta.mdx` around lines 188 - 210, Update the fallback Note under "Step 6: Assign Users to the Application" to reference the correct prerequisite steps for role claims (the steps that configure the authorization server and custom claims) rather than Step 7; edit the <Note> block text so it says something like "If you skipped Steps 4–6 (configuring your authorization server and adding role claims), the first user to sign in..." and ensure the Note is adjacent to the "Step 6: Assign Users to the Application" section so readers understand it pertains to role-claim configuration and RBAC fallback.

core/providers/bedrock/invoke.go (1)
835-870: ⚠️ Potential issue | 🟠 Major — Preserve `tool_use` in the final `message_delta`.

The completed-event path now falls back to `end_turn` unless `IncompleteDetails` is set. For tool-call turns, Anthropic clients expect `message_delta.stop_reason == "tool_use"`; otherwise the invoke stream terminates like a normal assistant reply even though tool-use blocks were emitted earlier. Mirror the non-stream `toBedrockInvokeAnthropicResponse` logic before marshaling `message_delta`.

Suggested fix
```diff
-	stopReason := "end_turn"
-	if resp.Response != nil && resp.Response.IncompleteDetails != nil {
-		stopReason = resp.Response.IncompleteDetails.Reason
-	}
+	stopReason := "end_turn"
+	if resp.Response != nil && resp.Response.IncompleteDetails != nil {
+		stopReason = resp.Response.IncompleteDetails.Reason
+	} else if resp.Response != nil {
+		for _, item := range resp.Response.Output {
+			if item.Type != nil && *item.Type == schemas.ResponsesMessageTypeFunctionCall {
+				stopReason = "tool_use"
+				break
+			}
+		}
+	}
```
Verify each finding against the current code and only fix it if needed. In `@core/providers/bedrock/invoke.go` around lines 835 - 870, The completed-event path currently unconditionally falls back to "end_turn" for stopReason; update the case in invoke.go that builds messageDelta (use the resp variable and messageDelta/stopReason) to preserve "tool_use" the same way the non-stream toBedrockInvokeAnthropicResponse logic does: inspect the response metadata that toBedrockInvokeAnthropicResponse uses (e.g., the presence of a tool-use marker or the same IncompleteDetails/tool fields) and set stopReason = "tool_use" when appropriate before marshaling messageDelta; mirror that exact decision logic from toBedrockInvokeAnthropicResponse so streaming and non-stream behavior match.
🧹 Nitpick comments (1)
core/providers/bedrock/responses.go (1)
2568-2579: Remove unused `fmt.Errorf` allocation.

Line 2576 creates an error object and immediately discards it with `_ = fmt.Errorf(...)`. This allocates memory for no benefit. Since converter functions should be pure with no logging per coding guidelines, consider removing this line entirely — the comment already documents the intentional silent drop.

♻️ Suggested fix
```diff
 imageSource, err := convertImageToBedrockSource(*block.ResponsesInputMessageContentBlockImage.ImageURL)
 if err != nil {
 	// Bedrock only supports base64 data URIs for images. If conversion
 	// fails (e.g. remote URL), the image is dropped from the tool result
 	// which silently degrades the model's ability to see tool output.
-	_ = fmt.Errorf("bedrock: converting tool result image: %w", err)
 } else {
 	resultContent = append(resultContent, BedrockContentBlock{Image: imageSource})
 }
```
Verify each finding against the current code and only fix it if needed. In `@core/providers/bedrock/responses.go` around lines 2568 - 2579, Remove the pointless allocation of an error on conversion failure: delete the `_ = fmt.Errorf("bedrock: converting tool result image: %w", err)` line inside the convertImageToBedrockSource error branch (where block.Type == ResponsesInputMessageContentBlockTypeImage) and keep the existing comment explaining the silent drop; after this change, also remove the now-unused fmt import if it becomes unused to avoid dead imports. Ensure the rest of the block still appends to resultContent with BedrockContentBlock{Image: imageSource} on success.
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Inline comments:
In `@core/providers/bedrock/bedrock_test.go`:
- Around line 3754-3780: The test expects
ConvertBifrostMessagesToBedrockMessages to silently drop remote URL images and
not return an error, but convertImageToBedrockSource currently returns an error
for remote URLs which propagates out; change the logic in
convertImageToBedrockSource so that when the image source is a remote URL
(http/https) it does not return an error but instead indicates "skip" (e.g.,
return nil image/source and no error), and ensure
ConvertBifrostMessagesToBedrockMessages treats a nil/skip result from
convertImageToBedrockSource as non-fatal and continues (leaving tool result
Content empty) rather than returning the conversion error.
In `@docs/architecture/framework/model-catalog.mdx`:
- Around line 321-324: The snippet uses an undefined ctx when calling
modelCatalog.UpdateSyncConfig; make the example self-contained by creating a
context before use (e.g., define ctx := context.Background() immediately above
the newConfig or call modelCatalog.UpdateSyncConfig(context.Background(),
newConfig)), and keep the rest of the example (newConfig :=
&modelcatalog.Config{PricingSyncInterval: 12 * time.Hour}) unchanged so readers
can copy/paste directly.
In `@docs/changelogs/v1.5.0-prerelease3.mdx`:
- Line 102: The changelog entry uses inconsistent casing: replace the lowercase
token "litellm" on line 102 with the canonical product name "LiteLLM" so the
entry matches other occurrences; ensure any other instances in this file follow
the same "LiteLLM" capitalization for consistency.
In `@docs/enterprise/setting-up-okta.mdx`:
- Line 77: Reword the awkward RBAC note to a concise, grammatical sentence:
replace "You can use both roles and/or groups for assigning roles to users. You
can learn more about [RBAC](/enterprise/rbac) docs. Roles take precedence over
groups in role assignment." with a single clear line such as "You can assign
roles directly or via groups; roles take precedence—see the RBAC docs." Update
the sentence containing the "[RBAC](/enterprise/rbac)" link so it reads
naturally and remove the redundant word "docs."
- Around line 214-228: Step 7 instructs creating an Okta API token but Step 8's
Okta JWT configuration (issuerUrl, clientId, clientSecret, audience, rolesField,
teamIdsField, userIdField) does not reference an apiToken; either remove Step 7
entirely or update Step 8's configuration table to add an apiToken field and
explain where/how to supply it (e.g., include apiToken in the configuration
schema and show it used for bulk user/team sync), and ensure the Step 7 text and
screenshots match the chosen approach.
In `@docs/features/litellm-compat.mdx`:
- Around line 12-16: Update the introductory sentence that currently states the
plugin provides "two transformations" to reflect the actual count of four
transformations listed; specifically edit the sentence immediately preceding the
enumerated items (which include "Chat-to-Responses Conversion", "Drop
Unsupported Params", and "Unsupported Parameter Conversion") to say "four
transformations" (or rephrase to "the following transformations" if you prefer
not to use a number) so the intro matches the list and the subsequent references
to converted_request_type and dropped_compat_plugin_params remain correct.
In `@docs/migration-guides/v1.5.0.mdx`:
- Around line 483-490: Update the table row for the VK field `provider_configs:
[]` so the “With `version: 1`” column explicitly includes both `allowed_models:
["*"]` and `key_ids: ["*"]` (e.g., “All configured providers added with
`allowed_models: ["*"]` and `key_ids: ["*"]`”) to avoid implying keys remain
blocked; edit the text mentioning `provider_configs: []`, `allowed_models`, and
`key_ids` to reflect the explicit `key_ids: ["*"]` change.
In `@docs/openapi/openapi.json`:
- Around line 51562-51585: The runtime ClientConfig struct is out of sync with
the OpenAPI schema: update framework/configstore/clientconfig.go by adding a
Compat field of a new Compat struct (name it Compat) to ClientConfig and remove
or deprecate the legacy EnableLiteLLMFallbacks boolean; define the Compat struct
with boolean fields ConvertTextToChat, ConvertChatToResponses, ShouldDropParams,
and ShouldConvertParams (use json tags "convert_text_to_chat",
"convert_chat_to_responses", "should_drop_params", "should_convert_params" and
omitempty) so the runtime shape matches the schema's compat object; ensure the
Compat type is used wherever ClientConfig is serialized/deserialized so codegen
clients receive the expected structure.
In `@docs/providers/supported-providers/overview.mdx`:
- Around line 51-52: The docs incorrectly state that text-to-chat conversion is
toggled by compat.convert_text_to_chat and disabled by default; update the
overview.mdx text to say conversion is performed automatically when a model
lacks native text-completion support based on the model catalog check (see
plugins/litellmcompat/texttochat.go IsTextCompletionSupported and the PreLLMHook
in plugins/litellmcompat/main.go), and correct any mention of the flag name to
enable_litellm_fallbacks if you must reference the global toggle (but note the
conversion logic is driven by the catalog check, not
compat.convert_text_to_chat).
In `@framework/modelcatalog/main.go`:
- Around line 676-679: The code dereferences providerConfig.CustomProviderConfig
without a nil check causing a panic; update the logic around providerConfig to
first test providerConfig.CustomProviderConfig != nil and only then set
isCustomProvider = true and compute hasListModelsEndpointDisabled =
!providerConfig.CustomProviderConfig.IsOperationAllowed(schemas.ListModelsRequest);
otherwise set isCustomProvider = false and leave hasListModelsEndpointDisabled
as false (or its safe default) so IsOperationAllowed is never called on a nil
CustomProviderConfig.
In `@helm-charts/index.yaml`:
- Around line 4-24: index.yaml entries (e.g., version 2.0.18 with urls entry
bifrost-2.0.18.tgz) contain empty digest: "" values; regenerate the Helm index
so Helm computes and populates those digests instead of editing them manually by
either triggering the existing helm-release.yml workflow or running the same
command locally (helm repo index . --merge) against the chart package directory,
then commit the regenerated index.yaml so digest fields are filled for all
recent chart versions.
In `@plugins/governance/main.go`:
- Around line 579-582: The code indexes p.inMemoryStore.GetConfiguredProviders()
without checking whether the provider key exists, which may pass a zero-value
ProviderConfig into p.modelCatalog.IsModelAllowedForProvider and incorrectly
mark an unconfigured provider as allowed; change the flow in the block using
p.modelCatalog and p.inMemoryStore so you first capture the map (e.g., cfgs :=
p.inMemoryStore.GetConfiguredProviders()), then look up providerConfig, ok :=
cfgs[provider], and only call p.modelCatalog.IsModelAllowedForProvider(provider,
modelStr, &providerConfig, config.AllowedModels) when ok is true; if ok is
false, set isProviderAllowed = false (or otherwise treat the provider as
disallowed) so an absent configured provider cannot be considered eligible.
In `@plugins/governance/resolver.go`:
- Around line 339-342: The current code indexes
r.governanceInMemoryStore.GetConfiguredProviders()[provider] without checking
existence, producing a zero-value ProviderConfig and a non-nil pointer even when
the key is missing; update the block so you first retrieve the map and check
presence (e.g. cfg, ok :=
r.governanceInMemoryStore.GetConfiguredProviders()[provider]) and only pass a
pointer to cfg to r.modelCatalog.IsModelAllowedForProvider when ok is true,
otherwise pass nil (or handle the "not configured" case consistently with
IsModelAllowedForProvider) to avoid incorrect behavior or panics.
In `@plugins/otel/changelog.md`:
- Around line 1-2: Update the plugins/otel/changelog.md to include an explicit
upgrade note that the OTEL plugin's trace_type value was renamed from "otel" to
"genai_extension", indicating that database-backed installs are auto-migrated
while file-based installs using config.json must manually rename the trace_type
entry; mention any default behavior change (insecure default true) only if
relevant and provide a short actionable instruction for users to edit
config.json's trace_type field accordingly.
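For the file-based manual migration described above, the edit would look roughly like this hypothetical `config.json` fragment — the surrounding structure is assumed, and only the `trace_type` field name and the `"otel"` → `"genai_extension"` rename come from the note:

```json
{
  "plugins": {
    "otel": {
      "trace_type": "genai_extension"
    }
  }
}
```

Database-backed installs are auto-migrated and need no manual change.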
---
Outside diff comments:
In `@core/providers/bedrock/invoke.go`:
- Around line 835-870: The completed-event path currently unconditionally falls
back to "end_turn" for stopReason; update the case in invoke.go that builds
messageDelta (use the resp variable and messageDelta/stopReason) to preserve
"tool_use" the same way the non-stream toBedrockInvokeAnthropicResponse logic
does: inspect the response metadata that toBedrockInvokeAnthropicResponse uses
(e.g., the presence of a tool-use marker or the same IncompleteDetails/tool
fields) and set stopReason = "tool_use" when appropriate before marshaling
messageDelta; mirror that exact decision logic from
toBedrockInvokeAnthropicResponse so streaming and non-stream behavior match.
In `@docs/enterprise/setting-up-okta.mdx`:
- Around line 188-210: Update the fallback Note under "Step 6: Assign Users to
the Application" to reference the correct prerequisite steps for role claims
(the steps that configure the authorization server and custom claims) rather
than Step 7; edit the <Note> block text so it says something like "If you
skipped Steps 4–6 (configuring your authorization server and adding role
claims), the first user to sign in..." and ensure the Note is adjacent to the
"Step 6: Assign Users to the Application" section so readers understand it
pertains to role-claim configuration and RBAC fallback.
---
Nitpick comments:
In `@core/providers/bedrock/responses.go`:
- Around line 2568-2579: Remove the pointless allocation of an error on
conversion failure: delete the `_ = fmt.Errorf("bedrock: converting tool result
image: %w", err)` line inside the convertImageToBedrockSource error branch
(where block.Type == ResponsesInputMessageContentBlockTypeImage) and keep the
existing comment explaining the silent drop; after this change, also remove the
now-unused fmt import if it becomes unused to avoid dead imports. Ensure the
rest of the block still appends to resultContent with BedrockContentBlock{Image:
imageSource} on success.
ℹ️ Review info
⚙️ Run configuration
Configuration used: Organization UI
Review profile: CHILL
Plan: Pro
Run ID: a2996596-5c5f-4d5f-9dd9-e779d4bdca46
⛔ Files ignored due to path filters (2)
- `docs/media/compatibility-settings.png` is excluded by `!**/*.png`
- `flake.lock` is excluded by `!**/*.lock`
📒 Files selected for processing (42)
- `core/changelog.md`
- `core/providers/bedrock/bedrock_test.go`
- `core/providers/bedrock/invoke.go`
- `core/providers/bedrock/responses.go`
- `core/providers/bedrock/types.go`
- `docs/architecture/framework/model-catalog.mdx`
- `docs/changelogs/v1.5.0-prerelease2.mdx`
- `docs/changelogs/v1.5.0-prerelease3.mdx`
- `docs/docs.json`
- `docs/enterprise/guardrails.mdx`
- `docs/enterprise/setting-up-okta.mdx`
- `docs/features/litellm-compat.mdx`
- `docs/integrations/langchain-sdk.mdx`
- `docs/migration-guides/v1.5.0.mdx`
- `docs/openapi/openapi.json`
- `docs/openapi/schemas/integrations/bedrock/converse.yaml`
- `docs/openapi/schemas/management/config.yaml`
- `docs/providers/custom-pricing.mdx`
- `docs/providers/supported-providers/overview.mdx`
- `framework/configstore/migrations.go`
- `framework/modelcatalog/main.go`
- `framework/modelcatalog/main_test.go`
- `helm-charts/bifrost/Chart.yaml`
- `helm-charts/bifrost/README.md`
- `helm-charts/bifrost/templates/_helpers.tpl`
- `helm-charts/index.yaml`
- `plugins/governance/main.go`
- `plugins/governance/model_provider_governance_test.go`
- `plugins/governance/resolver.go`
- `plugins/governance/resolver_test.go`
- `plugins/governance/tracker_test.go`
- `plugins/governance/utils.go`
- `plugins/otel/changelog.md`
- `plugins/otel/main.go`
- `plugins/otel/metrics.go`
- `transports/bifrost-http/handlers/providers.go`
- `transports/bifrost-http/integrations/bedrock.go`
- `transports/changelog.md`
- `transports/config.schema.json`
- `ui/app/workspace/observability/fragments/otelFormFragment.tsx`
- `ui/components/ui/select.tsx`
- `ui/lib/types/schemas.ts`
💤 Files with no reviewable changes (2)
- docs/changelogs/v1.5.0-prerelease2.mdx
- docs/enterprise/guardrails.mdx
```go
t.Run("RemoteURLImageGracefullyDropped", func(t *testing.T) {
	input := []schemas.ResponsesMessage{
		{
			Type: schemas.Ptr(schemas.ResponsesMessageTypeFunctionCallOutput),
			ResponsesToolMessage: &schemas.ResponsesToolMessage{
				CallID: schemas.Ptr("tooluse_remote_003"),
				Output: &schemas.ResponsesToolMessageOutputStruct{
					ResponsesFunctionToolCallOutputBlocks: []schemas.ResponsesMessageContentBlock{
						{
							Type: schemas.ResponsesInputMessageContentBlockTypeImage,
							ResponsesInputMessageContentBlockImage: &schemas.ResponsesInputMessageContentBlockImage{
								ImageURL: schemas.Ptr("https://example.com/screenshot.png"),
							},
						},
					},
				},
			},
		},
	}

	messages, _, err := bedrock.ConvertBifrostMessagesToBedrockMessages(input)
	require.NoError(t, err)
	require.Len(t, messages, 1)

	toolResult := messages[0].Content[0].ToolResult
	require.NotNil(t, toolResult)
	assert.Empty(t, toolResult.Content, "remote URL image should be dropped (Bedrock only supports base64)")
})
```
Match this subtest to the current converter contract.
`ConvertBifrostMessagesToBedrockMessages` still propagates `failed to convert image in tool result` when `convertImageToBedrockSource` sees a remote URL, so this `require.NoError` + empty-content expectation will fail unless the production branch in core/providers/bedrock/utils.go:561-576 is changed to skip invalid tool-result images instead of returning.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@core/providers/bedrock/bedrock_test.go` around lines 3754 - 3780, The test
expects ConvertBifrostMessagesToBedrockMessages to silently drop remote URL
images and not return an error, but convertImageToBedrockSource currently
returns an error for remote URLs which propagates out; change the logic in
convertImageToBedrockSource so that when the image source is a remote URL
(http/https) it does not return an error but instead indicates "skip" (e.g.,
return nil image/source and no error), and ensure
ConvertBifrostMessagesToBedrockMessages treats a nil/skip result from
convertImageToBedrockSource as non-fatal and continues (leaving tool result
Content empty) rather than returning the conversion error.
```diff
 newConfig := &modelcatalog.Config{
 	PricingSyncInterval: 12 * time.Hour,
 }
-err := modelCatalog.ReloadPricing(ctx, newConfig)
+err := modelCatalog.UpdateSyncConfig(ctx, newConfig)
```
Make this example self-contained.
`ctx` is undefined in the snippet, so readers can't copy/paste it as-is. Use `context.Background()` inline or define `ctx` immediately above the call.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@docs/architecture/framework/model-catalog.mdx` around lines 321 - 324, The
snippet uses an undefined ctx when calling modelCatalog.UpdateSyncConfig; make
the example self-contained by creating a context before use (e.g., define ctx :=
context.Background() immediately above the newConfig or call
modelCatalog.UpdateSyncConfig(context.Background(), newConfig)), and keep the
rest of the example (newConfig := &modelcatalog.Config{PricingSyncInterval: 12 *
time.Hour}) unchanged so readers can copy/paste directly.
```mdx
</Update>
<Update label="litellmcompat" description="0.1.2">
- fix: litellm compat
```
Normalize product naming for consistency.
Line 102 uses `litellm` while other entries use `LiteLLM`; use one canonical form across the changelog.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@docs/changelogs/v1.5.0-prerelease3.mdx` at line 102, The changelog entry uses
inconsistent casing: replace the lowercase token "litellm" on line 102 with the
canonical product name "LiteLLM" so the entry matches other occurrences; ensure
any other instances in this file follow the same "LiteLLM" capitalization for
consistency.
```mdx
## Step 7: Create API token for bulk user and team sync

To create an API token, navigate to **Security** → **API** → **Tokens**.

<Frame>
  <img src="/media/user-provisioning/okta-tokens-screen.png" alt="Okta API tokens screen" />
</Frame>

1. Click on "Create token"

<Frame>
  <img src="/media/user-provisioning/okta-create-token-form.png" alt="Create token dialog in Okta" />
</Frame>

2. Copy token to be used in the next step.
```
🧩 Analysis chain
🏁 Script executed:
```bash
#!/bin/bash
set -euo pipefail
# Verify whether Okta API token is actually supported in this PR stack (UI/backend/docs).
# Expected:
#   - If supported: references to an Okta API token field/config should exist in code/schemas/docs.
#   - If not supported: Step 7 should not claim it is used in Step 8.
echo "== Search for Okta provisioning config fields =="
rg -n -C2 --iglob '*.{ts,tsx,js,jsx,go,json,yaml,yml,md,mdx}' '\bokta\b'
echo "== Search for API token references tied to Okta/user provisioning =="
rg -n -C2 --iglob '*.{ts,tsx,js,jsx,go,json,yaml,yml,md,mdx}' 'okta.{0,40}(api.?token|token)|(api.?token|token).{0,40}okta|bulk user|team sync|user provisioning'
echo "== Search for known config fields currently documented =="
rg -n -C2 --iglob '*.{ts,tsx,js,jsx,go,json,yaml,yml,md,mdx}' 'issuerUrl|clientId|clientSecret|audience|rolesField|teamIdsField'
```

Repository: maximhq/bifrost
Length of output: 44159
Step 7 creates an API token that Step 8's configuration never requests.
The documentation instructs users in Step 7 to create an Okta API token "for bulk user and team sync" and copy it for use "in the next step," but the Step 8 configuration table lists no API token field. The actual Okta JWT configuration schema (`issuerUrl`, `clientId`, `clientSecret`, `audience`, `rolesField`, `teamIdsField`, `userIdField`) contains no `apiToken` field.
Either remove Step 7 if the API token is not required for Okta setup, or update the Step 8 configuration table to include the token field and clarify where it should be used.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@docs/enterprise/setting-up-okta.mdx` around lines 214 - 228, Step 7 instructs
creating an Okta API token but Step 8's Okta JWT configuration (issuerUrl,
clientId, clientSecret, audience, rolesField, teamIdsField, userIdField) does
not reference an apiToken; either remove Step 7 entirely or update Step 8's
configuration table to add an apiToken field and explain where/how to supply it
(e.g., include apiToken in the configuration schema and show it used for bulk
user/team sync), and ensure the Step 7 text and screenshots match the chosen
approach.
```go
if providerConfig != nil {
	isCustomProvider = providerConfig.CustomProviderConfig != nil
	hasListModelsEndpointDisabled = !providerConfig.CustomProviderConfig.IsOperationAllowed(schemas.ListModelsRequest)
}
```
Nil dereference when `providerConfig.CustomProviderConfig` is absent.
When `providerConfig` is non-nil but `CustomProviderConfig` is nil, line 678 panics on `IsOperationAllowed(...)`.
🐛 Proposed fix
```diff
 func (mc *ModelCatalog) IsModelAllowedForProvider(provider schemas.ModelProvider, model string, providerConfig *configstore.ProviderConfig, allowedModels []string) bool {
 	isCustomProvider := false
 	hasListModelsEndpointDisabled := false
-	if providerConfig != nil {
-		isCustomProvider = providerConfig.CustomProviderConfig != nil
-		hasListModelsEndpointDisabled = !providerConfig.CustomProviderConfig.IsOperationAllowed(schemas.ListModelsRequest)
+	if providerConfig != nil && providerConfig.CustomProviderConfig != nil {
+		isCustomProvider = true
+		hasListModelsEndpointDisabled = !providerConfig.CustomProviderConfig.IsOperationAllowed(schemas.ListModelsRequest)
 	}
```
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@framework/modelcatalog/main.go` around lines 676 - 679, The code dereferences
providerConfig.CustomProviderConfig without a nil check causing a panic; update
the logic around providerConfig to first test
providerConfig.CustomProviderConfig != nil and only then set isCustomProvider =
true and compute hasListModelsEndpointDisabled =
!providerConfig.CustomProviderConfig.IsOperationAllowed(schemas.ListModelsRequest);
otherwise set isCustomProvider = false and leave hasListModelsEndpointDisabled
as false (or its safe default) so IsOperationAllowed is never called on a nil
CustomProviderConfig.
```yaml
- apiVersion: v2
  appVersion: 1.4.11
  created: "2026-04-13T12:00:00.000000+00:00"
  description: A Helm chart for deploying Bifrost - AI Gateway with unified interface for multiple providers
  digest: ""
  home: https://www.getmaxim.ai/bifrost
  icon: https://www.getmaxim.ai/bifrost/bifrost-logo-only.png
  keywords:
  - ai
  - gateway
  - llm
  maintainers:
  - email: akshay@getmaxim.ai
    name: Bifrost Team
  name: bifrost
  sources:
  - https://github.com/maximhq/bifrost
  type: application
  urls:
  - https://maximhq.github.io/bifrost/helm-charts/bifrost-2.0.18.tgz
  version: 2.0.18
```
🧩 Analysis chain
🏁 Script executed:
```bash
#!/bin/bash
set -euo pipefail
python - <<'PY'
import re
from datetime import datetime
from pathlib import Path

p = Path("helm-charts/index.yaml")
text = p.read_text()
lines = text.splitlines()

# Find v2.0.18 block and digest value
version_line = next((i for i, l in enumerate(lines) if l.strip() == "version: 2.0.18"), None)
if version_line is None:
    raise SystemExit("FAIL: version 2.0.18 entry not found")
start = version_line
while start >= 0 and not lines[start].lstrip().startswith("- apiVersion:"):
    start -= 1
block = "\n".join(lines[start:version_line + 1])
m_digest = re.search(r'^\s*digest:\s*"(.*)"\s*$', block, re.M)
digest = m_digest.group(1) if m_digest else None
print("v2.0.18 digest:", repr(digest))
if not digest:
    print("FAIL: digest is empty for v2.0.18")
else:
    print("OK: digest present for v2.0.18")

# Optional sanity check: generated timestamp should be >= max created timestamp
m_generated = re.search(r'^generated:\s*"([^"]+)"\s*$', text, re.M)
if m_generated:
    generated = datetime.fromisoformat(m_generated.group(1))
    created = [datetime.fromisoformat(ts) for ts in re.findall(r'^\s*created:\s*"([^"]+)"\s*$', text, re.M)]
    print("generated:", generated.isoformat())
    print("max created:", max(created).isoformat())
    if generated < max(created):
        print("WARN: generated is older than latest created timestamp")
PY
```

Length of output: 242

🏁 Script executed:
```bash
git ls-files helm-charts/ | head -20
```

Length of output: 940

🏁 Script executed:
```bash
# Find files related to helm chart building/publishing
rg -l "digest|helm.*build|helm.*publish|helm.*index" --max-count=10
```

Length of output: 177

🏁 Script executed:
```bash
# Check if there's a release workflow or automation
fd -t f "release|build|workflow" helm-charts/ | head -20
```

Length of output: 41

🏁 Script executed:
```bash
# Look for any documentation about the release process
rg -l "2.0.18|bifrost.*release|chart.*digest" -t md -t yaml | head -15
```

Length of output: 200

🏁 Script executed:
```bash
cat -n helm-charts/bifrost/Chart.yaml | head -30
```

Length of output: 651

🏁 Script executed:
```bash
# Look for CI/CD workflows
fd -t f "workflow|github" | grep -E "\.(yml|yaml)$" | head -20
```

Length of output: 41

🏁 Script executed:
```bash
# Check for release scripts or automation
fd -t f -E "node_modules" "release|publish" -t f | head -20
```

Length of output: 847

🏁 Script executed:
```bash
# Check git history to see if digest is usually populated
git log --oneline helm-charts/index.yaml | head -10
```

Length of output: 112

🏁 Script executed:
```bash
find .github -type f -name "*.yml" -o -name "*.yaml" 2>/dev/null | head -20
```

Length of output: 738

🏁 Script executed:
```bash
# Look at the full index.yaml to see digest pattern in other entries
cat -n helm-charts/index.yaml | grep -A 2 -B 2 "digest:"
```

Length of output: 8667

🏁 Script executed:
```bash
# Check if helm chart repo tools are mentioned in any config
rg "helm.*repo|chartmuseum|artifacthub" -t yml -t yaml -t json | head -15
```

Length of output: 86

🏁 Script executed:
```bash
cat -n .github/workflows/helm-release.yml
```

Length of output: 4628

🏁 Script executed:
```bash
# Check what versions exist and correlate with digest pattern
rg "version:|digest:" helm-charts/index.yaml | head -50
```

Length of output: 917

🏁 Script executed:
```bash
# Check git history of index.yaml to understand how it's typically modified
git log --oneline --follow helm-charts/index.yaml | head -20
```

Length of output: 112

🏁 Script executed:
```bash
# Look for any documentation about release process
rg -i "release.*process|helm.*release|index\.yaml" --max-count=15 -t md
```

Length of output: 41
Ensure digest metadata is populated for all recent chart index entries.
Empty `digest: ""` fields across v2.0.18 and other recent chart versions in `index.yaml` remove integrity verification metadata. The `helm-release.yml` workflow includes `helm repo index`, which should populate these digests automatically when packaging releases.
Regenerate the index with populated digests: either trigger the `helm-release.yml` workflow to rebuild the index through its normal automation path (which runs `helm repo index . --merge`), or manually regenerate it in the index location.
Note: this empty-digest pattern affects all recent versions in the file, not just 2.0.18, suggesting the index may have been edited directly instead of generated through the automation workflow.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@helm-charts/index.yaml` around lines 4 - 24, index.yaml entries (e.g.,
version 2.0.18 with urls entry bifrost-2.0.18.tgz) contain empty digest: ""
values; regenerate the Helm index so Helm computes and populates those digests
instead of editing them manually by either triggering the existing
helm-release.yml workflow or running the same command locally (helm repo index .
--merge) against the chart package directory, then commit the regenerated
index.yaml so digest fields are filled for all recent chart versions.
```go
if p.modelCatalog != nil && p.inMemoryStore != nil {
	provider := schemas.ModelProvider(config.Provider)
	providerConfig := p.inMemoryStore.GetConfiguredProviders()[provider]
	isProviderAllowed = p.modelCatalog.IsModelAllowedForProvider(provider, modelStr, &providerConfig, config.AllowedModels)
```
Guard missing configured providers before calling the model catalog.
Line 581 indexes `GetConfiguredProviders()` without checking `ok`. If a virtual key references a provider that is not globally configured, this passes a zero-value `configstore.ProviderConfig` into `IsModelAllowedForProvider(...)`, which can keep that provider eligible and let the balancer rewrite the request to an unusable provider/model target.
🛠️ Proposed fix
```diff
-	if p.modelCatalog != nil && p.inMemoryStore != nil {
-		provider := schemas.ModelProvider(config.Provider)
-		providerConfig := p.inMemoryStore.GetConfiguredProviders()[provider]
-		isProviderAllowed = p.modelCatalog.IsModelAllowedForProvider(provider, modelStr, &providerConfig, config.AllowedModels)
+	if p.modelCatalog != nil && p.inMemoryStore != nil {
+		provider := schemas.ModelProvider(config.Provider)
+		providerConfig, exists := p.inMemoryStore.GetConfiguredProviders()[provider]
+		if !exists {
+			ctx.AppendRoutingEngineLog(
+				schemas.RoutingEngineGovernance,
+				fmt.Sprintf("Provider %s excluded: provider is not configured", config.Provider),
+			)
+			continue
+		}
+		isProviderAllowed = p.modelCatalog.IsModelAllowedForProvider(provider, modelStr, &providerConfig, config.AllowedModels)
 	} else {
```
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@plugins/governance/main.go` around lines 579 - 582, The code indexes
p.inMemoryStore.GetConfiguredProviders() without checking whether the provider
key exists, which may pass a zero-value ProviderConfig into
p.modelCatalog.IsModelAllowedForProvider and incorrectly mark an unconfigured
provider as allowed; change the flow in the block using p.modelCatalog and
p.inMemoryStore so you first capture the map (e.g., cfgs :=
p.inMemoryStore.GetConfiguredProviders()), then look up providerConfig, ok :=
cfgs[provider], and only call p.modelCatalog.IsModelAllowedForProvider(provider,
modelStr, &providerConfig, config.AllowedModels) when ok is true; if ok is
false, set isProviderAllowed = false (or otherwise treat the provider as
disallowed) so an absent configured provider cannot be considered eligible.
```go
if r.modelCatalog != nil && r.governanceInMemoryStore != nil {
	providerConfig := r.governanceInMemoryStore.GetConfiguredProviders()[provider]
	return r.modelCatalog.IsModelAllowedForProvider(provider, model, &providerConfig, pc.AllowedModels)
}
```
Same map indexing issue as `plugins/governance/utils.go`: missing existence check.
The pattern `r.governanceInMemoryStore.GetConfiguredProviders()[provider]` returns a zero-initialized `ProviderConfig{}` for non-existent keys, not a distinguishable "not found" state. Passing `&providerConfig` therefore yields a non-nil pointer to a zero struct, which may cause incorrect behavior or a panic in `IsModelAllowedForProvider`.
🐛 Proposed fix to check map key existence
```diff
 if r.modelCatalog != nil && r.governanceInMemoryStore != nil {
-	providerConfig := r.governanceInMemoryStore.GetConfiguredProviders()[provider]
-	return r.modelCatalog.IsModelAllowedForProvider(provider, model, &providerConfig, pc.AllowedModels)
+	providers := r.governanceInMemoryStore.GetConfiguredProviders()
+	if providerConfig, exists := providers[provider]; exists {
+		return r.modelCatalog.IsModelAllowedForProvider(provider, model, &providerConfig, pc.AllowedModels)
+	}
+	// Provider not configured, fall through to fallback logic below
 }
```
Verify each finding against the current code and only fix it if needed.
In `@plugins/governance/resolver.go` around lines 339 - 342, The current code
indexes r.governanceInMemoryStore.GetConfiguredProviders()[provider] without
checking existence, producing a zero-value ProviderConfig and a non-nil pointer
even when the key is missing; update the block so you first retrieve the map and
check presence (e.g. cfg, ok :=
r.governanceInMemoryStore.GetConfiguredProviders()[provider]) and only pass a
pointer to cfg to r.modelCatalog.IsModelAllowedForProvider when ok is true,
otherwise pass nil (or handle the "not configured" case consistently with
IsModelAllowedForProvider) to avoid incorrect behavior or panics.
Actionable comments posted: 7
Caution
Some comments are outside the diff and can’t be posted inline due to platform limitations.
⚠️ Outside diff range comments (1)
helm-charts/bifrost/values.schema.json (1)
1423-1494: ⚠️ Potential issue | 🟡 Minor
Ensure migration documentation is visible to users upgrading from previous versions.
The breaking changes are confirmed and documented in the commit message:
- Block renamed from `saml` to `scim`: Helm values must change from `bifrost.saml` to `bifrost.scim`; Terraform variables must change from `saml_config` to `scim_config`.
- Okta config now requires `apiToken` (in addition to `issuerUrl`, `clientId`, `clientSecret`): existing Okta integrations missing this field will fail validation.

Migration instructions are provided in the commit message, but consider adding them to a user-facing CHANGELOG or migration guide (similar to the v1.5.0 migration guide) to ensure users upgrading encounter clear instructions before validation fails.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@helm-charts/bifrost/values.schema.json` around lines 1423 - 1494, Add a user-facing migration note documenting the breaking changes: state that the Helm values block was renamed from bifrost.saml to bifrost.scim (and Terraform variable saml_config -> scim_config) and that Okta config now requires apiToken in addition to issuerUrl, clientId, clientSecret; place this guidance in the project CHANGELOG or a dedicated migration guide (similar to the v1.5.0 migration guide) and include example before/after key names (bifrost.saml -> bifrost.scim, saml_config -> scim_config) and a clear callout that missing apiToken will fail schema validation.
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Inline comments:
In `@helm-charts/bifrost/templates/_helpers.tpl`:
- Around line 818-822: The Helm template makes connection_string come from a
secret placeholder (see the block that sets $cc "connection_string" to "env.%s")
but the downstream bifrost.validate logic still requires httpConfig.url and
doesn't accept connectionString/secret-backed values for SSE clients; update the
validation in bifrost.validate to treat a non-empty connection_string (including
the env.* placeholder pattern) as a valid endpoint alternative to httpConfig.url
and ensure the SSE client branch checks $cc.connection_string or
$client.connectionString (or the env.* form) instead of only httpConfig.url so
secretRef-backed clients pass Helm validation.
- Around line 578-635: The template silently omits S3 credential placeholders
when .Values.storage.logsStore.objectStorage.existingSecret is true but the
specific *Key fields (accessKeyIdKey, secretAccessKeyKey, sessionTokenKey) are
not set; update the helper that builds $osConfig (the block handling
$os.existingSecret) to always set access_key_id =>
"env.BIFROST_OBJECT_STORAGE_ACCESS_KEY_ID", secret_access_key =>
"env.BIFROST_OBJECT_STORAGE_SECRET_ACCESS_KEY", and session_token =>
"env.BIFROST_OBJECT_STORAGE_SESSION_TOKEN" when $os.existingSecret is true
(instead of only when accessKeyIdKey/secretAccessKeyKey/sessionTokenKey are
present), keeping role_arn => "env.BIFROST_OBJECT_STORAGE_ROLE_ARN" as-is so
logs_store.object_storage receives the expected env placeholders.
In `@helm-charts/bifrost/values.schema.json`:
- Around line 779-787: The enum for trace_type in values.schema.json has removed
the "otel" value (now only "genai_extension", "vercel", "open_inference");
update release notes for v1.5.0 to call this out as a breaking change, reference
migrationNormalizeOtelTraceType for DB-backed upgrades, and explicitly instruct
file-based config users to manually rename any "otel" values to
"genai_extension" when upgrading from earlier versions.
In `@helm-charts/bifrost/values.yaml`:
- Around line 334-340: The secretRef example currently shown under the stdio
client sample is misleading because secretRef only supplies connection_string;
move the secretRef block (the name/connectionStringKey example and its
explanation about injecting BIFROST_MCP_<NAME>_CONNECTION_STRING and rewriting
connection_string in config.json) out of the stdio example and place it under
the HTTP/SSE client sample (or a shared "connection-string-based clients"
section), or alternatively add a short note next to secretRef that it only
applies to connection-string-based clients and will not affect stdio.
In `@transports/config.schema.json`:
- Around line 1848-1855: The schema currently allows the object to be valid
without either mcp_client_id or mcp_client_name; update the JSON Schema to
require at least one of these fields (use anyOf or oneOf with two branches: one
requiring ["mcp_client_id"] and the other requiring ["mcp_client_name"]) so that
the object cannot validate unless an MCP client reference is provided; apply
this change where the mcp_client_id and mcp_client_name properties are defined
in transports/config.schema.json.
- Around line 3777-3819: The provider_pricing_override schema is too permissive:
make virtual_key_id, provider_id, and provider_key_id required conditionally
based on scope_kind and constrain request_types to the existing
pricing_override_request_type definition. Update the schema for the object that
defines scope_kind/virtual_key_id/provider_id/provider_key_id/request_types so
that: (1) when scope_kind is "virtual_key" or "virtual_key_provider" or
"virtual_key_provider_key" the virtual_key_id property is required; (2) when
scope_kind is "provider" or "provider_key" or "virtual_key_provider" or
"virtual_key_provider_key" the provider_id property is required; (3) when
scope_kind is "provider_key" or "virtual_key_provider_key" the provider_key_id
property is required; implement these with JSON Schema conditional subschemas
(if/then or anyOf) tied to scope_kind; and (4) change request_types.items to
reference or enumerate the existing pricing_override_request_type (use $ref to
the pricing_override_request_type definition or copy its enum) instead of
allowing arbitrary strings.
- Around line 2742-2748: The schema enum for transport "type" was extended with
"inprocess" but the oneOf discriminator later still lacks a matching branch,
causing validation failures; add a new oneOf branch for the "inprocess" client
that matches {"type":"inprocess"} (i.e., a schema object with properties.type
const/enum "inprocess" and required ["type"] and whatever specific properties
the in-process transport needs, or allow additionalProperties if none are
required) so the oneOf includes a case for the inprocess transport alongside the
existing stdio/http/sse branches.
---
Outside diff comments:
In `@helm-charts/bifrost/values.schema.json`:
- Around line 1423-1494: Add a user-facing migration note documenting the
breaking changes: state that the Helm values block was renamed from bifrost.saml
to bifrost.scim (and Terraform variable saml_config -> scim_config) and that
Okta config now requires apiToken in addition to issuerUrl, clientId,
clientSecret; place this guidance in the project CHANGELOG or a dedicated
migration guide (similar to the v1.5.0 migration guide) and include example
before/after key names (bifrost.saml -> bifrost.scim, saml_config ->
scim_config) and a clear callout that missing apiToken will fail schema
validation.
🪄 Autofix (Beta)
Fix all unresolved CodeRabbit comments on this PR:
- Push a commit to this branch (recommended)
- Create a new PR with the fixes
ℹ️ Review info
⚙️ Run configuration
Configuration used: Organization UI
Review profile: CHILL
Plan: Pro
Run ID: df48c63e-5946-42f9-b62e-265432af117e
📒 Files selected for processing (11)
- helm-charts/bifrost/Chart.yaml
- helm-charts/bifrost/README.md
- helm-charts/bifrost/templates/_helpers.tpl
- helm-charts/bifrost/templates/deployment.yaml
- helm-charts/bifrost/templates/stateful.yaml
- helm-charts/bifrost/values-examples/providers-and-virtual-keys.yaml
- helm-charts/bifrost/values-examples/secrets-from-k8s.yaml
- helm-charts/bifrost/values.schema.json
- helm-charts/bifrost/values.yaml
- helm-charts/index.yaml
- transports/config.schema.json
✅ Files skipped from review due to trivial changes (1)
- helm-charts/bifrost/README.md
🚧 Files skipped from review as they are similar to previous changes (2)
- helm-charts/bifrost/Chart.yaml
- helm-charts/index.yaml
```yaml
{{- /* Object Storage for log payloads */ -}}
{{- if and .Values.storage.logsStore.objectStorage .Values.storage.logsStore.objectStorage.enabled }}
{{- $os := .Values.storage.logsStore.objectStorage }}
{{- $osConfig := dict "type" $os.type "bucket" $os.bucket }}
{{- if $os.prefix }}
{{- $_ := set $osConfig "prefix" $os.prefix }}
{{- end }}
{{- if $os.compress }}
{{- $_ := set $osConfig "compress" true }}
{{- end }}
{{- if eq $os.type "s3" }}
{{- if $os.region }}
{{- $_ := set $osConfig "region" $os.region }}
{{- end }}
{{- if $os.endpoint }}
{{- $_ := set $osConfig "endpoint" $os.endpoint }}
{{- end }}
{{- if $os.existingSecret }}
{{- if $os.accessKeyIdKey }}
{{- $_ := set $osConfig "access_key_id" "env.BIFROST_OBJECT_STORAGE_ACCESS_KEY_ID" }}
{{- end }}
{{- if $os.secretAccessKeyKey }}
{{- $_ := set $osConfig "secret_access_key" "env.BIFROST_OBJECT_STORAGE_SECRET_ACCESS_KEY" }}
{{- end }}
{{- if $os.sessionTokenKey }}
{{- $_ := set $osConfig "session_token" "env.BIFROST_OBJECT_STORAGE_SESSION_TOKEN" }}
{{- end }}
{{- $_ := set $osConfig "role_arn" "env.BIFROST_OBJECT_STORAGE_ROLE_ARN" }}
{{- else }}
{{- if $os.accessKeyId }}
{{- $_ := set $osConfig "access_key_id" $os.accessKeyId }}
{{- end }}
{{- if $os.secretAccessKey }}
{{- $_ := set $osConfig "secret_access_key" $os.secretAccessKey }}
{{- end }}
{{- if $os.sessionToken }}
{{- $_ := set $osConfig "session_token" $os.sessionToken }}
{{- end }}
{{- if $os.roleArn }}
{{- $_ := set $osConfig "role_arn" $os.roleArn }}
{{- end }}
{{- end }}
{{- if $os.forcePathStyle }}
{{- $_ := set $osConfig "force_path_style" true }}
{{- end }}
{{- end }}
{{- if eq $os.type "gcs" }}
{{- if $os.projectId }}
{{- $_ := set $osConfig "project_id" $os.projectId }}
{{- end }}
{{- if $os.existingSecret }}
{{- $_ := set $osConfig "credentials_json" "env.BIFROST_OBJECT_STORAGE_CREDENTIALS_JSON" }}
{{- else if $os.credentialsJson }}
{{- $_ := set $osConfig "credentials_json" $os.credentialsJson }}
{{- end }}
{{- end }}
{{- $_ := set (index $config "logs_store") "object_storage" $osConfig }}
{{- end }}
```
Secret-backed S3 creds only work when the `*Key` fields are explicitly set.
When `existingSecret` is used, `deployment.yaml` and `stateful.yaml` still inject `BIFROST_OBJECT_STORAGE_ACCESS_KEY_ID` / `...SECRET_ACCESS_KEY` / `...SESSION_TOKEN` using default secret key names. This helper, though, only writes those env placeholders into `logs_store.object_storage` when `accessKeyIdKey`, `secretAccessKeyKey`, or `sessionTokenKey` are present in values. With just `existingSecret`, the rendered config silently omits the credentials.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@helm-charts/bifrost/templates/_helpers.tpl` around lines 578 - 635, The
template silently omits S3 credential placeholders when
.Values.storage.logsStore.objectStorage.existingSecret is true but the specific
*Key fields (accessKeyIdKey, secretAccessKeyKey, sessionTokenKey) are not set;
update the helper that builds $osConfig (the block handling $os.existingSecret)
to always set access_key_id => "env.BIFROST_OBJECT_STORAGE_ACCESS_KEY_ID",
secret_access_key => "env.BIFROST_OBJECT_STORAGE_SECRET_ACCESS_KEY", and
session_token => "env.BIFROST_OBJECT_STORAGE_SESSION_TOKEN" when
$os.existingSecret is true (instead of only when
accessKeyIdKey/secretAccessKeyKey/sessionTokenKey are present), keeping role_arn
=> "env.BIFROST_OBJECT_STORAGE_ROLE_ARN" as-is so logs_store.object_storage
receives the expected env placeholders.
```yaml
{{- /* Override connection_string with env var placeholder when secretRef is set */ -}}
{{- if and $client.secretRef $client.secretRef.name }}
{{- $envName := printf "BIFROST_MCP_%s_CONNECTION_STRING" (regexReplaceAll "[^A-Z0-9]+" (upper $client.name) "_") }}
{{- $_ := set $cc "connection_string" (printf "env.%s" $envName) }}
{{- end }}
```
secretRef MCP clients still trip Helm validation.
Lines 818-822 make `connection_string` secret-backed, but `bifrost.validate` below still requires `httpConfig.url` for HTTP clients and never checks `connectionString` for SSE clients. HTTP clients that rely on `secretRef`/`connectionString` will fail chart rendering even though the rendered schema supports them.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@helm-charts/bifrost/templates/_helpers.tpl` around lines 818 - 822, The Helm
template makes connection_string come from a secret placeholder (see the block
that sets $cc "connection_string" to "env.%s") but the downstream
bifrost.validate logic still requires httpConfig.url and doesn't accept
connectionString/secret-backed values for SSE clients; update the validation in
bifrost.validate to treat a non-empty connection_string (including the env.*
placeholder pattern) as a valid endpoint alternative to httpConfig.url and
ensure the SSE client branch checks $cc.connection_string or
$client.connectionString (or the env.* form) instead of only httpConfig.url so
secretRef-backed clients pass Helm validation.
```diff
 "trace_type": {
   "type": "string",
   "enum": [
-    "otel"
+    "genai_extension",
     "vercel",
     "open_inference"
   ],
   "description": "Type of trace to use for the OTEL collector"
 },
```
Breaking change: trace_type enum value "otel" removed.
The trace_type enum now accepts "genai_extension", "vercel", and "open_inference" — the previous "otel" value has been removed. Database-backed configurations are handled by migration migrationNormalizeOtelTraceType, but file-based config users must manually rename "otel" to "genai_extension".
Ensure this breaking change is documented in release notes for v1.5.0 users upgrading from earlier versions. Based on learnings: this is an intentional change where file-based config users must manually rename the value.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@helm-charts/bifrost/values.schema.json` around lines 779 - 787, The enum for
trace_type in values.schema.json has removed the "otel" value (now only
"genai_extension", "vercel", "open_inference"); update release notes for v1.5.0
to call this out as a breaking change, reference migrationNormalizeOtelTraceType
for DB-backed upgrades, and explicitly instruct file-based config users to
manually rename any "otel" values to "genai_extension" when upgrading from
earlier versions.
```yaml
# # Optional: source connection_string from a Kubernetes secret.
# # When set, chart injects BIFROST_MCP_<NAME>_CONNECTION_STRING
# # into the pod and rewrites connection_string in config.json
# # to `env.BIFROST_MCP_<NAME>_CONNECTION_STRING`.
# secretRef:
#   name: "" # k8s secret name
#   connectionStringKey: "connection-string" # key within the secret
```
Move the secretRef example off the stdio sample.
secretRef only feeds connection_string, so placing it under the stdio example suggests a combination that won't do anything useful. Put this example under HTTP/SSE or call out that it only applies to connection-string-based clients.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@helm-charts/bifrost/values.yaml` around lines 334 - 340, The secretRef
example currently shown under the stdio client sample is misleading because
secretRef only supplies connection_string; move the secretRef block (the
name/connectionStringKey example and its explanation about injecting
BIFROST_MCP_<NAME>_CONNECTION_STRING and rewriting connection_string in
config.json) out of the stdio example and place it under the HTTP/SSE client
sample (or a shared "connection-string-based clients" section), or alternatively
add a short note next to secretRef that it only applies to
connection-string-based clients and will not affect stdio.
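A hypothetical values.yaml placement under an HTTP client, as the prompt suggests. The client name, URL, secret name, and the top-level list key are invented for illustration; only `httpConfig.url`, `secretRef.name`, and `connectionStringKey` are keys named elsewhere in this review:

```yaml
# Illustrative only: secretRef paired with a connection-string-capable client.
mcpClients:
  - name: "search"
    connectionType: "http"
    httpConfig:
      url: "https://mcp.example.internal/stream"
    # Applies only to connection-string-based clients (HTTP/SSE), not stdio.
    secretRef:
      name: "mcp-search-secret"
      connectionStringKey: "connection-string"
```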
```diff
 "mcp_client_id": {
   "type": "integer",
-  "description": "Associated MCP client ID"
+  "description": "Associated MCP client ID (database format)"
 },
 "mcp_client_name": {
   "type": "string",
   "description": "MCP client name (config file format \u2014 resolved to mcp_client_id at startup)"
 },
```
Require an MCP client reference here.
This object can currently validate with neither mcp_client_id nor mcp_client_name, even though startup has no way to resolve the target MCP server in that case.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@transports/config.schema.json` around lines 1848 - 1855, The schema currently
allows the object to be valid without either mcp_client_id or mcp_client_name;
update the JSON Schema to require at least one of these fields (use anyOf or
oneOf with two branches: one requiring ["mcp_client_id"] and the other requiring
["mcp_client_name"]) so that the object cannot validate unless an MCP client
reference is provided; apply this change where the mcp_client_id and
mcp_client_name properties are defined in transports/config.schema.json.
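A minimal sketch of the `anyOf` constraint the prompt describes (property definitions abbreviated; exact placement inside the object's schema is assumed):

```json
{
  "type": "object",
  "properties": {
    "mcp_client_id": { "type": "integer" },
    "mcp_client_name": { "type": "string" }
  },
  "anyOf": [
    { "required": ["mcp_client_id"] },
    { "required": ["mcp_client_name"] }
  ]
}
```

With `anyOf`, an object supplying both references still validates; switch to `oneOf` if exactly one must be present.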
```diff
 "type": "string",
 "enum": [
   "stdio",
   "websocket",
   "http",
-  "sse"
+  "sse",
+  "inprocess"
 ],
```
inprocess is impossible to validate right now.
Line 2747 adds inprocess, but the oneOf at Lines 2870-2920 still has no matching branch for it. Any inprocess MCP client will fail schema validation until it gets its own case.
Also applies to: 2870-2920
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@transports/config.schema.json` around lines 2742 - 2748, The schema enum for
transport "type" was extended with "inprocess" but the oneOf discriminator later
still lacks a matching branch, causing validation failures; add a new oneOf
branch for the "inprocess" client that matches {"type":"inprocess"} (i.e., a
schema object with properties.type const/enum "inprocess" and required ["type"]
and whatever specific properties the in-process transport needs, or allow
additionalProperties if none are required) so the oneOf includes a case for the
inprocess transport alongside the existing stdio/http/sse branches.
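A hedged sketch of the missing branch to append to the existing `oneOf`, assuming the in-process transport requires no properties beyond `type` (if it does, they belong in this branch):

```json
{
  "properties": {
    "type": { "const": "inprocess" }
  },
  "required": ["type"]
}
```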
```json
"scope_kind": {
  "type": "string",
  "description": "Scope level for this override",
  "enum": [
    "global",
    "provider",
    "provider_key",
    "virtual_key",
    "virtual_key_provider",
    "virtual_key_provider_key"
  ]
},
"virtual_key_id": {
  "type": "string",
  "description": "Virtual key ID (required for virtual_key* scopes)"
},
"provider_id": {
  "type": "string",
  "description": "Provider ID (required for provider* scopes)"
},
"provider_key_id": {
  "type": "string",
  "description": "Provider key ID (required for provider_key and virtual_key_provider_key scopes)"
},
"match_type": {
  "type": "string",
  "description": "How the pattern is matched against model names",
  "enum": [
    "exact",
    "wildcard"
  ]
},
"pattern": {
  "type": "string",
  "description": "Model name pattern to match (exact name or wildcard prefix ending with *)"
},
"request_types": {
  "type": "array",
  "description": "Request types this override applies to. At least one value is required.",
  "minItems": 1,
  "items": {
    "type": "string"
  }
}
```
Tighten provider_pricing_override validation.
Two critical contracts are only documented today: non-global scope_kind values never require their corresponding virtual_key_id / provider_id / provider_key_id, and request_types accepts any string even though pricing_override_request_type is defined below. That lets unusable overrides pass schema validation.
Also applies to: 3830-3863
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@transports/config.schema.json` around lines 3777 - 3819, The
provider_pricing_override schema is too permissive: make virtual_key_id,
provider_id, and provider_key_id required conditionally based on scope_kind and
constrain request_types to the existing pricing_override_request_type
definition. Update the schema for the object that defines
scope_kind/virtual_key_id/provider_id/provider_key_id/request_types so that: (1)
when scope_kind is "virtual_key" or "virtual_key_provider" or
"virtual_key_provider_key" the virtual_key_id property is required; (2) when
scope_kind is "provider" or "provider_key" or "virtual_key_provider" or
"virtual_key_provider_key" the provider_id property is required; (3) when
scope_kind is "provider_key" or "virtual_key_provider_key" the provider_key_id
property is required; implement these with JSON Schema conditional subschemas
(if/then or anyOf) tied to scope_kind; and (4) change request_types.items to
reference or enumerate the existing pricing_override_request_type (use $ref to
the pricing_override_request_type definition or copy its enum) instead of
allowing arbitrary strings.
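A sketch of the conditional subschemas the prompt calls for. The `$ref` path to `pricing_override_request_type` is assumed; adjust it to wherever that definition actually lives in the schema:

```json
"allOf": [
  {
    "if": {
      "properties": { "scope_kind": { "enum": ["virtual_key", "virtual_key_provider", "virtual_key_provider_key"] } },
      "required": ["scope_kind"]
    },
    "then": { "required": ["virtual_key_id"] }
  },
  {
    "if": {
      "properties": { "scope_kind": { "enum": ["provider", "provider_key", "virtual_key_provider", "virtual_key_provider_key"] } },
      "required": ["scope_kind"]
    },
    "then": { "required": ["provider_id"] }
  },
  {
    "if": {
      "properties": { "scope_kind": { "enum": ["provider_key", "virtual_key_provider_key"] } },
      "required": ["scope_kind"]
    },
    "then": { "required": ["provider_key_id"] }
  }
]
```

Note the `"required": ["scope_kind"]` inside each `if`: without it, an object that omits `scope_kind` would vacuously match every condition and be forced to carry all three IDs. For `request_types`, replace `"items": { "type": "string" }` with `"items": { "$ref": "#/definitions/pricing_override_request_type" }` (path assumed).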
## Summary
Improves error handling and logging for streaming responses by ensuring error messages and codes are properly propagated, fixing chunk indexing for error responses, and enhancing log data retention during streaming failures.

## Changes
- Enhanced OpenAI streaming error handling to extract error messages and codes from nested response structures
- Fixed streaming accumulator to handle cases where only error information is available (no result object)
- Improved error chunk indexing to prevent collision with successful response chunks by using unique trailing indices
- Added raw response data to error chunks for better debugging visibility
- Enhanced logging plugin to finalize and capture accumulated stream metadata before processing errors
- Prevented overwriting of existing raw request/response data in error logs

## Type of change
- [x] Bug fix
- [ ] Feature
- [ ] Refactor
- [ ] Documentation
- [ ] Chore/CI

## Affected areas
- [x] Core (Go)
- [ ] Transports (HTTP)
- [x] Providers/Integrations
- [x] Plugins
- [ ] UI (Next.js)
- [ ] Docs

## How to test
Test streaming error scenarios to verify proper error propagation and logging:

```sh
# Core/Transports
go version
go test ./...

# Test streaming responses with errors
# Verify error messages are properly extracted and logged
# Check that chunk indices don't collide between success and error responses
# Ensure raw response data is preserved in error cases
```

## Screenshots/Recordings
N/A

## Breaking changes
- [ ] Yes
- [x] No

## Related issues
N/A

## Security considerations
No security implications - this change improves error handling and logging without exposing additional sensitive data.

## Checklist
- [ ] I read `docs/contributing/README.md` and followed the guidelines
- [ ] I added/updated tests where appropriate
- [ ] I updated documentation where needed
- [ ] I verified builds succeed (Go and UI)
- [ ] I verified the CI pipeline passes locally if applicable
Actionable comments posted: 1
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Inline comments:
In `@framework/streaming/responses.go`:
- Around line 907-917: The synthetic terminal-error chunk currently gets a new
highest ChunkIndex (via getOrCreateStreamAccumulator,
accumulator.MaxResponsesChunkIndex and accumulator.TerminalErrorChunkIndex),
causing processAccumulatedResponsesStreamingChunks to pick the error chunk
(which lacks TokenUsage, SemanticCacheDebug, and Cost); fix by copying the
last-known non-nil metadata from the accumulator's most recent real response
into the synthetic chunk before unlocking (e.g., set chunk.TokenUsage,
chunk.SemanticCacheDebug, chunk.Cost from the accumulator's last
metadata/responses) so the terminal-error chunk preserves final
usage/cache/cost, or alternatively modify
processAccumulatedResponsesStreamingChunks to prefer the highest-index chunk
with non-nil metadata instead of blindly taking the max-index chunk.
ℹ️ Review info
⚙️ Run configuration
Configuration used: Organization UI
Review profile: CHILL
Plan: Pro
Run ID: 51332d15-fea9-439c-a2ea-44c7de0ca1f2
📒 Files selected for processing (6)
- core/providers/openai/openai.go
- framework/streaming/accumulator.go
- framework/streaming/responses.go
- framework/streaming/types.go
- plugins/logging/main.go
- plugins/logging/operations.go
```go
// Assign a stable trailing index; reuse on duplicate plugin calls so dedup fires correctly.
accumulator := a.getOrCreateStreamAccumulator(requestID)
accumulator.mu.Lock()
if accumulator.TerminalErrorChunkIndex >= 0 {
	chunk.ChunkIndex = accumulator.TerminalErrorChunkIndex
} else {
	accumulator.MaxResponsesChunkIndex++
	chunk.ChunkIndex = accumulator.MaxResponsesChunkIndex
	accumulator.TerminalErrorChunkIndex = chunk.ChunkIndex
}
accumulator.mu.Unlock()
```
Synthetic error chunks now hide the last real metadata chunk.
processAccumulatedResponsesStreamingChunks later reads usage/cache/cost exclusively from the highest ChunkIndex (Lines 845-857). By bumping MaxResponsesChunkIndex for this synthetic terminal-error chunk, finalization will prefer the error chunk—which has no TokenUsage, SemanticCacheDebug, or Cost—over the last real provider chunk. That means streamed error logs can still lose the pre-error metadata this path is trying to preserve. Please either copy the previous metadata onto the synthetic chunk or make finalization fall back to the latest non-nil metadata instead of only the max-index chunk.
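The second option (prefer the highest-index chunk that still carries metadata, rather than blindly the max-index chunk) can be sketched in isolation. The `Chunk` type and `pickFinalChunk` here are illustrative stand-ins, not the framework's actual types:

```go
package main

import "fmt"

// Chunk loosely mirrors an accumulated stream chunk; field names are
// illustrative, not the real framework/streaming types.
type Chunk struct {
	ChunkIndex int
	TokenUsage *int // nil when the chunk carries no usage metadata
}

// pickFinalChunk prefers the highest-index chunk with non-nil metadata,
// falling back to the overall max-index chunk (e.g. a synthetic error chunk).
func pickFinalChunk(chunks []Chunk) *Chunk {
	var best, withMeta *Chunk
	for i := range chunks {
		c := &chunks[i]
		if best == nil || c.ChunkIndex > best.ChunkIndex {
			best = c
		}
		if c.TokenUsage != nil && (withMeta == nil || c.ChunkIndex > withMeta.ChunkIndex) {
			withMeta = c
		}
	}
	if withMeta != nil {
		return withMeta
	}
	return best
}

func main() {
	usage := 42
	// Chunk 1 is the metadata-less synthetic terminal-error chunk.
	chunks := []Chunk{{ChunkIndex: 0, TokenUsage: &usage}, {ChunkIndex: 1}}
	fmt.Println(pickFinalChunk(chunks).ChunkIndex) // prints 0
}
```

The alternative fix (copying the last real chunk's metadata onto the synthetic chunk) keeps the max-index selection untouched but must happen while the accumulator lock is still held.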
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@framework/streaming/responses.go` around lines 907 - 917, The synthetic
terminal-error chunk currently gets a new highest ChunkIndex (via
getOrCreateStreamAccumulator, accumulator.MaxResponsesChunkIndex and
accumulator.TerminalErrorChunkIndex), causing
processAccumulatedResponsesStreamingChunks to pick the error chunk (which lacks
TokenUsage, SemanticCacheDebug, and Cost); fix by copying the last-known non-nil
metadata from the accumulator's most recent real response into the synthetic
chunk before unlocking (e.g., set chunk.TokenUsage, chunk.SemanticCacheDebug,
chunk.Cost from the accumulator's last metadata/responses) so the terminal-error
chunk preserves final usage/cache/cost, or alternatively modify
processAccumulatedResponsesStreamingChunks to prefer the highest-index chunk
with non-nil metadata instead of blindly taking the max-index chunk.
Caution
Some comments are outside the diff and can’t be posted inline due to platform limitations.
⚠️ Outside diff range comments (1)
.github/workflows/scripts/validate-helm-config-fields.sh (1)
873-918: ⚠️ Potential issue | 🟡 Minor. Assert the rendered SCIM fields, not just that the config object exists.
These new fixtures only prove that `scim_config.config` is present. They do not verify the provider-specific field mapping, so regressions in rendered keys like the new Okta API token or Entra tenant/app identifiers would slip through this coverage. Please add `assert_field_value` checks for a few concrete provider-required fields in both variants. Based on learnings, `.github/workflows/scripts/validate-helm-config-fields.sh` is the script responsible for full field-value assertions on rendered `config.json`.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In @.github/workflows/scripts/validate-helm-config-fields.sh around lines 873 - 918, Add concrete assertions for provider-specific config keys in the SCIM fixtures: after render_config and the existing assert_field/ provider checks, call assert_field_value for Okta keys (e.g., assert_field_value 'scim_config.okta apiToken' '.scim_config.config.apiToken' '"okta-api-token"', and similarly for '.scim_config.config.clientId' '"okta-client-id"' or '.scim_config.config.issuerUrl' '"https://dev-123.okta.com/oauth2/default"') and for Entra keys (e.g., assert_field_value 'scim_config.entra tenantId' '.scim_config.config.tenantId' '"tenant-uuid"' and assert_field_value 'scim_config.entra appIdUri' '.scim_config.config.appIdUri' '"api://entra-client-id"'); place these directly after the corresponding render_config calls so the script validates the actual rendered provider config values.
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Outside diff comments:
In @.github/workflows/scripts/validate-helm-config-fields.sh:
- Around line 873-918: Add concrete assertions for provider-specific config keys
in the SCIM fixtures: after render_config and the existing assert_field/
provider checks, call assert_field_value for Okta keys (e.g., assert_field_value
'scim_config.okta apiToken' '.scim_config.config.apiToken' '"okta-api-token"',
and similarly for '.scim_config.config.clientId' '"okta-client-id"' or
'.scim_config.config.issuerUrl' '"https://dev-123.okta.com/oauth2/default"') and
for Entra keys (e.g., assert_field_value 'scim_config.entra tenantId'
'.scim_config.config.tenantId' '"tenant-uuid"' and assert_field_value
'scim_config.entra appIdUri' '.scim_config.config.appIdUri'
'"api://entra-client-id"'); place these directly after the corresponding
render_config calls so the script validates the actual rendered provider config
values.
ℹ️ Review info
⚙️ Run configuration
Configuration used: Organization UI
Review profile: CHILL
Plan: Pro
Run ID: bd04c592-17ab-4d8c-a90a-54859b790e22
📒 Files selected for processing (1)
.github/workflows/scripts/validate-helm-config-fields.sh
## Summary
Release Bifrost Helm chart version 2.1.1 with improved virtual key configuration flexibility. The `value` field for virtual keys is now optional, allowing the backend to auto-generate virtual key values when not explicitly provided.

## Changes
- Made `bifrost.governance.virtualKeys[].value` field optional in Helm templates
- Updated template logic to conditionally include the `value` field only when present
- Removed validation that previously required the `value` field to be set
- Updated documentation and examples to reflect the optional nature of the field
- Bumped chart version from 2.1.0-prerelease2 to 2.1.1

## Type of change
- [x] Feature
- [ ] Bug fix
- [ ] Refactor
- [ ] Documentation
- [ ] Chore/CI

## Affected areas
- [ ] Core (Go)
- [ ] Transports (HTTP)
- [ ] Providers/Integrations
- [ ] Plugins
- [ ] UI (Next.js)
- [ ] Docs

## How to test
Deploy the Helm chart with virtual keys configured both with and without the `value` field:

```sh
# Test with explicit value
helm install bifrost ./helm-charts/bifrost --set bifrost.governance.virtualKeys[0].id=vk-1 --set bifrost.governance.virtualKeys[0].name="Test Key" --set bifrost.governance.virtualKeys[0].value="vk-explicit-123"

# Test with auto-generated value (omit value field)
helm install bifrost ./helm-charts/bifrost --set bifrost.governance.virtualKeys[0].id=vk-2 --set bifrost.governance.virtualKeys[0].name="Auto Key"

# Verify the rendered config.json includes/omits the value field appropriately
helm template bifrost ./helm-charts/bifrost --values test-values.yaml
```

## Screenshots/Recordings
N/A

## Breaking changes
- [ ] Yes
- [x] No

This change is backward compatible - existing configurations with explicit `value` fields will continue to work unchanged.

## Related issues
N/A

## Security considerations
Auto-generated virtual key values should be cryptographically secure and generated by the backend service. The Helm chart no longer enforces explicit key values, delegating this responsibility to the application.

## Checklist
- [x] I read `docs/contributing/README.md` and followed the guidelines
- [x] I added/updated tests where appropriate
- [x] I updated documentation where needed
- [x] I verified builds succeed (Go and UI)
- [x] I verified the CI pipeline passes locally if applicable
Actionable comments posted: 2
♻️ Duplicate comments (4)
helm-charts/index.yaml (1)
8-8: ⚠️ Potential issue | 🟠 Major. Populate chart digests for newly added index entries.
The new chart records are committed with `digest: ""`, which drops artifact integrity metadata for these releases. Please regenerate `helm-charts/index.yaml` via the normal Helm index generation path (after packaging) so SHA256 digests are populated for each added version.
Also applies to: 29-29, 50-50
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@helm-charts/index.yaml` at line 8, The index.yaml entries were committed with empty digest values ("digest: \"\""), so regenerate the Helm index after packaging the charts to populate SHA256 digests: re-run helm package for the new chart versions and then run helm repo index (or the project's index generation script) against the chart directory to update index.yaml so each added version has a non-empty digest field; confirm the "digest" keys for the new entries are populated before committing.
helm-charts/bifrost/values.yaml (1)
334-340: ⚠️ Potential issue | 🟡 Minor. Move the `secretRef` example off the `stdio` sample.
`secretRef` only feeds `connection_string`, so placing it under a `stdio` client suggests a combination that does nothing. Put it under HTTP/SSE or add a note that it only applies to connection-string-based clients.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@helm-charts/bifrost/values.yaml` around lines 334 - 340, The example `secretRef` block under the `stdio` sample is misleading because `secretRef` only provides a `connection_string` (used to set BIFROST_MCP_<NAME>_CONNECTION_STRING and rewrite connection_string in config.json) and therefore has no effect for stdio clients; move the `secretRef` example out of the `stdio` sample into the HTTP/SSE client section (or a shared "connection-string-based clients" section) or add a one-line note next to `secretRef` stating "Applies only to connection-string-based clients (e.g., HTTP/SSE), not stdio" so readers know it only applies to clients that use connection_string.
helm-charts/bifrost/templates/_helpers.tpl (2)
767-770: ⚠️ Potential issue | 🟠 Major. The new `connection_string` paths still need matching validation support.
This helper now renders `connection_string` for SSE clients and for any client using `secretRef`, but `bifrost.validate` still only accepts `httpConfig.url`/`websocketConfig.url`. The result is the same broken edge cases as before: HTTP clients backed only by `secretRef` fail chart rendering, and `connectionType: sse` is not endpoint-validated at all.
Also applies to: 819-823
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@helm-charts/bifrost/templates/_helpers.tpl` around lines 767 - 770, Update the bifrost.validate logic to also recognize and validate the new connection_string key and SSE connectionType: when checking client endpoints, accept either httpConfig.url, websocketConfig.url or connection_string (and accept secretRef-backed clients by allowing secretRef as a valid source when no direct url is present); additionally ensure that connectionType == "sse" triggers the same endpoint validation as http (i.e., validate connection_string for SSE) so SSE clients are endpoint-validated too; look for the bifrost.validate helper and update its conditional checks to include "$client.connection_string" and to treat "sse" like "http" for validation, and ensure secretRef paths are considered valid when present.
596-606: ⚠️ Potential issue | 🟠 Major. Always write the S3 env placeholders when `existingSecret` is set.
With `existingSecret`, the pod env is injected using the default object-storage secret key names, but this helper only writes `access_key_id`, `secret_access_key`, and `session_token` into `logs_store.object_storage` when the optional `*Key` overrides are present. A plain `existingSecret` config therefore renders credential-less object storage.
Suggested fix:

```diff
 {{- if $os.existingSecret }}
-{{- if $os.accessKeyIdKey }}
 {{- $_ := set $osConfig "access_key_id" "env.BIFROST_OBJECT_STORAGE_ACCESS_KEY_ID" }}
-{{- end }}
-{{- if $os.secretAccessKeyKey }}
 {{- $_ := set $osConfig "secret_access_key" "env.BIFROST_OBJECT_STORAGE_SECRET_ACCESS_KEY" }}
-{{- end }}
-{{- if $os.sessionTokenKey }}
 {{- $_ := set $osConfig "session_token" "env.BIFROST_OBJECT_STORAGE_SESSION_TOKEN" }}
-{{- end }}
 {{- $_ := set $osConfig "role_arn" "env.BIFROST_OBJECT_STORAGE_ROLE_ARN" }}
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@helm-charts/bifrost/templates/_helpers.tpl` around lines 596 - 606, The helper currently only sets access_key_id/secret_access_key/session_token into $osConfig when the optional $os.accessKeyIdKey/$os.secretAccessKeyKey/$os.sessionTokenKey are present, causing credential-less object storage when only $os.existingSecret is true; change the block guarded by $os.existingSecret to always set "access_key_id", "secret_access_key", and "session_token" in $osConfig to the env placeholders (env.BIFROST_OBJECT_STORAGE_ACCESS_KEY_ID, env.BIFROST_OBJECT_STORAGE_SECRET_ACCESS_KEY, env.BIFROST_OBJECT_STORAGE_SESSION_TOKEN) while keeping the existing "role_arn" set, and remove the inner conditional checks so that $osConfig is populated whenever $os.existingSecret is true.
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Inline comments:
In `@helm-charts/bifrost/templates/_helpers.tpl`:
- Around line 230-245: Update the config struct in
framework/configstore/clientconfig.go (the ClientConfig type) to include the new
Helm-emitted fields so they aren’t dropped: add a Compat field (either a nested
struct or map) with JSON-tagged boolean fields convert_text_to_chat,
convert_chat_to_responses, should_drop_params, should_convert_params, plus add
McpDisableAutoToolInject bool `json:"mcp_disable_auto_tool_inject,omitempty"`
and RoutingChainMaxDepth int `json:"routing_chain_max_depth,omitempty"` to
ClientConfig; ensure the JSON tags exactly match the Helm keys (client.compat.*,
mcp_disable_auto_tool_inject, routing_chain_max_depth) so file-based/Helm
installs deserialize these values.
In `@helm-charts/bifrost/values.yaml`:
- Around line 166-172: The values.yaml docs describe using encryptionKeySecret
(name/key) to inject BIFROST_ENCRYPTION_KEY and render encryption_key:
"env.BIFROST_ENCRYPTION_KEY" into config.json, but the template in _helpers.tpl
only reads bifrost.encryptionKey; update the chart template logic to check
encryptionKeySecret.name (and key) as well as encryptionKey: when
encryptionKeySecret.name is set, ensure the rendered config.json contains
encryption_key: "env.BIFROST_ENCRYPTION_KEY" and that the pod spec mounts the
secret via envFrom/env or secretKeyRef (BIFROST_ENCRYPTION_KEY) so behavior
matches the documentation; modify the helper function(s) in _helpers.tpl that
produce encryption_key and the deployment/statefulset template that defines env
entries to prefer the secret ref when encryptionKeySecret is provided.
---
Duplicate comments:
In `@helm-charts/bifrost/templates/_helpers.tpl`:
- Around line 767-770: Update the bifrost.validate logic to also recognize and
validate the new connection_string key and SSE connectionType: when checking
client endpoints, accept either httpConfig.url, websocketConfig.url or
connection_string (and accept secretRef-backed clients by allowing secretRef as
a valid source when no direct url is present); additionally ensure that
connectionType == "sse" triggers the same endpoint validation as http (i.e.,
validate connection_string for SSE) so SSE clients are endpoint-validated too;
look for the bifrost.validate helper and update its conditional checks to
include "$client.connection_string" and to treat "sse" like "http" for
validation, and ensure secretRef paths are considered valid when present.
- Around line 596-606: The helper currently only sets
access_key_id/secret_access_key/session_token into $osConfig when the optional
$os.accessKeyIdKey/$os.secretAccessKeyKey/$os.sessionTokenKey are present,
causing credential-less object storage when only $os.existingSecret is true;
change the block guarded by $os.existingSecret to always set "access_key_id",
"secret_access_key", and "session_token" in $osConfig to the env placeholders
(env.BIFROST_OBJECT_STORAGE_ACCESS_KEY_ID,
env.BIFROST_OBJECT_STORAGE_SECRET_ACCESS_KEY,
env.BIFROST_OBJECT_STORAGE_SESSION_TOKEN) while keeping the existing "role_arn"
set, and remove the inner conditional checks so that $osConfig is populated
whenever $os.existingSecret is true.
In `@helm-charts/bifrost/values.yaml`:
- Around line 334-340: The example `secretRef` block under the `stdio` sample is
misleading because `secretRef` only provides a `connection_string` (used to set
BIFROST_MCP_<NAME>_CONNECTION_STRING and rewrite connection_string in
config.json) and therefore has no effect for stdio clients; move the `secretRef`
example out of the `stdio` sample into the HTTP/SSE client section (or a shared
"connection-string-based clients" section) or add a one-line note next to
`secretRef` stating "Applies only to connection-string-based clients (e.g.,
HTTP/SSE), not stdio" so readers know it only applies to clients that use
connection_string.
In `@helm-charts/index.yaml`:
- Line 8: The index.yaml entries were committed with empty digest values
("digest: \"\""), so regenerate the Helm index after packaging the charts to
populate SHA256 digests: re-run helm package for the new chart versions and then
run helm repo index (or the project's index generation script) against the chart
directory to update index.yaml so each added version has a non-empty digest
field; confirm the "digest" keys for the new entries are populated before
committing.
ℹ️ Review info
⚙️ Run configuration
Configuration used: Organization UI
Review profile: CHILL
Plan: Pro
Run ID: 03b0d88d-c43d-4c6d-9baa-546b7abd4880
📒 Files selected for processing (5)
- helm-charts/bifrost/Chart.yaml
- helm-charts/bifrost/README.md
- helm-charts/bifrost/templates/_helpers.tpl
- helm-charts/bifrost/values.yaml
- helm-charts/index.yaml
🚧 Files skipped from review as they are similar to previous changes (2)
- helm-charts/bifrost/Chart.yaml
- helm-charts/bifrost/README.md
```
{{- if .Values.bifrost.client.compat }}
{{- $compat := dict }}
{{- if hasKey .Values.bifrost.client.compat "convertTextToChat" }}
{{- $_ := set $compat "convert_text_to_chat" .Values.bifrost.client.compat.convertTextToChat }}
{{- end }}
{{- if hasKey .Values.bifrost.client.compat "convertChatToResponses" }}
{{- $_ := set $compat "convert_chat_to_responses" .Values.bifrost.client.compat.convertChatToResponses }}
{{- end }}
{{- if hasKey .Values.bifrost.client.compat "shouldDropParams" }}
{{- $_ := set $compat "should_drop_params" .Values.bifrost.client.compat.shouldDropParams }}
{{- end }}
{{- if hasKey .Values.bifrost.client.compat "shouldConvertParams" }}
{{- $_ := set $compat "should_convert_params" .Values.bifrost.client.compat.shouldConvertParams }}
{{- end }}
{{- $_ := set $client "compat" $compat }}
{{- end }}
```
These new client keys are still dropped on the v1.5.0 config path.
The helper now emits client.compat, client.mcp_disable_auto_tool_inject, and client.routing_chain_max_depth, but the provided framework/configstore/clientconfig.go:38-65 on this branch still only exposes enable_litellm_fallbacks and does not define these JSON fields. Helm/file-based installs on this backmerge will silently ignore the new settings.
Also applies to: 300-305
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@helm-charts/bifrost/templates/_helpers.tpl` around lines 230 - 245, Update
the config struct in framework/configstore/clientconfig.go (the ClientConfig
type) to include the new Helm-emitted fields so they aren’t dropped: add a
Compat field (either a nested struct or map) with JSON-tagged boolean fields
convert_text_to_chat, convert_chat_to_responses, should_drop_params,
should_convert_params, plus add McpDisableAutoToolInject bool
`json:"mcp_disable_auto_tool_inject,omitempty"` and RoutingChainMaxDepth int
`json:"routing_chain_max_depth,omitempty"` to ClientConfig; ensure the JSON tags
exactly match the Helm keys (client.compat.*, mcp_disable_auto_tool_inject,
routing_chain_max_depth) so file-based/Helm installs deserialize these values.
```yaml
# Use an existing Kubernetes secret for the encryption key.
# When `name` is set, takes precedence over `encryptionKey`: the chart
# injects BIFROST_ENCRYPTION_KEY into the pod via secretKeyRef and writes
# `encryption_key: "env.BIFROST_ENCRYPTION_KEY"` in the rendered config.json.
encryptionKeySecret:
  name: ""
  key: "encryption-key"
```
encryptionKeySecret is documented, but this chart path still doesn't render it into config.json.
These comments promise encryption_key: "env.BIFROST_ENCRYPTION_KEY", but _helpers.tpl still only sets encryption_key from bifrost.encryptionKey. Users who set only encryptionKeySecret.name will not get the rendered-config behavior described here.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@helm-charts/bifrost/values.yaml` around lines 166 - 172, The values.yaml docs
describe using encryptionKeySecret (name/key) to inject BIFROST_ENCRYPTION_KEY
and render encryption_key: "env.BIFROST_ENCRYPTION_KEY" into config.json, but
the template in _helpers.tpl only reads bifrost.encryptionKey; update the chart
template logic to check encryptionKeySecret.name (and key) as well as
encryptionKey: when encryptionKeySecret.name is set, ensure the rendered
config.json contains encryption_key: "env.BIFROST_ENCRYPTION_KEY" and that the
pod spec mounts the secret via envFrom/env or secretKeyRef
(BIFROST_ENCRYPTION_KEY) so behavior matches the documentation; modify the
helper function(s) in _helpers.tpl that produce encryption_key and the
deployment/statefulset template that defines env entries to prefer the secret
ref when encryptionKeySecret is provided.
## Summary

This PR updates the v1.5.0-prerelease3 changelog to focus on the most significant user-facing changes while removing detailed technical implementation notes. It also includes documentation updates for contributing guidelines, user provisioning setup guides, and various feature enhancements.

## Changes

- **Changelog consolidation**: Streamlined v1.5.0-prerelease3 changelog to highlight key features like OAuth MCP hints, Azure passthrough, 272k token tier pricing, and critical bug fixes
- **User provisioning documentation**: Added comprehensive setup guides for Google Workspace, Keycloak, and Zitadel identity providers with step-by-step configuration instructions
- **Enhanced attribute mapping**: Updated Okta and Entra setup guides with improved attribute-to-role mapping examples and custom attribute support
- **Documentation improvements**: Updated contributing guidelines, code conventions, and repository setup instructions to reflect current tooling (Vite instead of Next.js)
- **Configuration updates**: Restructured governance configuration examples to use separate budget and rate limit arrays with ID references
- **Provider enhancements**: Added raw request/response storage options and improved LangChain integration examples

## Type of change

- [ ] Bug fix
- [x] Feature
- [ ] Refactor
- [x] Documentation
- [ ] Chore/CI

## Affected areas

- [ ] Core (Go)
- [ ] Transports (HTTP)
- [ ] Providers/Integrations
- [ ] Plugins
- [ ] UI (Next.js)
- [x] Docs

## How to test

Validate documentation changes by reviewing the updated guides:

```sh
# Verify documentation builds correctly
cd docs
npm install
npm run build

# Test specific provider setup guides
# - Navigate to /enterprise/setting-up-google-workspace
# - Navigate to /enterprise/setting-up-keycloak
# - Navigate to /enterprise/setting-up-zitadel
# - Verify all images and links work correctly

# Validate changelog formatting
# - Check /changelogs/v1.5.0-prerelease3 renders properly
# - Ensure all feature descriptions are clear and concise
```

## Screenshots/Recordings

N/A - Documentation-only changes

## Breaking changes

- [ ] Yes
- [x] No

## Related issues

Updates documentation to reflect current product capabilities and streamlines changelog for better user experience.

## Security considerations

The user provisioning guides include proper security practices for OAuth client setup, service account permissions, and domain-wide delegation configuration across multiple identity providers.

## Checklist

- [x] I read `docs/contributing/README.md` and followed the guidelines
- [x] I added/updated tests where appropriate
- [x] I updated documentation where needed
- [x] I verified builds succeed (Go and UI)
- [x] I verified the CI pipeline passes locally if applicable
## Summary

The Go workspace setup script was not specifying a `go` directive or toolchain version, which caused `GOTOOLCHAIN=auto` to select a Go version lower than what `core@v1.4.19` requires. This adds an explicit `go 1.26.2` and `toolchain go1.26.2` directive to the workspace so the correct toolchain is used automatically.

## Changes

- Added `go work edit -go=1.26.2 -toolchain=go1.26.2` to `setup-go-workspace.sh` so that `GOTOOLCHAIN=auto` selects Go >= 1.26.2, satisfying the minimum version required by the published `core@v1.4.19` module referenced in `transports/go.mod`.

## Type of change

- [ ] Bug fix
- [ ] Feature
- [ ] Refactor
- [ ] Documentation
- [x] Chore/CI

## Affected areas

- [ ] Core (Go)
- [x] Transports (HTTP)
- [ ] Providers/Integrations
- [ ] Plugins
- [ ] UI (Next.js)
- [ ] Docs

## How to test

```sh
# Verify the workspace is initialized with the correct Go version
bash .github/workflows/scripts/setup-go-workspace.sh
grep -E "^go |^toolchain" go.work
# Expected output:
# go 1.26.2
# toolchain go1.26.2

go test ./...
```

## Breaking changes

- [ ] Yes
- [x] No

## Related issues

## Security considerations

None.

## Checklist

- [ ] I read `docs/contributing/README.md` and followed the guidelines
- [ ] I added/updated tests where appropriate
- [ ] I updated documentation where needed
- [ ] I verified builds succeed (Go and UI)
- [ ] I verified the CI pipeline passes locally if applicable
## Summary

Temporarily disables the `test-docker-image-amd64` and `test-docker-image-arm64` CI jobs in the release pipeline by commenting them out.

## Changes

- Both Docker image test jobs (`test-docker-image-amd64` and `test-docker-image-arm64`) have been commented out rather than removed, preserving the full job definitions for easy re-enablement later.

## Type of change

- [ ] Bug fix
- [ ] Feature
- [ ] Refactor
- [ ] Documentation
- [x] Chore/CI

## Affected areas

- [ ] Core (Go)
- [ ] Transports (HTTP)
- [ ] Providers/Integrations
- [ ] Plugins
- [ ] UI (Next.js)
- [ ] Docs

## How to test

No functional code changes. Verify the release pipeline runs without executing the Docker image test jobs.

## Breaking changes

- [ ] Yes
- [x] No

## Related issues

N/A

## Security considerations

No security implications.

## Checklist

- [ ] I read `docs/contributing/README.md` and followed the guidelines
- [ ] I added/updated tests where appropriate
- [ ] I updated documentation where needed
- [ ] I verified builds succeed (Go and UI)
- [ ] I verified the CI pipeline passes locally if applicable
## Summary

The `test-docker-image-amd64` and `test-docker-image-arm64` CI jobs have been removed from the release pipeline. These jobs were already commented out and non-functional, and all references to them as dependencies and gate conditions in downstream release jobs have been cleaned up.

## Changes

- Deleted the commented-out `test-docker-image-amd64` and `test-docker-image-arm64` job definitions from the release pipeline.
- Removed `test-docker-image-amd64` and `test-docker-image-arm64` from the `needs` arrays of `core-release`, `framework-release`, `plugins-release`, `bifrost-http-release`, the Docker build/push jobs, the manifest job, and the final notification job.
- Removed the corresponding result checks for those two jobs from all `if` conditions in the affected release jobs.

## Type of change

- [ ] Bug fix
- [ ] Feature
- [ ] Refactor
- [ ] Documentation
- [x] Chore/CI

## Affected areas

- [ ] Core (Go)
- [ ] Transports (HTTP)
- [ ] Providers/Integrations
- [ ] Plugins
- [ ] UI (Next.js)
- [ ] Docs

## How to test

Trigger the release pipeline and confirm that all release jobs proceed without waiting on or referencing the removed Docker image test jobs.

## Breaking changes

- [ ] Yes
- [x] No

## Related issues

## Security considerations

None.

## Checklist

- [ ] I read `docs/contributing/README.md` and followed the guidelines
- [ ] I added/updated tests where appropriate
- [ ] I updated documentation where needed
- [ ] I verified builds succeed (Go and UI)
- [ ] I verified the CI pipeline passes locally if applicable
## Summary

Bumps the Go version used across all release pipeline jobs from `1.26.1` to `1.26.2` to keep the CI environment on the latest patch release.

## Changes

- Updated Go version from `1.26.1` to `1.26.2` in all `setup-go` steps within the release pipeline workflow.

## Type of change

- [ ] Bug fix
- [ ] Feature
- [ ] Refactor
- [ ] Documentation
- [x] Chore/CI

## Affected areas

- [ ] Core (Go)
- [ ] Transports (HTTP)
- [ ] Providers/Integrations
- [ ] Plugins
- [ ] UI (Next.js)
- [ ] Docs

## How to test

The release pipeline will use the updated Go version on the next run. No additional manual steps are required beyond verifying the CI pipeline passes.

```sh
go version
go test ./...
```

## Screenshots/Recordings

N/A

## Breaking changes

- [ ] Yes
- [x] No

## Related issues

N/A

## Security considerations

Patch releases often include security and bug fixes. Staying on the latest patch version reduces exposure to known vulnerabilities in the Go toolchain.

## Checklist

- [x] I read `docs/contributing/README.md` and followed the guidelines
- [ ] I added/updated tests where appropriate
- [ ] I updated documentation where needed
- [x] I verified builds succeed (Go and UI)
- [x] I verified the CI pipeline passes locally if applicable
Actionable comments posted: 10
Caution
Some comments are outside the diff and can’t be posted inline due to platform limitations.
⚠️ Outside diff range comments (1)
core/providers/anthropic/chat.go (1)
412-468: ⚠️ Potential issue | 🟠 Major

**Reconcile `tool_choice` against the filtered tool set.**

`ValidateChatToolsForProvider` can remove a server tool here, but the later `ToolChoice` block still forwards the original pin/`any` contract from `bifrostReq.Params.ToolChoice`. That leaves requests where `tool_choice` references a tool that is no longer present in `anthropicReq.Tools`, which Anthropic will reject.

🔧 Suggested fix

```diff
- filtered, _ := ValidateChatToolsForProvider(bifrostReq.Params.Tools, bifrostReq.Provider)
+ filtered, _ := ValidateChatToolsForProvider(bifrostReq.Params.Tools, bifrostReq.Provider)
+ survivingToolNames := make(map[string]struct{}, len(filtered))
  tools := make([]AnthropicTool, 0, len(filtered))
  for _, tool := range filtered {
      if tool.Function != nil {
+         survivingToolNames[tool.Function.Name] = struct{}{}
          tools = append(tools, convertFunctionToolToAnthropic(tool))
          continue
      }
      // Non-function tool: attempt server-tool reconstruction.
      if converted, ok := convertServerToolToAnthropic(tool); ok {
+         if tool.Name != "" {
+             survivingToolNames[tool.Name] = struct{}{}
+         }
          tools = append(tools, converted)
      }
  }
@@
- anthropicReq.ToolChoice = toolChoice
+ if toolChoice.Type == "tool" {
+     if _, ok := survivingToolNames[toolChoice.Name]; !ok {
+         toolChoice = nil
+     }
+ } else if toolChoice.Type == "any" && len(survivingToolNames) == 0 {
+     toolChoice = nil
+ }
+ if toolChoice != nil {
+     anthropicReq.ToolChoice = toolChoice
+ }
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@core/providers/anthropic/chat.go` around lines 412 - 468, The tool_choice must be reconciled against the filtered anthropicReq.Tools list produced after ValidateChatToolsForProvider to avoid referencing a removed tool; after you build anthropicReq.Tools (see ValidateChatToolsForProvider, convertFunctionToolToAnthropic, convertServerToolToAnthropic and anthropicReq.Tools) validate any pin/name set on bifrostReq.Params.ToolChoice.ChatToolChoiceStruct.Function.Name (and equivalents) exists in anthropicReq.Tools (match on tool name); if the referenced tool is missing, clear the name and set toolChoice.Type to "none" (or fall back to "auto"/"any" only when the original choice was non-specific) so we never send a tool_choice that references a tool not present in anthropicReq.Tools.
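The reconciliation the suggested fix performs can be sketched in isolation. All types below are simplified stand-ins for the provider's request and tool types, not the real Anthropic converter API:

```go
package main

import "fmt"

// Tool and ToolChoice are illustrative stand-ins for the provider types.
type Tool struct{ Name string }

type ToolChoice struct {
	Type string // "auto", "any", "tool", "none"
	Name string // set when Type == "tool"
}

// reconcileToolChoice drops a tool_choice that pins a tool which was
// filtered out, and drops "any" when no tools survive at all, so the
// outbound request never references a tool absent from the tool list.
func reconcileToolChoice(choice *ToolChoice, tools []Tool) *ToolChoice {
	if choice == nil {
		return nil
	}
	surviving := make(map[string]struct{}, len(tools))
	for _, t := range tools {
		surviving[t.Name] = struct{}{}
	}
	switch choice.Type {
	case "tool":
		if _, ok := surviving[choice.Name]; !ok {
			return nil // pinned tool was filtered away
		}
	case "any":
		if len(surviving) == 0 {
			return nil // nothing left to force
		}
	}
	return choice
}

func main() {
	tools := []Tool{{Name: "get_weather"}}
	// Pinned tool was removed by filtering: choice is dropped.
	fmt.Println(reconcileToolChoice(&ToolChoice{Type: "tool", Name: "screenshot"}, tools) == nil)
	// Pinned tool survived: choice is kept.
	fmt.Println(reconcileToolChoice(&ToolChoice{Type: "tool", Name: "get_weather"}, tools) != nil)
}
```

The same check could instead downgrade to "auto" rather than dropping the choice entirely; which fallback is correct depends on the caller's contract.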
♻️ Duplicate comments (9)
.github/workflows/release-pipeline.yml (3)
1034-1058: ⚠️ Potential issue | 🟠 Major

**Add the Go checksum DB to both binary-build allowlists.**

Both jobs set up Go and build binaries under block-mode egress, but neither allowlist includes `sum.golang.org:443`. Go validates module and toolchain downloads against the checksum database, and `GOSUMDB` defaults to `sum.golang.org`, so fresh runners or cache misses can fail before the build starts. (go.dev)

Suggested patch

```diff
  proxy.golang.org:443
+ sum.golang.org:443
  pypi.org:443
```

Apply the same addition in both allowlist blocks.
Also applies to: 1110-1127
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In @.github/workflows/release-pipeline.yml around lines 1034 - 1058, The Harden Runner allowlist blocks for the binary-build jobs (the step using step-security/harden-runner@fa2e9d6... with the "egress-policy: block" and "allowed-endpoints" keys) are missing the Go checksum DB; add "sum.golang.org:443" to the allowed-endpoints list in both places where that Harden Runner step appears so Go module/toolchain checksum validation can reach the default GOSUMDB during builds.
1241-1256: ⚠️ Potential issue | 🟠 Major

**Allow the GitHub API in both Docker-build jobs.**

These jobs run `verify-bifrost-http-release.sh` with `GH_TOKEN`, but the new allowlists only permit `github.com:443`. If that script still uses `gh` or direct GitHub API calls, block mode will break the verification step.

Use this to verify whether the script needs `api.github.com:443` and whether the current allowlists already include it.

```sh
#!/bin/bash
set -euo pipefail

script=.github/workflows/scripts/verify-bifrost-http-release.sh

echo "== Verification script =="
nl -ba "$script" | sed -n '1,220p'
echo
echo "== GitHub-related calls in verification script =="
rg -n 'gh[[:space:]]+release[[:space:]]+view|gh[[:space:]]+api|api\.github\.com|curl.*github' "$script"
echo
echo "== Current Docker-build allowlists =="
rg -n -C2 'docker-build-amd64:|docker-build-arm64:|allowed-endpoints:|github\.com:443|api\.github\.com:443' .github/workflows/release-pipeline.yml
```

Also applies to: 1326-1340
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In @.github/workflows/release-pipeline.yml around lines 1241 - 1256, The Harden Runner step's allowed-endpoints only includes github.com:443 but verification uses GH_TOKEN and may call the GitHub API (verify-bifrost-http-release.sh), so update the Harden Runner blocks used by the docker-build jobs (e.g., the steps named "Harden Runner" inside the docker-build-amd64 and docker-build-arm64 jobs) to also include api.github.com:443 in allowed-endpoints; run the provided shell checks against .github/workflows/scripts/verify-bifrost-http-release.sh (searching for gh api/gh release or api.github.com/curl github) to confirm if api.github.com:443 is required before committing the change.
420-423: ⚠️ Potential issue | 🟠 Major

**Verify these internal-service allowlists against the compose network.**

These blocks now hard-code `172.38.0.2:5432` and `172.38.0.12:8301`. That only works if the compose config pins those exact addresses and something actually listens on `8301`; otherwise the new block-mode policy will make these jobs fail nondeterministically.

Use this to confirm whether Postgres is explicitly pinned to `172.38.0.2`, whether any service exposes `8301`, and whether the test scripts actually target those addresses.

```sh
#!/bin/bash
set -euo pipefail

echo "== Compose network pins =="
nl -ba .github/workflows/configs/docker-compose.yml | sed -n '1,260p' | rg -n -C2 'postgres:|weaviate:|ipv4_address|subnet|8301'
echo
echo "== Workflow allowlist references =="
rg -n '172\.38\.0\.2:5432|172\.38\.0\.12:8301' .github/workflows/release-pipeline.yml
echo
echo "== Script references =="
rg -n '172\.38\.0\.2|172\.38\.0\.12|8301|5432' \
  .github/workflows/scripts/test-bifrost-http.sh \
  .github/workflows/scripts/run-migration-tests.sh \
  .github/workflows/scripts/test-e2e-ui.sh
```

Also applies to: 495-496, 560-561
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In @.github/workflows/release-pipeline.yml around lines 420 - 423, The allowlist hard-codes internal addresses (172.38.0.2:5432 and 172.38.0.12:8301) under the allowed-endpoints key which can fail if the compose network doesn’t pin those IPs or if no service listens on 8301; inspect the docker-compose service entries (postgres, weaviate) for explicit ipv4_address/subnet or published ports and update the workflow to either reference service hostnames/ports (compose DNS) or dynamically derive addresses instead of hard-coding IPs, and also verify the test scripts (test-bifrost-http.sh, run-migration-tests.sh, test-e2e-ui.sh) target the same hostnames/ports or adapt them to not rely on fixed IPs so the new block-mode policy won’t block legitimate traffic.
.github/workflows/release-cli.yml (1)
23-28: ⚠️ Potential issue | 🟠 Major

**Harden the privileged jobs too.**

This only moves `check-version` to block mode. `release-cli` still runs with `GH_TOKEN` and R2 credentials under `egress-policy: audit`, and `push-mintlify-changelog` still runs with `GH_TOKEN` under audit as well. The highest-privilege jobs should be switched to `block` with explicit allowlists, not just the lowest-privilege one.

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In @.github/workflows/release-cli.yml around lines 23 - 28, Change the harden-runner step in the higher-privilege jobs (release-cli and push-mintlify-changelog, not just check-version) to use egress-policy: block and provide explicit allowed-endpoints entries; locate the Harden Runner usage block (uses: step-security/harden-runner@...) inside the release-cli and push-mintlify-changelog job definitions and update their with: egress-policy to block and mirror the allowed-endpoints allowlist (e.g., github.com:443 and any R2 or other service hostnames required) so those jobs run under a strict allowlist rather than audit mode.
core/bifrost.go (2)
3315-3334: ⚠️ Potential issue | 🟠 Major

**Update rollback is still failure-destructive.**

By the time these branches run, `oldPq` has already been drained and its workers are gone. Deleting `newPq` and removing the provider from `bifrost.providers` turns a bad config or slice-swap failure into an outage instead of restoring the previous provider.

Also applies to: 3346-3360
6140-6179: ⚠️ Potential issue | 🟠 Major

**The final drain still has an orphaned-send race.**

This helper exits on the first `default`, but producers can still win `pq.queue <- msg` afterward because senders are never quiesced first. Those requests have no worker and no second sweep, so the caller blocks until its own context is cancelled.

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@core/bifrost.go` around lines 6140 - 6179, drainQueueWithErrors can still miss messages because producers may win pq.queue <- msg after workers exit; fix by quiescing producers before the final sweep: add or use an existing ProviderQueue closing flag/operation (e.g. call pq.signalClosing() or set pq.isClosing=true) immediately after workers finish and before calling drainQueueWithErrors, and update the send path (tryRequest / any method that does pq.queue <- msg) to check pq.isClosing() and return the shutdown error instead of sending when true; then drainQueueWithErrors can safely loop until pq.queue is empty without races.
core/providers/anthropic/responses.go (2)
94-100: ⚠️ Potential issue | 🟠 Major

**Add a nil guard before dereferencing `ctx` in the stream-state accessor.**

Line 95 and Line 99 dereference `ctx` unconditionally. If `ctx` is nil on any call path, this panics.

```sh
#!/bin/bash
# Verify whether nil context can reach ToAnthropicResponsesStreamResponse/getOrCreateAnthropicToResponsesStreamState.

# 1) Locate call sites of ToAnthropicResponsesStreamResponse
rg -nP --type=go -C2 '\bToAnthropicResponsesStreamResponse\s*\('

# 2) Check for explicit nil context at call sites
rg -nP --type=go '\bToAnthropicResponsesStreamResponse\s*\(\s*nil\s*,'
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@core/providers/anthropic/responses.go` around lines 94 - 100, The accessor getOrCreateAnthropicToResponsesStreamState currently dereferences ctx without checking for nil; add a nil guard at the top of the function (i.e. if ctx == nil) and return a newly allocated &anthropicToResponsesStreamState{} immediately so callers don't panic, otherwise continue with the existing logic using ctx.Value and ctx.SetValue; reference getOrCreateAnthropicToResponsesStreamState and anthropicToResponsesStreamStateKey when locating the change.
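A minimal sketch of the nil-guard pattern the prompt asks for, using stand-in types rather than the real context wrapper and stream state:

```go
package main

import "fmt"

// streamState and ctxLike are stand-ins for the provider's stream state and
// its context wrapper; the real accessor is
// getOrCreateAnthropicToResponsesStreamState.
type streamState struct{ blocks int }

type ctxLike struct{ vals map[string]any }

func (c *ctxLike) Value(k string) any      { return c.vals[k] }
func (c *ctxLike) SetValue(k string, v any) { c.vals[k] = v }

const stateKey = "anthropic-to-responses-stream-state"

// getOrCreateState returns a fresh state when ctx is nil instead of
// panicking; with a non-nil ctx it caches the state under stateKey so
// repeated calls on one stream share the same accumulator.
func getOrCreateState(ctx *ctxLike) *streamState {
	if ctx == nil {
		return &streamState{}
	}
	if s, ok := ctx.Value(stateKey).(*streamState); ok {
		return s
	}
	s := &streamState{}
	ctx.SetValue(stateKey, s)
	return s
}

func main() {
	fmt.Println(getOrCreateState(nil) != nil) // no panic on nil ctx: true
	ctx := &ctxLike{vals: map[string]any{}}
	a, b := getOrCreateState(ctx), getOrCreateState(ctx)
	fmt.Println(a == b) // same state reused across calls: true
}
```

The nil branch loses cross-call accumulation, so it is a safety net for unexpected call paths, not a substitute for passing a real context.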
1798-1799: ⚠️ Potential issue | 🟠 Major

**Avoid mutating the input response inside the converter.**

Line 1799 rewrites `bifrostResp.Item.ResponsesToolMessage.Arguments`, which makes this conversion path stateful on caller-owned data. Keep sanitized JSON local and use it only for synthetic delta generation.

Suggested fix

```diff
- argumentsJSON := sanitizeWebSearchArguments(*bifrostResp.Item.ResponsesToolMessage.Arguments)
- bifrostResp.Item.ResponsesToolMessage.Arguments = &argumentsJSON
+ argumentsJSON := sanitizeWebSearchArguments(*bifrostResp.Item.ResponsesToolMessage.Arguments)
```

As per coding guidelines: "Implement converter functions as pure transformation functions with no HTTP calls, logging, or side effects."
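The non-mutating pattern can be shown with a stripped-down stand-in; `toolMessage` and `sanitize` below are placeholders for the real `ResponsesToolMessage` and `sanitizeWebSearchArguments`:

```go
package main

import (
	"fmt"
	"strings"
)

// toolMessage is a stand-in for ResponsesToolMessage; only the mutation
// pattern matters here, not the real field set.
type toolMessage struct{ Arguments *string }

// sanitize is a trivial placeholder for sanitizeWebSearchArguments.
func sanitize(s string) string { return strings.TrimSpace(s) }

// buildDelta keeps the sanitized copy local: the caller-owned message is
// read but never written, so the converter stays a pure transformation.
func buildDelta(msg *toolMessage) string {
	sanitized := sanitize(*msg.Arguments)
	return `{"delta":` + sanitized + `}`
}

func main() {
	args := `  {"query":"weather"}  `
	msg := &toolMessage{Arguments: &args}
	fmt.Println(buildDelta(msg))
	fmt.Println(*msg.Arguments == args) // input untouched: true
}
```

Keeping the copy local also means two conversions of the same response see identical input, which matters when the converter runs per stream chunk.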
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@core/providers/anthropic/responses.go` around lines 1798 - 1799, The converter is mutating caller-owned state by overwriting bifrostResp.Item.ResponsesToolMessage.Arguments; instead, call sanitizeWebSearchArguments(*bifrostResp.Item.ResponsesToolMessage.Arguments) into a local variable (e.g., sanitizedArgs) and use that local value when building synthetic deltas or downstream strings without assigning it back to bifrostResp.Item.ResponsesToolMessage.Arguments; ensure the original pointer remains unchanged so the converter stays a pure transformation (refer to sanitizeWebSearchArguments and ResponsesToolMessage.Arguments).core/providers/bedrock/utils.go (1)
1076-1083: ⚠️ Potential issue | 🟠 Major

**Use the full `ResponsesTextConfigFormatJSONSchema` here, not only `.Schema`.**

This still strips sibling fields carried on `format.JSONSchema` before generating the Bedrock tool or Anthropic `output_config.format`, so structured-output metadata can be lost during conversion.

🔧 Suggested fix

```diff
- if format.JSONSchema == nil || format.JSONSchema.Schema == nil {
+ if format.JSONSchema == nil {
      return nil, nil // Schema is required for structured output
  }
@@
- schemaObj := *format.JSONSchema.Schema
+ schemaObj := *format.JSONSchema
```

Based on learnings, `convertTextFormatToTool` should use `*format.JSONSchema` as the schema object rather than dereferencing the nested `Schema` field.

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@core/providers/bedrock/utils.go` around lines 1076 - 1083, The code currently extracts only the nested Schema field (schemaObj := *format.JSONSchema.Schema) which drops sibling metadata on ResponsesTextConfigFormatJSONSchema; in convertTextFormatToTool use the full *format.JSONSchema object as the schema payload instead of dereferencing its Schema field — update the nil checks to ensure format.JSONSchema is non-nil, set schemaObj (or similarly named local) to *format.JSONSchema (not *format.JSONSchema.Schema), and keep description handling via format.JSONSchema.Description so the full structured-output metadata is preserved when generating Bedrock/Anthropic output_config.format.
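The metadata loss can be demonstrated with a stand-in type; the field layout below is an assumption for illustration, not the actual `ResponsesTextConfigFormatJSONSchema`:

```go
package main

import (
	"encoding/json"
	"fmt"
)

// jsonSchemaFormat stands in for ResponsesTextConfigFormatJSONSchema:
// the schema proper plus sibling metadata fields.
type jsonSchemaFormat struct {
	Name   string          `json:"name,omitempty"`
	Strict *bool           `json:"strict,omitempty"`
	Schema *map[string]any `json:"schema,omitempty"`
}

// schemaPayload serializes the whole format object, so sibling fields such
// as Name and Strict survive the conversion.
func schemaPayload(f *jsonSchemaFormat) ([]byte, error) { return json.Marshal(f) }

// payloadSchemaOnly shows what is lost when only the nested Schema field
// is dereferenced, as the flagged code does.
func payloadSchemaOnly(f *jsonSchemaFormat) ([]byte, error) { return json.Marshal(f.Schema) }

func main() {
	strict := true
	schema := map[string]any{"type": "object"}
	f := &jsonSchemaFormat{Name: "report", Strict: &strict, Schema: &schema}

	full, _ := schemaPayload(f)
	narrow, _ := payloadSchemaOnly(f)
	fmt.Println(string(full))
	fmt.Println(string(narrow)) // name/strict metadata is gone
}
```

Whether the downstream Bedrock/Anthropic shape actually accepts the sibling fields inline is a separate question; the point is that discarding them in the converter forecloses the option.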
🧹 Nitpick comments (10)
core/schemas/images.go (1)
66-71: **Keep the method docs aligned with behavior.**

`BackfillParams` now also backfills `Model`; adding that bullet to the comment would prevent drift.

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@core/schemas/images.go` around lines 66 - 71, The documentation for BackfillParams on BifrostImageGenerationResponse is out of sync: the method now also backfills the Model field but the comment bullets don't mention it; update the comment above func (r *BifrostImageGenerationResponse) BackfillParams(req *BifrostRequest) to include a bullet noting that Model is backfilled (e.g., "- Model on ImageGenerationResponseParameters (from request if not in response)"), keeping the existing bullets for NumInputImages, Size, and Quality.
core/schemas/videos.go (1)
148-151: **Update the `BackfillParams` comment to include model backfill.**

The implementation now backfills both `Seconds` and `Model`, but the method comment documents only `Seconds`.

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@core/schemas/videos.go` around lines 148 - 151, The comment for BackfillParams is out of date: the method now backfills both Seconds and Model but the doc only mentions Seconds. Update the comment on the BackfillParams method (on BifrostVideoGenerationResponse.BackfillParams) to list both fields that can be backfilled (Seconds and Model), describe their sources (from BifrostRequest params or defaults), and note that Model is populated from req.Model when missing; keep the comment concise and consistent with the implementation.
core/internal/llmtests/provider_feature_support_test.go (1)
667-667: **Prefer the pointer helper for bool literals in new test cases.**

For consistency with repo conventions, use `schemas.Ptr(true/false)` instead of local bool variables plus address-taking in these newly added setup blocks.

♻️ Suggested refactor pattern

```diff
- setupReq: func() *anthropic.AnthropicMessageRequest {
-     eager := true
-     return &anthropic.AnthropicMessageRequest{
-         Tools: []anthropic.AnthropicTool{{Name: "t1", EagerInputStreaming: &eager}},
-     }
- },
+ setupReq: func() *anthropic.AnthropicMessageRequest {
+     return &anthropic.AnthropicMessageRequest{
+         Tools: []anthropic.AnthropicTool{{Name: "t1", EagerInputStreaming: schemas.Ptr(true)}},
+     }
+ },
```

Based on learnings: "In the maximhq/bifrost repository, prefer using bifrost.Ptr() to create pointers instead of the address operator (&) ... including test utilities."
Also applies to: 678-678, 689-689, 700-700, 711-711
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@core/internal/llmtests/provider_feature_support_test.go` at line 667, Replace local bool variables and taking their addresses (e.g., eager := true; &eager) in the new test setup blocks with the repo helper schemas.Ptr(true/false); specifically update places that create pointer booleans for fields used in these tests (the local variable named eager and the analogous bool locals at the other new setup sites) to call schemas.Ptr(true) or schemas.Ptr(false) directly so the test uses the standard pointer helper instead of address-of locals.
core/internal/llmtests/eager_input_streaming.go (2)
42-43: **Use `bifrost.Ptr(true)` for pointer construction consistency.**

♻️ Suggested change

```diff
- eager := true
- chatTool.EagerInputStreaming = &eager
+ chatTool.EagerInputStreaming = bifrost.Ptr(true)
```

Based on learnings: "prefer using bifrost.Ptr() to create pointers instead of the address operator (&) ... across all code paths, including test utilities."
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@core/internal/llmtests/eager_input_streaming.go` around lines 42 - 43, Replace the local bool-and-address pattern with bifrost.Ptr for pointer consistency: instead of creating a local variable eager and assigning chatTool.EagerInputStreaming = &eager, call bifrost.Ptr(true) and assign that result to chatTool.EagerInputStreaming (reference the identifier chatTool.EagerInputStreaming and the local variable eager to locate the code).
27-134: **Consider adding a Responses-API companion for this scenario (or an explicit skip rationale).**

This scenario currently validates only Chat streaming; a paired Responses path would align better with the llmtests dual-API convention.

As per coding guidelines: "`core/internal/llmtests/**/*.go`: Scenario-based LLM tests ... run against live provider APIs with dual-API testing (Chat + Responses API)."
Verify each finding against the current code and only fix it if needed. In `@core/internal/llmtests/eager_input_streaming.go` around lines 27 - 134, The test RunEagerInputStreamingTest only exercises chat streaming via client.ChatCompletionStreamRequest/request and lacks a Responses-API counterpart as required by the dual-API convention; add a companion subtest (e.g., "EagerInputStreaming_ResponsesAPI") that constructs an equivalent responses-style request (mirroring request/Params/Tools semantics) and invokes the Responses streaming method on the client (the Responses stream call used in other tests), or if the provider truly does not support Responses for this scenario, add an explicit skip rationale at the top of RunEagerInputStreamingTest (t.Logf and return) referencing the provider and why Responses is omitted; ensure the new subtest uses the same retry logic (WithStreamRetry/StreamingRetryConfig/TestRetryContext) and reuses NewStreamingToolCallAccumulator to validate tool calls so behavior matches the Chat path.
core/providers/anthropic/chat_server_tools_test.go (1)
100-100: **Remove no-op placeholder variables.**

`_ = ptrTrue` and `_ = toolsetType` are dead artifacts and can be removed for cleaner tests.

Also applies to: 261-263
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@core/providers/anthropic/chat_server_tools_test.go` at line 100, Remove the dead placeholder variables that serve no purpose in the tests: delete the `_ = ptrTrue` and `_ = toolsetType` no-op assignments (they appear near the top and also around lines 261-263) so the test file core/providers/anthropic/chat_server_tools_test.go no longer contains unused placeholder artifacts; ensure there are no remaining references to ptrTrue or toolsetType before committing.
core/internal/llmtests/server_tools_via_openai.go (1)
110-125: **Add a Responses API companion for this scenario.**

This scenario currently validates only `ChatCompletionRequest`; add an equivalent Responses-path assertion so this regression is covered on both APIs in `core/internal/llmtests`.

As per coding guidelines, `core/internal/llmtests/**/*.go`: "Scenario-based LLM tests ... with dual-API testing (Chat + Responses API)."

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@core/internal/llmtests/server_tools_via_openai.go` around lines 110 - 125, Add a parallel Responses-path assertion alongside the existing ChatCompletionRequest call: construct a responses request mirroring req (use schemas.BifrostResponsesRequest or the project’s Responses request struct, set Provider, Model, Input/Content, Params with MaxCompletionTokens and Tools, and Fallbacks), call client.ResponsesRequest(bfCtx, responsesReq) (or the appropriate Responses client method), check the returned response/error the same way you do for client.ChatCompletionRequest (same assertions/validation of tool invocation and content), and ensure bfCtx is reused and errors are handled consistently.
core/providers/openai/openai.go (1)
2596-2596: Please resolve or track this TODO before merge.
`// TODO: add HandleProviderResponse here` in the transcription path leaves a known consistency gap in response handling. I can draft the exact `HandleProviderResponse` integration for this block if you want me to open a follow-up issue/patch.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@core/providers/openai/openai.go` at line 2596, The TODO indicates a missing call to HandleProviderResponse in the transcription path; replace the comment with a call to the same HandleProviderResponse integration used elsewhere to normalize provider outputs and errors (invoke HandleProviderResponse with the provider response object, the transcription result/metadata, the request/context and logger/metrics as required), propagate any returned normalized result or error back through the transcription flow, and ensure error handling and telemetry mirror other places that call HandleProviderResponse so response consistency is preserved.
core/providers/anthropic/utils.go (1)
830-854: Duplicate `input_examples` check — the same logic appears twice. Lines 830-838 and lines 848-854 both check `len(tool.InputExamples) > 0` and perform identical header injection logic. The duplication is harmless due to `appendUniqueHeader` deduplication, but it adds unnecessary code and cognitive overhead.
♻️ Remove the duplicate block

```diff
 	if len(tool.InputExamples) > 0 {
 		if !hasProvider || features.AdvancedToolUse {
 			// Bundle header covers input_examples transitively.
 			headers = appendUniqueHeader(headers, AnthropicAdvancedToolUseBetaHeader)
 		} else if features.InputExamples {
 			// Narrow standalone header (e.g. Bedrock).
 			headers = appendUniqueHeader(headers, AnthropicToolExamplesBetaHeader)
 		}
 	}

 	if len(tool.AllowedCallers) > 0 {
 		if !hasProvider || features.AdvancedToolUse {
 			headers = appendUniqueHeader(headers, AnthropicAdvancedToolUseBetaHeader)
 		}
 	}

-	// input_examples has both bundle coverage AND a standalone header.
-	// Prefer the bundle header when the provider accepts the bundle
-	// (covers input_examples transitively); fall back to the narrow
-	// standalone header (Bedrock) when only InputExamples is set.
-	if len(tool.InputExamples) > 0 {
-		if !hasProvider || features.AdvancedToolUse {
-			headers = appendUniqueHeader(headers, AnthropicAdvancedToolUseBetaHeader)
-		} else if features.InputExamples {
-			headers = appendUniqueHeader(headers, AnthropicToolExamplesBetaHeader)
-		}
-	}

 	// Check for fine-grained tool streaming (eager_input_streaming).
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@core/providers/anthropic/utils.go` around lines 830 - 854, The code contains a duplicated check for tool.InputExamples that injects the same headers twice; remove the redundant block (either the first or second occurrence) so the logic that appends AnthropicAdvancedToolUseBetaHeader or AnthropicToolExamplesBetaHeader based on hasProvider, features.AdvancedToolUse, and features.InputExamples and tool.InputExamples only runs once; keep appendUniqueHeader, the conditions involving hasProvider, features.AdvancedToolUse, features.InputExamples, and the use of tool.InputExamples intact to preserve behavior.
core/providers/anthropic/types.go (1)
913-920: Remove the stale `search_result` source note. This comment now contradicts the implementation in the same struct: `Source` is already modeled as `*AnthropicBlockSource`, and the union type is defined later in this file. Keeping the old “no typed slot yet” note will mislead the next change in this area.
Verify each finding against the current code and only fix it if needed. In `@core/providers/anthropic/types.go` around lines 913 - 920, Remove the stale comment about the search_result "source" lacking a typed slot in the Anthropic content block comment; the struct already models Source as *AnthropicBlockSource and the union type is defined later in this file, so update or delete that paragraph in the comment near AnthropicContentBlock/Source to avoid misleading future edits.
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Inline comments:
In @.github/workflows/scripts/setup-go-workspace.sh:
- Around line 21-23: The CI job that runs test-e2e-ui.sh pins go-version to
"1.26.1" but setup-go-workspace.sh sets the workspace/toolchain to 1.26.2 (via
go work edit -go=1.26.2 -toolchain=go1.26.2), causing an automatic toolchain
upgrade; update the go-version value in the e2e-tests.yml job that triggers
test-e2e-ui.sh from "1.26.1" to "1.26.2" so the provisioned runner toolchain
matches the workspace requirement.
In `@core/bifrost.go`:
- Around line 4963-4966: The code currently bypasses the streaming finalizer
when CheckFirstStreamChunkForError returns an error, causing
releasePluginPipeline(pipeline) to run directly and risking double-return if the
drain/post-hook path also finalizes the pipeline; change the error path so that
instead of directly calling releasePluginPipeline, you always invoke the same
finalization path used by the drain/post-hook (e.g., trigger or call the
BifrostContextKeyPostHookSpanFinalizer or a single shared finalizePipeline
helper) after waiting on drainDone, and only return after that finalizer has
run; update the block around checkedStream, drainDone, firstChunkErr (and the
similar block at 5227-5238) to use the shared finalizer and remove direct
releasePluginPipeline(pipeline) calls to prevent double-return/concurrent reuse.
- Around line 4456-4480: The code only checks bifrost.requestQueues for a
replacement ProviderQueue once, so if oldPq becomes closing after that check you
can still return "provider is shutting down" even though a new queue exists;
update the logic around the pq.isClosing() handling (the block that looks up
bifrost.requestQueues.Load(provider) and the later <-pq.done / pq.isClosing()
paths) to re-check requestQueues right before returning an error or blocking on
<-pq.done and, if a non-closing candidate is found, reroute by assigning pq =
candidate and continue; apply the same re-check pattern to the similar logic
around the later paths (the code near the other occurrence referenced in the
comment) so in-flight producers are transparently rerouted instead of failing.
In `@core/internal/llmtests/eager_input_streaming.go`:
- Around line 123-129: The test currently only checks tc.Name != "" and
tc.Arguments != ""; enhance it to assert tc.Arguments is valid JSON by
attempting to json.Unmarshal([]byte(tc.Arguments), &var) and failing the test
(use t.Errorf or t.Fatalf) if unmarshal returns an error, including the error
and the offending tc.Arguments in the message; keep the existing empty-string
check for tc.Name and reuse the loop over toolCalls and tc variable so the new
validation lives immediately after the Arguments non-empty check.
In `@core/providers/anthropic/chat_server_tools_test.go`:
- Around line 33-39: The round-trip test only asserts that Type and Name survive
serialization, so serializer regressions that drop variant fields (e.g., the
allowed_domains in the web_search_with_domains case or user_location in
web_search_with_user_location) will not be detected; update the test in
chat_server_tools_test.go to assert the full payload is preserved by either
comparing the original raw JSON to the re-serialized JSON or by unmarshalling
both into map[string]any (or a concrete struct) and asserting that keys/values
like allowed_domains and user_location (and Type and Name) are present and equal
after the round-trip; specifically target the test cases named
"web_search_with_domains" and "web_search_with_user_location" and include checks
for the allowed_domains and user_location fields rather than only Type and Name.
In `@core/providers/anthropic/chat.go`:
- Around line 354-367: The bug is that setting bifrostReq.Params.TaskBudget
writes to anthropicReq.OutputConfig but later the non-Vertex response_format
branch replaces anthropicReq.OutputConfig with a new AnthropicOutputConfig,
dropping task_budget; fix by not overwriting the whole OutputConfig there—when
handling response_format (the branch that currently does
anthropicReq.OutputConfig = &AnthropicOutputConfig{...}), instead ensure
anthropicReq.OutputConfig is non-nil and set/merge the ResponseFormat (and any
other fields) into the existing anthropicReq.OutputConfig so the previously-set
TaskBudget (from bifrostReq.Params.TaskBudget -> AnthropicTaskBudget) is
preserved.
In `@core/providers/anthropic/types.go`:
- Around line 1215-1229: The new optional fields UseCache (in
AnthropicToolWebFetch) and MaxCharacters (in AnthropicToolTextEditor) are being
sent to older tool versions and must be stripped to avoid 400s; update the
stripUnsupportedAnthropicFields function in utils.go to detect tool
variant/version (or existing feature flags) and remove/clear UseCache for
web_fetch versions older than web_fetch_20260309 and MaxCharacters for
text_editor versions older than text_editor_20250728 (or when the corresponding
feature flag is false), referencing the AnthropicToolWebFetch.UseCache and
AnthropicToolTextEditor.MaxCharacters fields so those keys are not included in
requests to unsupported tool versions.
In `@core/providers/azure/azure.go`:
- Line 3302: The code currently calls EnsureStreamFinalizerCalled(ctx) directly
after postHookRunner, but there's already a defer
providerUtils.EnsureStreamFinalizerCalled(ctx) set earlier, causing double
finalization; remove the direct invocation of EnsureStreamFinalizerCalled(ctx)
(the call after postHookRunner) and rely on the existing deferred
providerUtils.EnsureStreamFinalizerCalled(ctx) to run once when the goroutine
exits, leaving postHookRunner unchanged.
In `@core/providers/utils/stream_test.go`:
- Around line 218-223: The test can hang because sending to src may block if the
forwarder/goroutine stopped; change the send to a non-blocking/bounded write
before close by either making src a buffered channel (buffer size >=1) or
performing a select-based non-blocking send (select { case src <-
&schemas.BifrostStreamChunk{...}: default: }) so the test cannot block
indefinitely when the forwarder has been cancelled; update the send at src (the
chunk with ID "3") accordingly and then close(src).
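The select-with-default pattern this comment recommends can be sketched as follows. This is a minimal, self-contained illustration — `chunk` and `trySend` are hypothetical stand-ins, not the real `schemas.BifrostStreamChunk` or test helpers:

```go
package main

import "fmt"

type chunk struct{ ID string }

// trySend performs a non-blocking send: it returns false instead of
// blocking forever when nothing is draining ch.
func trySend(ch chan *chunk, c *chunk) bool {
	select {
	case ch <- c:
		return true
	default:
		return false
	}
}

func main() {
	src := make(chan *chunk) // unbuffered, like the test's source channel
	// The forwarder goroutine has been cancelled, so no receiver exists;
	// a bare `src <- &chunk{ID: "3"}` would deadlock here.
	if !trySend(src, &chunk{ID: "3"}) {
		fmt.Println("chunk dropped: forwarder already stopped")
	}
	close(src)
}
```

The buffered-channel alternative mentioned in the comment (buffer size >= 1) achieves the same guarantee for a single pre-close send without the select.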
In `@core/schemas/serialization_test.go`:
- Around line 1301-1306: The negative assertions in serialization_test.go check
for camelCase annotation keys (readOnlyHint, destructiveHint, idempotentHint)
which do not match the provider wire format; update the assert.NotContains calls
that reference "readOnlyHint", "destructiveHint", and "idempotentHint" (and
"annotations"/"File Reader" as needed) to use the expected snake_case wire keys
(e.g., "read_only_hint", "destructive_hint", "idempotent_hint") against the
serialized string variable s so the test actually detects annotation leakage in
provider payloads.
---
Outside diff comments:
In `@core/providers/anthropic/chat.go`:
- Around line 412-468: The tool_choice must be reconciled against the filtered
anthropicReq.Tools list produced after ValidateChatToolsForProvider to avoid
referencing a removed tool; after you build anthropicReq.Tools (see
ValidateChatToolsForProvider, convertFunctionToolToAnthropic,
convertServerToolToAnthropic and anthropicReq.Tools) validate any pin/name set
on bifrostReq.Params.ToolChoice.ChatToolChoiceStruct.Function.Name (and
equivalents) exists in anthropicReq.Tools (match on tool name); if the
referenced tool is missing, clear the name and set toolChoice.Type to "none" (or
fall back to "auto"/"any" only when the original choice was non-specific) so we
never send a tool_choice that references a tool not present in
anthropicReq.Tools.
---
Duplicate comments:
In @.github/workflows/release-cli.yml:
- Around line 23-28: Change the harden-runner step in the higher-privilege jobs
(release-cli and push-mintlify-changelog, not just check-version) to use
egress-policy: block and provide explicit allowed-endpoints entries; locate the
Harden Runner usage block (uses: step-security/harden-runner@...) inside the
release-cli and push-mintlify-changelog job definitions and update their with:
egress-policy to block and mirror the allowed-endpoints allowlist (e.g.,
github.com:443 and any R2 or other service hostnames required) so those jobs run
under a strict allowlist rather than audit mode.
In @.github/workflows/release-pipeline.yml:
- Around line 1034-1058: The Harden Runner allowlist blocks for the binary-build
jobs (the step using step-security/harden-runner@fa2e9d6... with the
"egress-policy: block" and "allowed-endpoints" keys) are missing the Go checksum
DB; add "sum.golang.org:443" to the allowed-endpoints list in both places where
that Harden Runner step appears so Go module/toolchain checksum validation can
reach the default GOSUMDB during builds.
- Around line 1241-1256: The Harden Runner step's allowed-endpoints only
includes github.com:443 but verification uses GH_TOKEN and may call the GitHub
API (verify-bifrost-http-release.sh), so update the Harden Runner blocks used by
the docker-build jobs (e.g., the steps named "Harden Runner" inside the
docker-build-amd64 and docker-build-arm64 jobs) to also include
api.github.com:443 in allowed-endpoints; run the provided shell checks against
.github/workflows/scripts/verify-bifrost-http-release.sh (searching for gh
api/gh release or api.github.com/curl github) to confirm if api.github.com:443
is required before committing the change.
- Around line 420-423: The allowlist hard-codes internal addresses
(172.38.0.2:5432 and 172.38.0.12:8301) under the allowed-endpoints key which can
fail if the compose network doesn’t pin those IPs or if no service listens on
8301; inspect the docker-compose service entries (postgres, weaviate) for
explicit ipv4_address/subnet or published ports and update the workflow to
either reference service hostnames/ports (compose DNS) or dynamically derive
addresses instead of hard-coding IPs, and also verify the test scripts
(test-bifrost-http.sh, run-migration-tests.sh, test-e2e-ui.sh) target the same
hostnames/ports or adapt them to not rely on fixed IPs so the new block-mode
policy won’t block legitimate traffic.
In `@core/bifrost.go`:
- Around line 6140-6179: drainQueueWithErrors can still miss messages because
producers may win pq.queue <- msg after workers exit; fix by quiescing producers
before the final sweep: add or use an existing ProviderQueue closing
flag/operation (e.g. call pq.signalClosing() or set pq.isClosing=true)
immediately after workers finish and before calling drainQueueWithErrors, and
update the send path (tryRequest / any method that does pq.queue <- msg) to
check pq.isClosing() and return the shutdown error instead of sending when true;
then drainQueueWithErrors can safely loop until pq.queue is empty without races.
In `@core/providers/anthropic/responses.go`:
- Around line 94-100: The accessor getOrCreateAnthropicToResponsesStreamState
currently dereferences ctx without checking for nil; add a nil guard at the top
of the function (i.e. if ctx == nil) and return a newly allocated
&anthropicToResponsesStreamState{} immediately so callers don't panic, otherwise
continue with the existing logic using ctx.Value and ctx.SetValue; reference
getOrCreateAnthropicToResponsesStreamState and
anthropicToResponsesStreamStateKey when locating the change.
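The nil-guard accessor shape suggested here looks like the sketch below. The context and state types are hypothetical stand-ins for the real ones in `core/providers/anthropic/responses.go`:

```go
package main

import "fmt"

type streamState struct{ blocks int }

// bifrostCtx mimics a context carrier with Value/SetValue accessors.
type bifrostCtx struct {
	values map[string]any
}

func (c *bifrostCtx) Value(k string) any       { return c.values[k] }
func (c *bifrostCtx) SetValue(k string, v any) { c.values[k] = v }

const stateKey = "anthropicToResponsesStreamState"

// getOrCreateState returns a usable state even when ctx is nil,
// so callers can never panic on a nil dereference.
func getOrCreateState(ctx *bifrostCtx) *streamState {
	if ctx == nil {
		return &streamState{} // nil guard: fresh throwaway state
	}
	if s, ok := ctx.Value(stateKey).(*streamState); ok {
		return s
	}
	s := &streamState{}
	ctx.SetValue(stateKey, s)
	return s
}

func main() {
	fmt.Println(getOrCreateState(nil) != nil) // no panic on nil ctx
	ctx := &bifrostCtx{values: map[string]any{}}
	fmt.Println(getOrCreateState(ctx) == getOrCreateState(ctx)) // cached on ctx
}
```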
- Around line 1798-1799: The converter is mutating caller-owned state by
overwriting bifrostResp.Item.ResponsesToolMessage.Arguments; instead, call
sanitizeWebSearchArguments(*bifrostResp.Item.ResponsesToolMessage.Arguments)
into a local variable (e.g., sanitizedArgs) and use that local value when
building synthetic deltas or downstream strings without assigning it back to
bifrostResp.Item.ResponsesToolMessage.Arguments; ensure the original pointer
remains unchanged so the converter stays a pure transformation (refer to
sanitizeWebSearchArguments and ResponsesToolMessage.Arguments).
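The keep-it-pure fix described above — sanitize into a local instead of writing back through the caller's pointer — can be sketched like this. `sanitize`, `toolMessage`, and `buildDelta` are illustrative names, not the real converter API:

```go
package main

import (
	"fmt"
	"strings"
)

// sanitize stands in for sanitizeWebSearchArguments.
func sanitize(args string) string {
	return strings.TrimSpace(args)
}

type toolMessage struct{ Arguments *string }

// buildDelta reads the caller-owned message but never writes back to it:
// the sanitized value stays in a local, keeping the converter pure.
func buildDelta(msg *toolMessage) string {
	sanitizedArgs := sanitize(*msg.Arguments) // local copy only
	return "delta:" + sanitizedArgs
}

func main() {
	raw := "  {\"query\":\"golang\"}  "
	msg := &toolMessage{Arguments: &raw}
	fmt.Println(buildDelta(msg))
	fmt.Println(*msg.Arguments == raw) // true: caller state untouched
}
```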
In `@core/providers/bedrock/utils.go`:
- Around line 1076-1083: The code currently extracts only the nested Schema
field (schemaObj := *format.JSONSchema.Schema) which drops sibling metadata on
ResponsesTextConfigFormatJSONSchema; in convertTextFormatToTool use the full
*format.JSONSchema object as the schema payload instead of dereferencing its
Schema field — update the nil checks to ensure format.JSONSchema is non-nil, set
schemaObj (or similarly named local) to *format.JSONSchema (not
*format.JSONSchema.Schema), and keep description handling via
format.JSONSchema.Description so the full structured-output metadata is
preserved when generating Bedrock/Anthropic output_config.format.
---
Nitpick comments:
In `@core/internal/llmtests/eager_input_streaming.go`:
- Around line 42-43: Replace the local bool-and-address pattern with bifrost.Ptr
for pointer consistency: instead of creating a local variable eager and
assigning chatTool.EagerInputStreaming = &eager, call bifrost.Ptr(true) and
assign that result to chatTool.EagerInputStreaming (reference the identifier
chatTool.EagerInputStreaming and the local variable eager to locate the code).
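A helper like `bifrost.Ptr` is typically a one-line generic; the sketch below assumes that shape (check the repo for the actual signature):

```go
package main

import "fmt"

// Ptr mirrors the assumed shape of bifrost.Ptr / schemas.Ptr:
// take a value, return a pointer to a copy of it.
func Ptr[T any](v T) *T { return &v }

type chatTool struct {
	EagerInputStreaming *bool
}

func main() {
	// Instead of: eager := true; tool.EagerInputStreaming = &eager
	tool := chatTool{EagerInputStreaming: Ptr(true)}
	fmt.Println(*tool.EagerInputStreaming) // true
}
```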
- Around line 27-134: The test RunEagerInputStreamingTest only exercises chat
streaming via client.ChatCompletionStreamRequest/request and lacks a
Responses-API counterpart as required by the dual-API convention; add a
companion subtest (e.g., "EagerInputStreaming_ResponsesAPI") that constructs an
equivalent responses-style request (mirroring request/Params/Tools semantics)
and invokes the Responses streaming method on the client (the Responses stream
call used in other tests), or if the provider truly does not support Responses
for this scenario, add an explicit skip rationale at the top of
RunEagerInputStreamingTest (t.Logf and return) referencing the provider and why
Responses is omitted; ensure the new subtest uses the same retry logic
(WithStreamRetry/StreamingRetryConfig/TestRetryContext) and reuses
NewStreamingToolCallAccumulator to validate tool calls so behavior matches the
Chat path.
In `@core/internal/llmtests/provider_feature_support_test.go`:
- Line 667: Replace local bool variables and taking their addresses (e.g., eager
:= true; &eager) in the new test setup blocks with the repo helper
schemas.Ptr(true/false); specifically update places that create pointer booleans
for fields used in these tests (the local variable named eager and the analogous
bool locals at the other new setup sites) to call schemas.Ptr(true) or
schemas.Ptr(false) directly so the test uses the standard pointer helper instead
of address-of locals.
In `@core/internal/llmtests/server_tools_via_openai.go`:
- Around line 110-125: Add a parallel Responses-path assertion alongside the
existing ChatCompletionRequest call: construct a responses request mirroring req
(use schemas.BifrostResponsesRequest or the project’s Responses request struct,
set Provider, Model, Input/Content, Params with MaxCompletionTokens and Tools,
and Fallbacks), call client.ResponsesRequest(bfCtx, responsesReq) (or the
appropriate Responses client method), check the returned response/error the same
way you do for client.ChatCompletionRequest (same assertions/validation of tool
invocation and content), and ensure bfCtx is reused and errors are handled
consistently.
In `@core/providers/anthropic/chat_server_tools_test.go`:
- Line 100: Remove the dead placeholder variables that serve no purpose in the
tests: delete the `_ = ptrTrue` and `_ = toolsetType` no-op assignments (they
appear near the top and also around lines 261-263) so the test file
core/providers/anthropic/chat_server_tools_test.go no longer contains unused
placeholder artifacts; ensure there are no remaining references to ptrTrue or
toolsetType before committing.
In `@core/providers/anthropic/types.go`:
- Around line 913-920: Remove the stale comment about the search_result "source"
lacking a typed slot in the Anthropic content block comment; the struct already
models Source as *AnthropicBlockSource and the union type is defined later in
this file, so update or delete that paragraph in the comment near
AnthropicContentBlock/Source to avoid misleading future edits.
In `@core/providers/anthropic/utils.go`:
- Around line 830-854: The code contains a duplicated check for
tool.InputExamples that injects the same headers twice; remove the redundant
block (either the first or second occurrence) so the logic that appends
AnthropicAdvancedToolUseBetaHeader or AnthropicToolExamplesBetaHeader based on
hasProvider, features.AdvancedToolUse, and features.InputExamples and
tool.InputExamples only runs once; keep appendUniqueHeader, the conditions
involving hasProvider, features.AdvancedToolUse, features.InputExamples, and the
use of tool.InputExamples intact to preserve behavior.
In `@core/providers/openai/openai.go`:
- Line 2596: The TODO indicates a missing call to HandleProviderResponse in the
transcription path; replace the comment with a call to the same
HandleProviderResponse integration used elsewhere to normalize provider outputs
and errors (invoke HandleProviderResponse with the provider response object, the
transcription result/metadata, the request/context and logger/metrics as
required), propagate any returned normalized result or error back through the
transcription flow, and ensure error handling and telemetry mirror other places
that call HandleProviderResponse so response consistency is preserved.
In `@core/schemas/images.go`:
- Around line 66-71: The documentation for BackfillParams on
BifrostImageGenerationResponse is out of sync: the method now also backfills the
Model field but the comment bullets don't mention it; update the comment above
func (r *BifrostImageGenerationResponse) BackfillParams(req *BifrostRequest) to
include a bullet noting that Model is backfilled (e.g., "- Model on
ImageGenerationResponseParameters (from request if not in response)"), keeping
the existing bullets for NumInputImages, Size, and Quality.
In `@core/schemas/videos.go`:
- Around line 148-151: The comment for BackfillParams is out of date: the method
now backfills both Seconds and Model but the doc only mentions Seconds. Update
the comment on the BackfillParams method (on
BifrostVideoGenerationResponse.BackfillParams) to list both fields that can be
backfilled (Seconds and Model), describe their sources (from BifrostRequest
params or defaults), and note that Model is populated from req.Model when
missing; keep the comment concise and consistent with the implementation.
ℹ️ Review info
⚙️ Run configuration
Configuration used: Organization UI
Review profile: CHILL
Plan: Pro
Run ID: e4f678c6-276b-4b89-95bd-eaebfb81ebeb
⛔ Files ignored due to path filters (8)
- `examples/mcps/edge-case-server/package-lock.json` is excluded by `!**/package-lock.json`
- `examples/mcps/error-test-server/package-lock.json` is excluded by `!**/package-lock.json`
- `examples/mcps/parallel-test-server/package-lock.json` is excluded by `!**/package-lock.json`
- `examples/mcps/temperature/package-lock.json` is excluded by `!**/package-lock.json`
- `examples/mcps/test-tools-server/package-lock.json` is excluded by `!**/package-lock.json`
- `tests/integrations/python/uv.lock` is excluded by `!**/*.lock`
- `tests/integrations/typescript/package-lock.json` is excluded by `!**/package-lock.json`
- `ui/package-lock.json` is excluded by `!**/package-lock.json`
📒 Files selected for processing (78)
`.github/workflows/e2e-tests.yml`, `.github/workflows/pr-tests.yml`, `.github/workflows/release-cli.yml`, `.github/workflows/release-pipeline.yml`, `.github/workflows/scripts/setup-go-workspace.sh`, `.github/workflows/snyk.yml`, `README.md`, `cli/go.mod`, `core/bifrost.go`, `core/go.mod`, `core/internal/llmtests/account.go`, `core/internal/llmtests/eager_input_streaming.go`, `core/internal/llmtests/provider_feature_support_test.go`, `core/internal/llmtests/server_tools_via_openai.go`, `core/internal/llmtests/tests.go`, `core/providers/anthropic/anthropic.go`, `core/providers/anthropic/anthropic_test.go`, `core/providers/anthropic/chat.go`, `core/providers/anthropic/chat_server_tools_test.go`, `core/providers/anthropic/chat_test.go`, `core/providers/anthropic/responses.go`, `core/providers/anthropic/types.go`, `core/providers/anthropic/utils.go`, `core/providers/anthropic/utils_test.go`, `core/providers/anthropic/validate_chat_tools_test.go`, `core/providers/azure/azure.go`, `core/providers/azure/azure_test.go`, `core/providers/bedrock/bedrock.go`, `core/providers/bedrock/bedrock_test.go`, `core/providers/bedrock/convert_tool_config_test.go`, `core/providers/bedrock/utils.go`, `core/providers/cohere/cohere.go`, `core/providers/elevenlabs/elevenlabs.go`, `core/providers/gemini/gemini.go`, `core/providers/huggingface/huggingface.go`, `core/providers/mistral/mistral.go`, `core/providers/openai/openai.go`, `core/providers/openai/responses_marshal_test.go`, `core/providers/openai/types.go`, `core/providers/replicate/replicate.go`, `core/providers/utils/stream.go`, `core/providers/utils/stream_test.go`, `core/providers/utils/utils.go`, `core/providers/vertex/vertex.go`, `core/providers/vertex/vertex_test.go`, `core/providers/vllm/vllm.go`, `core/schemas/chatcompletions.go`, `core/schemas/images.go`, `core/schemas/responses.go`, `core/schemas/serialization_test.go`, `core/schemas/videos.go`, `examples/mcps/edge-case-server/package.json`, `examples/mcps/error-test-server/package.json`, `examples/mcps/parallel-test-server/package.json`, `examples/mcps/temperature/package.json`, `examples/mcps/test-tools-server/package.json`, `examples/plugins/hello-world/go.mod`, `framework/go.mod`, `plugins/governance/go.mod`, `plugins/jsonparser/go.mod`, `plugins/litellmcompat/go.mod`, `plugins/logging/go.mod`, `plugins/maxim/go.mod`, `plugins/mocker/go.mod`, `plugins/otel/go.mod`, `plugins/semanticcache/go.mod`, `plugins/telemetry/go.mod`, `tests/integrations/typescript/package.json`, `tests/scripts/1millogs/go.mod`, `transports/Dockerfile`, `transports/Dockerfile.local`, `transports/bifrost-http/handlers/devpprof.go`, `transports/bifrost-http/lib/validator_test.go`, `transports/changelog.md`, `transports/go.mod`, `ui/app/pprof/page.tsx`, `ui/lib/store/apis/devApi.ts`, `ui/package.json`
💤 Files with no reviewable changes (1)
- README.md
✅ Files skipped from review due to trivial changes (5)
- .github/workflows/e2e-tests.yml
- cli/go.mod
- .github/workflows/pr-tests.yml
- core/go.mod
- core/providers/bedrock/bedrock_test.go
🚧 Files skipped from review as they are similar to previous changes (3)
- .github/workflows/snyk.yml
- core/providers/openai/responses_marshal_test.go
- core/providers/anthropic/chat_test.go
```go
// If the queue is closing, check whether the provider was updated (new queue
// available) or removed. On update, transparently re-route to the new queue
// so in-flight producers don't get spurious errors. On removal, error out.
//
// Use a direct sync.Map lookup instead of getProviderQueue to avoid the
// lazy-creation path: getProviderQueue can resurrect a provider that was
// just removed by RemoveProvider if the account config still exists.
if pq.isClosing() {
	var reroutedPq *ProviderQueue
	if val, ok := bifrost.requestQueues.Load(provider); ok {
		if candidate := val.(*ProviderQueue); candidate != pq && !candidate.isClosing() {
			reroutedPq = candidate
		}
	}
	if reroutedPq == nil {
		bifrost.releaseChannelMessage(msg)
		bifrostErr := newBifrostErrorFromMsg("provider is shutting down")
		bifrostErr.ExtraFields = schemas.BifrostErrorExtraFields{
			RequestType:    req.RequestType,
			Provider:       provider,
			ModelRequested: model,
		}
		return nil, bifrostErr
	}
	pq = reroutedPq
```
Reroute again if the queue flips to closing after this first check.
This only reroutes producers that see pq.isClosing() before the first enqueue attempt. If oldPq.signalClosing() happens just after this block, the later <-pq.done / pq.isClosing() paths still return "provider is shutting down" even though requestQueues already contains newPq, so hot updates can still fail in-flight requests.
Also applies to: 4731-4755
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@core/bifrost.go` around lines 4456 - 4480, The code only checks
bifrost.requestQueues for a replacement ProviderQueue once, so if oldPq becomes
closing after that check you can still return "provider is shutting down" even
though a new queue exists; update the logic around the pq.isClosing() handling
(the block that looks up bifrost.requestQueues.Load(provider) and the later
<-pq.done / pq.isClosing() paths) to re-check requestQueues right before
returning an error or blocking on <-pq.done and, if a non-closing candidate is
found, reroute by assigning pq = candidate and continue; apply the same re-check
pattern to the similar logic around the later paths (the code near the other
occurrence referenced in the comment) so in-flight producers are transparently
rerouted instead of failing.
```go
checkedStream, drainDone, firstChunkErr := providerUtils.CheckFirstStreamChunkForError(ctx, streamChan)
if firstChunkErr != nil {
	<-drainDone
	bifrostError = firstChunkErr
```
Don't bypass the streaming finalizer when a started stream turns into an error.
`CheckFirstStreamChunkForError` can turn an already-started stream back into `bifrostError != nil`, but the worker still has a direct `releasePluginPipeline(pipeline)` error path below. If the provider/drain path then invokes `BifrostContextKeyPostHookSpanFinalizer`, the same `PluginPipeline` is returned to the pool twice and can be reused concurrently.
Possible fix
```diff
-	// Release pipeline immediately for non-streaming requests only
-	// For streaming, the pipeline is released in the postHookSpanFinalizer after streaming completes
-	// Exception: if streaming request has an error, release immediately since finalizer won't be called
-	if pipeline != nil && (!IsStreamRequestType(req.RequestType) || bifrostError != nil) {
-		bifrost.releasePluginPipeline(pipeline)
-	}
+	// Non-streaming requests own the pipeline directly.
+	// Streaming requests must always release through the finalizer so the
+	// sync.Once guard covers normal completion, panic cleanup, and early-error paths.
+	if pipeline != nil {
+		if !IsStreamRequestType(req.RequestType) {
+			bifrost.releasePluginPipeline(pipeline)
+		} else if bifrostError != nil {
+			if finalizer, ok := req.Context.Value(schemas.BifrostContextKeyPostHookSpanFinalizer).(func(context.Context)); ok {
+				finalizer(req.Context)
+			} else {
+				bifrost.releasePluginPipeline(pipeline)
+			}
+		}
+	}
```

Also applies to: 5227-5238
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@core/bifrost.go` around lines 4963 - 4966, The code currently bypasses the
streaming finalizer when CheckFirstStreamChunkForError returns an error, causing
releasePluginPipeline(pipeline) to run directly and risking double-return if the
drain/post-hook path also finalizes the pipeline; change the error path so that
instead of directly calling releasePluginPipeline, you always invoke the same
finalization path used by the drain/post-hook (e.g., trigger or call the
BifrostContextKeyPostHookSpanFinalizer or a single shared finalizePipeline
helper) after waiting on drainDone, and only return after that finalizer has
run; update the block around checkedStream, drainDone, firstChunkErr (and the
similar block at 5227-5238) to use the shared finalizer and remove direct
releasePluginPipeline(pipeline) calls to prevent double-return/concurrent reuse.
```go
for _, tc := range toolCalls {
	if tc.Name == "" {
		t.Error("Tool call missing function name")
	}
	if tc.Arguments == "" {
		t.Error("Tool call missing arguments JSON")
	}
```
Assert tool-call arguments are valid JSON, not just non-empty.
Right now invalid JSON payloads would still pass this test.
🧪 Suggested strengthening
import (
"context"
+ "encoding/json"
"os"
"testing"
@@
for _, tc := range toolCalls {
if tc.Name == "" {
t.Error("Tool call missing function name")
}
if tc.Arguments == "" {
t.Error("Tool call missing arguments JSON")
+ } else if !json.Valid([]byte(tc.Arguments)) {
+ t.Errorf("Tool call arguments must be valid JSON: %s", tc.Arguments)
}
 }
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@core/internal/llmtests/eager_input_streaming.go` around lines 123 - 129, The
test currently only checks tc.Name != "" and tc.Arguments != ""; enhance it to
assert tc.Arguments is valid JSON by attempting to
json.Unmarshal([]byte(tc.Arguments), &var) and failing the test (use t.Errorf or
t.Fatalf) if unmarshal returns an error, including the error and the offending
tc.Arguments in the message; keep the existing empty-string check for tc.Name
and reuse the loop over toolCalls and tc variable so the new validation lives
immediately after the Arguments non-empty check.
	name: "web_search_with_domains",
	raw:  `{"type":"web_search_20250305","name":"web_search","allowed_domains":["example.com","docs.example.com"]}`,
},
{
	name: "web_search_with_user_location",
	raw:  `{"type":"web_search_20250305","name":"web_search","user_location":{"type":"approximate","city":"San Francisco","country":"US","timezone":"America/Los_Angeles"}}`,
},
Round-trip test can miss dropped variant fields.
Cases like web_search_with_domains / web_search_with_user_location include variant payload, but the round-trip check only compares Type and Name. A serializer regression dropping those fields would still pass.
💡 Suggested fix
import (
"encoding/json"
+ "reflect"
"testing"
@@
- if tool.Name != tool2.Name || tool.Type != tool2.Type {
- t.Errorf("round-trip mismatch\n in: %s\n out: %s", tc.raw, string(out))
- }
+ var inObj map[string]any
+ if err := json.Unmarshal([]byte(tc.raw), &inObj); err != nil {
+ t.Fatalf("input json decode failed: %v", err)
+ }
+ var outObj map[string]any
+ if err := json.Unmarshal(out, &outObj); err != nil {
+ t.Fatalf("output json decode failed: %v\njson: %s", err, string(out))
+ }
+ if !reflect.DeepEqual(inObj, outObj) {
+ t.Errorf("round-trip mismatch\n in: %s\n out: %s", tc.raw, string(out))
+ }

Also applies to: 143-146
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@core/providers/anthropic/chat_server_tools_test.go` around lines 33 - 39, The
round-trip test only asserts that Type and Name survive serialization, so
serializer regressions that drop variant fields (e.g., the allowed_domains in
the web_search_with_domains case or user_location in
web_search_with_user_location) will not be detected; update the test in
chat_server_tools_test.go to assert the full payload is preserved by either
comparing the original raw JSON to the re-serialized JSON or by unmarshalling
both into map[string]any (or a concrete struct) and asserting that keys/values
like allowed_domains and user_location (and Type and Name) are present and equal
after the round-trip; specifically target the test cases named
"web_search_with_domains" and "web_search_with_user_location" and include checks
for the allowed_domains and user_location fields rather than only Type and Name.
// TaskBudget — maps onto output_config.task_budget. If an OutputConfig
// already exists (e.g. from structured outputs), attach the budget to
// it; otherwise create one.
if bifrostReq.Params.TaskBudget != nil {
	tb := &AnthropicTaskBudget{
		Type:      bifrostReq.Params.TaskBudget.Type,
		Total:     bifrostReq.Params.TaskBudget.Total,
		Remaining: bifrostReq.Params.TaskBudget.Remaining,
	}
	if anthropicReq.OutputConfig == nil {
		anthropicReq.OutputConfig = &AnthropicOutputConfig{}
	}
	anthropicReq.OutputConfig.TaskBudget = tb
}
Preserve task_budget when response_format is also set.
This writes TaskBudget into anthropicReq.OutputConfig, but the non-Vertex response_format branch later replaces anthropicReq.OutputConfig with a fresh struct at Line 398. Requests that set both parameters will silently drop output_config.task_budget.
🔧 Suggested fix
- if outputFormat != nil {
- anthropicReq.OutputConfig = &AnthropicOutputConfig{
- Format: outputFormat,
- }
- }
+ if outputFormat != nil {
+ if anthropicReq.OutputConfig == nil {
+ anthropicReq.OutputConfig = &AnthropicOutputConfig{}
+ }
+ anthropicReq.OutputConfig.Format = outputFormat
+ }🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@core/providers/anthropic/chat.go` around lines 354 - 367, The bug is that
setting bifrostReq.Params.TaskBudget writes to anthropicReq.OutputConfig but
later the non-Vertex response_format branch replaces anthropicReq.OutputConfig
with a new AnthropicOutputConfig, dropping task_budget; fix by not overwriting
the whole OutputConfig there—when handling response_format (the branch that
currently does anthropicReq.OutputConfig = &AnthropicOutputConfig{...}), instead
ensure anthropicReq.OutputConfig is non-nil and set/merge the ResponseFormat
(and any other fields) into the existing anthropicReq.OutputConfig so the
previously-set TaskBudget (from bifrostReq.Params.TaskBudget ->
AnthropicTaskBudget) is preserved.
type AnthropicToolWebFetch struct {
	MaxUses          *int                `json:"max_uses,omitempty"`
	AllowedDomains   []string            `json:"allowed_domains,omitempty"`
	BlockedDomains   []string            `json:"blocked_domains,omitempty"`
	MaxContentTokens *int                `json:"max_content_tokens,omitempty"`
	Citations        *AnthropicCitations `json:"citations,omitempty"` // {enabled: bool} — toggles citation emission on fetched documents
	UseCache         *bool               `json:"use_cache,omitempty"` // web_fetch_20260309+ only — enables server-side page cache
}

// AnthropicToolTextEditor holds fields specific to the text_editor tool
// variants. Only text_editor_20250728 (and later) honours max_characters
// as a view-truncation cap.
type AnthropicToolTextEditor struct {
	MaxCharacters *int `json:"max_characters,omitempty"` // text_editor_20250728+ only
}
Add version gating for use_cache and max_characters to prevent 400 errors on older tool versions.
The new fields use_cache (web_fetch_20260309+) and max_characters (text_editor_20250728+) are being copied directly into the request without version checks. If sent with older tool versions (e.g., web_fetch_20250910, text_editor_20250124), the Anthropic API will reject them with a 400 error.
The stripUnsupportedAnthropicFields function in utils.go handles version gating for similar fields (DeferLoading, InputExamples, EagerInputStreaming, Strict), but use_cache and max_characters are missing from this logic. Add explicit stripping based on tool version or add feature flags to gate these fields conditionally.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@core/providers/anthropic/types.go` around lines 1215 - 1229, The new optional
fields UseCache (in AnthropicToolWebFetch) and MaxCharacters (in
AnthropicToolTextEditor) are being sent to older tool versions and must be
stripped to avoid 400s; update the stripUnsupportedAnthropicFields function in
utils.go to detect tool variant/version (or existing feature flags) and
remove/clear UseCache for web_fetch versions older than web_fetch_20260309 and
MaxCharacters for text_editor versions older than text_editor_20250728 (or when
the corresponding feature flag is false), referencing the
AnthropicToolWebFetch.UseCache and AnthropicToolTextEditor.MaxCharacters fields
so those keys are not included in requests to unsupported tool versions.
ch := make(chan *schemas.BifrostStreamChunk, schemas.DefaultStreamBufferSize)
go func() {
	defer providerUtils.EnsureStreamFinalizerCalled(ctx)
Remove direct finalizer invocation to prevent double finalization.
Lines 3345-3347 directly call the finalizer, but the defer at line 3302 will invoke EnsureStreamFinalizerCalled(ctx) again when the goroutine exits. This causes the same finalizer to execute twice. Since postHookRunner has already been called before the finalizer, remove the direct invocation and let the deferred helper handle it.
Suggested fix
- if finalizer, ok := ctx.Value(schemas.BifrostContextKeyPostHookSpanFinalizer).(func(context.Context)); ok && finalizer != nil {
- finalizer(ctx)
- }
  return

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@core/providers/azure/azure.go` at line 3302, The code currently calls
EnsureStreamFinalizerCalled(ctx) directly after postHookRunner, but there's
already a defer providerUtils.EnsureStreamFinalizerCalled(ctx) set earlier,
causing double finalization; remove the direct invocation of
EnsureStreamFinalizerCalled(ctx) (the call after postHookRunner) and rely on the
existing deferred providerUtils.EnsureStreamFinalizerCalled(ctx) to run once
when the goroutine exits, leaving postHookRunner unchanged.
// Simulate the upstream producer still emitting, then closing. The
// drain loop should consume these and terminate.
src <- &schemas.BifrostStreamChunk{
	BifrostChatResponse: &schemas.BifrostChatResponse{ID: "3"},
}
close(src)
Avoid a potential test hang on Line 220.
The send to src after cancellation can block indefinitely if the forwarder stops before draining, which can stall the test run instead of failing fast.
♻️ Suggested fix (bounded send before close)
- // Simulate the upstream producer still emitting, then closing. The
- // drain loop should consume these and terminate.
- src <- &schemas.BifrostStreamChunk{
- BifrostChatResponse: &schemas.BifrostChatResponse{ID: "3"},
- }
- close(src)
+ // Simulate the upstream producer still emitting, then closing. Bound the
+ // send so regressions fail fast instead of hanging the test.
+ select {
+ case src <- &schemas.BifrostStreamChunk{
+ BifrostChatResponse: &schemas.BifrostChatResponse{ID: "3"},
+ }:
+ case <-time.After(time.Second):
+ t.Fatal("source send blocked after cancel; wrapper is not draining")
+ }
+ close(src)
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@core/providers/utils/stream_test.go` around lines 218 - 223, The test can
hang because sending to src may block if the forwarder/goroutine stopped; change
the send to a non-blocking/bounded write before close by either making src a
buffered channel (buffer size >=1) or performing a select-based non-blocking
send (select { case src <- &schemas.BifrostStreamChunk{...}: default: }) so the
test cannot block indefinitely when the forwarder has been cancelled; update the
send at src (the chunk with ID "3") accordingly and then close(src).
// Annotations must be absent — json:"-" must suppress the entire field
assert.NotContains(t, s, "annotations", "annotations field must not appear in provider payload")
assert.NotContains(t, s, "readOnlyHint", "readOnlyHint must not appear in provider payload")
assert.NotContains(t, s, "destructiveHint", "destructiveHint must not appear in provider payload")
assert.NotContains(t, s, "idempotentHint", "idempotentHint must not appear in provider payload")
assert.NotContains(t, s, "File Reader", "annotation title must not appear in provider payload")
Negative JSON-key assertions are using non-wire key names.
readOnlyHint / destructiveHint / idempotentHint are camelCase, but provider payload keys would be snake_case. This can let annotation leakage slip through undetected.
💡 Suggested fix
- assert.NotContains(t, s, "readOnlyHint", "readOnlyHint must not appear in provider payload")
- assert.NotContains(t, s, "destructiveHint", "destructiveHint must not appear in provider payload")
- assert.NotContains(t, s, "idempotentHint", "idempotentHint must not appear in provider payload")
+ assert.NotContains(t, s, `"read_only_hint"`, "read_only_hint must not appear in provider payload")
+ assert.NotContains(t, s, `"destructive_hint"`, "destructive_hint must not appear in provider payload")
+ assert.NotContains(t, s, `"idempotent_hint"`, "idempotent_hint must not appear in provider payload")🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@core/schemas/serialization_test.go` around lines 1301 - 1306, The negative
assertions in serialization_test.go check for camelCase annotation keys
(readOnlyHint, destructiveHint, idempotentHint) which do not match the provider
wire format; update the assert.NotContains calls that reference "readOnlyHint",
"destructiveHint", and "idempotentHint" (and "annotations"/"File Reader" as
needed) to use the expected snake_case wire keys (e.g., "read_only_hint",
"destructive_hint", "idempotent_hint") against the serialized string variable s
so the test actually detects annotation leakage in provider payloads.
## Summary

Adds an operation-allowed check for OCR requests before they are dispatched to a provider, and fixes the Mistral provider to return its custom provider name when one is configured.

## Changes

- Added a `CheckOperationAllowed` guard for `OCRRequest` in `handleProviderRequest`, consistent with how other request types are gated. If the operation is not permitted, a `BifrostError` is returned with the provider key, request type, and requested model populated.
- Updated `MistralProvider.GetProviderKey()` to use `providerUtils.GetProviderName` so that custom provider configurations are respected, rather than always returning the hardcoded `schemas.Mistral` value.

## Type of change

- [ ] Bug fix
- [x] Feature
- [ ] Refactor
- [ ] Documentation
- [ ] Chore/CI

## Affected areas

- [x] Core (Go)
- [ ] Transports (HTTP)
- [x] Providers/Integrations
- [ ] Plugins
- [ ] UI (Next.js)
- [ ] Docs

## How to test

```sh
go version
go test ./...
```

- Configure a custom provider wrapping Mistral and verify that `GetProviderKey()` returns the custom provider name rather than `mistral`.
- Attempt an OCR request against a provider where the operation is not allowed and confirm a `BifrostError` is returned with the correct `Provider`, `RequestType`, and `ModelRequested` fields set.
- Attempt an OCR request against a provider where the operation is allowed and confirm the request proceeds normally.

## Breaking changes

- [ ] Yes
- [x] No

## Related issues

## Security considerations

None.

## Checklist

- [ ] I read `docs/contributing/README.md` and followed the guidelines
- [ ] I added/updated tests where appropriate
- [ ] I updated documentation where needed
- [ ] I verified builds succeed (Go and UI)
- [ ] I verified the CI pipeline passes locally if applicable
## Summary

This PR simplifies and consolidates the `config.schema.json` by removing several features, collapsing provider-specific schema variants, and restructuring key configuration definitions to reduce complexity and align with updated runtime semantics.

## Changes

- Removed the top-level `version` field that controlled allow-list semantics for empty arrays
- Removed the `compat` plugin configuration block (including `convert_text_to_chat`, `convert_chat_to_responses`, `should_drop_params`, `should_convert_params`)
- Replaced `compat` with a simpler `enable_litellm_fallbacks` boolean for Groq text completion fallbacks
- Removed `mcp_disable_auto_tool_inject` and `routing_chain_max_depth` from server config
- Collapsed `provider_with_ollama_config`, `provider_with_sgl_config`, and `provider_with_replicate_config` into the generic `provider` definition; removed their corresponding key types (`ollama_key`, `sgl_key`, `replicate_key`) and `network_config_without_base_url`
- Removed providers `nebius`, `xai`, and `runway` from the providers block
- Moved `calendar_aligned` from `virtual_key` to the `budget` object; removed `virtual_key_id` and `provider_config_id` from budget in favor of a standalone `budget_id` reference on virtual keys
- Removed `chain_rule` from routing rules and relaxed the `scope_id` conditional requirement
- Simplified `virtual_key_provider_config` to inline key definitions with full provider-specific key configs (Azure, Vertex, Bedrock, VLLM), replacing the separate `key_ids` and `keys` split
- Removed `mcp_client_name` and `allow_on_all_virtual_keys` from MCP configs; removed `allowed_extra_headers` and `disable_auto_tool_inject` from MCP client config
- Added `websocket` as a supported MCP connection type with a dedicated `websocket_config` block; removed `inprocess` connection type
- Removed `per_user_oauth` as an MCP auth type and dropped the conditional `oauth_config_id` requirement
- Renamed `concurrency_and_buffer_size` to `concurrency_config`; renamed `retry_backoff_initial`/`retry_backoff_max` to `retry_backoff_initial_ms`/`retry_backoff_max_ms`; removed `enforce_http2` and `openai_config` from network config
- Moved `pricing_overrides` from the top-level config into individual provider definitions
- Simplified `provider_pricing_override` schema, removing scoped fields (`scope_kind`, `virtual_key_id`, `provider_id`, `provider_key_id`) and replacing `pattern` with `model_pattern`; added `regex` as a valid `match_type`; expanded supported `request_types`
- Renamed `scim_config` to `saml_config` in the top-level schema
- Removed `apiToken` from Okta config and made `clientSecret` optional; updated required fields to only `issuerUrl` and `clientId`
- Removed `object_storage` and `retention_days` from the logs store config
- Removed `id` and `description` fields from provider config entries in the `provider_configs` array
- Removed `websocket_responses` and `realtime` from `custom_provider_config` allowed requests; removed the enum constraint on `base_provider_type`
- Removed `disable_auto_tool_inject` from `mcp_client_config` VFS settings
- Added `deployments` mapping to `azure_key_config` and `vertex_key_config`
- Updated `otel` plugin `trace_type` to only accept `"otel"` (removed `genai_extension`, `vercel`, `open_inference`)
- Removed `prompts` from the built-in plugin name list
- Removed `builtin` as a valid plugin `placement` value
- Changed `cluster_config` discovery `dial_timeout` from a Go duration string to an integer (nanoseconds)
- Reformatted many inline `required` arrays to multi-line style for readability

## Type of change

- [ ] Bug fix
- [ ] Feature
- [x] Refactor
- [ ] Documentation
- [ ] Chore/CI

## Affected areas

- [ ] Core (Go)
- [x] Transports (HTTP)
- [x] Providers/Integrations
- [x] Plugins
- [ ] UI (Next.js)
- [ ] Docs

## How to test

Validate existing configs against the updated schema to confirm they parse correctly. Verify that configs using removed fields (`version`, `compat`, `mcp_disable_auto_tool_inject`, `chain_rule`, etc.) are rejected by the schema validator.

```sh
go test ./...
```

Confirm that provider configs for Ollama, SGL, and Replicate continue to work using the generic `provider` definition. Confirm MCP clients using the `websocket` connection type validate correctly with a `websocket_config` block.

## Breaking changes

- [x] Yes
- [ ] No

The following fields have been removed and configs using them will fail schema validation:

- `version` (top-level)
- `compat` block under server config
- `mcp_disable_auto_tool_inject` and `routing_chain_max_depth` under server config
- `chain_rule` on routing rules
- `calendar_aligned` on virtual keys (now on budgets)
- `virtual_key_id` / `provider_config_id` on budgets
- `apiToken` on Okta config (now optional `clientSecret` only)
- `object_storage` and `retention_days` on logs store
- `id`, `description` on provider config entries
- `allow_on_all_virtual_keys`, `allowed_extra_headers`, `disable_auto_tool_inject` on MCP client config
- `inprocess` MCP connection type and `per_user_oauth` auth type
- `enforce_http2` and `openai_config` from network config
- `builtin` plugin placement value; `prompts` built-in plugin name
- `nebius`, `xai`, `runway` provider entries

Migrate by removing or replacing these fields according to the updated schema definitions.

## Related issues

## Security considerations

Removal of `per_user_oauth` as an MCP auth type should be reviewed to ensure no active integrations depend on it. The relaxed `scope_id` requirement on routing rules should be validated to confirm it does not inadvertently broaden access scope.

## Checklist

- [ ] I read `docs/contributing/README.md` and followed the guidelines
- [ ] I added/updated tests where appropriate
- [ ] I updated documentation where needed
- [ ] I verified builds succeed (Go and UI)
- [ ] I verified the CI pipeline passes locally if applicable
## Summary

This PR removes all test jobs from the release pipeline and decouples them from the release gate conditions, allowing releases to proceed without waiting for (often flaky) provider API test results. It also significantly expands and restructures `config.schema.json` to reflect new features, provider support, and breaking semantic changes introduced in v1.5.0.

## Changes

- **Release pipeline**: Removed `test-core`, `approve-flaky-test-core`, `test-framework`, `test-plugins`, `test-bifrost-http`, `test-migrations`, and `test-e2e-ui` jobs entirely from `release-pipeline.yml`. All release jobs (`core-release`, `framework-release`, `plugins-release`, `bifrost-http-prep`, `docker-build-amd64`, `docker-build-arm64`, `push-mintlify-changelog`) now depend only on change detection and upstream release jobs, not on test outcomes.
- **Schema: deny-by-default semantics (v1.5.0)**: Empty arrays in `provider_configs`, `mcp_configs`, `allowed_models`, `key_ids`, and `tools_to_execute` now mean "deny all" rather than "allow all". Use `["*"]` to allow all. A top-level `version` field (enum `1` or `2`, default `2`) controls which semantic applies, with `1` restoring v1.4.x behavior.
- **Schema: new providers**: Added `nebius`, `xai`, and `runway` as first-class provider entries.
- **Schema: provider key restructuring**: Replaced the inline key object definition in `virtual_key_provider_config` with a flat `key_ids` string array. Introduced dedicated key types `ollama_key`, `sgl_key`, and `replicate_key` with their own `_key_config` blocks. Removed `deployments` from `azure_key_config` and `vertex_key_config` (replaced by `aliases` on `base_key`). Added `aliases` to `base_key` for model-to-deployment/inference-profile mappings.
- **Schema: provider variants**: `ollama` and `sgl` now reference `provider_with_ollama_config` and `provider_with_sgl_config` respectively, which use `network_config_without_base_url` (URL is per-key). `replicate` references `provider_with_replicate_config`. Added `openai_config` def with `disable_store` for the Responses API. Renamed `concurrency_config` to `concurrency_and_buffer_size`.
- **Schema: network config**: Split `network_config` into `network_config` (with `base_url`) and `network_config_without_base_url`. Added `enforce_http2`, `stream_idle_timeout_in_seconds`, `max_conns_per_host`, `beta_header_overrides`, and `ca_cert_pem` fields. Renamed `retry_backoff_initial_ms`/`retry_backoff_max_ms` to `retry_backoff_initial`/`retry_backoff_max`.
- **Schema: MCP changes**: Removed `websocket` connection type; added `inprocess`. Added `per_user_oauth` auth type. Added `mcp_client_name` for config-file resolution. Added `allowed_extra_headers` and `allow_on_all_virtual_keys` to `mcp_client_config`. Added `disable_auto_tool_inject` to MCP plugin config. Added global `mcp_disable_auto_tool_inject` and `routing_chain_max_depth` to server config.
- **Schema: routing rules**: Added `chain_rule` boolean to `routing_rule`. Made `scope_id` required (non-null string) when `scope` is `team`, `customer`, or `virtual_key`.
- **Schema: budgets**: Moved `calendar_aligned` from the budget object to the virtual key level. Replaced `budget_id` on virtual key with `virtual_key_id`/`provider_config_id` on the budget object itself. Removed `budget_id` from `virtual_key_provider_config`.
- **Schema: logs store**: Added `object_storage` (S3/GCS) and `retention_days` to the logs store config.
- **Schema: pricing overrides**: Moved `pricing_overrides` from per-provider to a top-level array with scoped `provider_pricing_override` objects supporting `scope_kind`, `virtual_key_id`, `provider_id`, `provider_key_id`, `match_type`, `pattern`, `request_types`, and `pricing_patch`.
- **Schema: compat plugin**: Replaced `enable_litellm_fallbacks` with a structured `compat` object supporting `convert_text_to_chat`, `convert_chat_to_responses`, `should_drop_params`, and `should_convert_params`.
- **Schema: OTEL plugin**: Expanded `trace_type` enum to `genai_extension`, `vercel`, `open_inference` (was only `otel`).
- **Schema: SCIM**: Renamed `saml_config` to `scim_config`. Added `apiToken` to `okta_config` and made `clientSecret` and `apiToken` required. Changed cluster `dial_timeout` from integer (nanoseconds) to Go duration string.
- **Schema: misc**: Added `prompts` and `builtin` to plugin name/placement enums. Added `provider_configs` fields `id`, `description`, `network_config`, `proxy_config`, `custom_provider_config`, `concurrency_and_buffer_size`, and `openai_config`. Added `scim_config` top-level ref. Normalized multi-item `required` arrays to single-line format throughout.

## Type of change

- [ ] Bug fix
- [x] Feature
- [x] Refactor
- [ ] Documentation
- [x] Chore/CI

## Affected areas

- [x] Core (Go)
- [x] Transports (HTTP)
- [x] Providers/Integrations
- [x] Plugins
- [ ] UI (Next.js)
- [ ] Docs

## How to test

```sh
# Validate schema against existing configs
npx ajv validate -s transports/config.schema.json -d your-config.json

# Verify release pipeline runs without test gate
# Push a tagged commit and confirm release jobs trigger directly after detect-changes
```

If upgrading from v1.4.x, set `"version": 1` in your config to preserve allow-all semantics for empty arrays, or migrate empty arrays to `["*"]` and adopt v2 deny-by-default semantics.

## Breaking changes

- [x] Yes
- [ ] No

**Empty arrays in `allowed_models`, `key_ids`, `tools_to_execute`, `provider_configs`, and `mcp_configs` now deny all access by default (v2 semantics).** To allow all, use `["*"]`. To restore v1.4.x behavior, set `"version": 1` at the top level of your config. `enable_litellm_fallbacks` has been removed; replace it with the `compat` object. `saml_config` has been renamed to `scim_config`. `budget_id` has been removed from virtual keys and `virtual_key_provider_config`. `calendar_aligned` has moved from the budget object to the virtual key.
`deployments` has been removed from `azure_key_config` and `vertex_key_config`; use `aliases` on the key instead. `retry_backoff_initial_ms`/`retry_backoff_max_ms` renamed to `retry_backoff_initial`/`retry_backoff_max`. The `websocket` MCP connection type has been removed; use `http` or `sse`. Okta SCIM config now requires `clientSecret` and `apiToken`. ## Related issues N/A ## Security considerations The `insecure_skip_verify` and `ca_cert_pem` fields on `network_config` expose TLS bypass options; these should only be used in controlled environments. The `per_user_oauth` auth type for MCP introduces per-user credential flows that require careful OAuth config management. Removal of test gates from the release pipeline means regressions from flaky provider APIs will no longer block releases, but also means real failures could ship if not caught by other means. ## Checklist - [ ] I read `docs/contributing/README.md` and followed the guidelines - [ ] I added/updated tests where appropriate - [ ] I updated documentation where needed - [ ] I verified builds succeed (Go and UI) - [ ] I verified the CI pipeline passes locally if applicable
## Summary Bumps `core` to v1.4.21 and updates `transports` to depend on `core` v1.4.20, while removing a now-unnecessary workspace Go directive workaround that was previously required to satisfy the toolchain constraint introduced by `core` v1.4.19. ## Changes - Incremented `core` version from `1.4.20` to `1.4.21` - Updated `transports/go.mod` to reference `core` v1.4.20 (previously v1.4.19) - Removed the `go work edit -go=1.26.2 -toolchain=go1.26.2` workaround from the workspace setup script, which was only needed to satisfy the toolchain requirement imposed by the published `core` v1.4.19 ## Type of change - [ ] Bug fix - [ ] Feature - [ ] Refactor - [ ] Documentation - [x] Chore/CI ## Affected areas - [x] Core (Go) - [x] Transports (HTTP) - [ ] Providers/Integrations - [ ] Plugins - [ ] UI (Next.js) - [ ] Docs ## How to test ```sh go work sync go test ./... ``` Verify the workspace initializes without the explicit Go/toolchain directive and that all modules resolve correctly. ## Breaking changes - [ ] Yes - [x] No ## Related issues ## Security considerations None. ## Checklist - [ ] I read `docs/contributing/README.md` and followed the guidelines - [ ] I added/updated tests where appropriate - [ ] I updated documentation where needed - [ ] I verified builds succeed (Go and UI) - [ ] I verified the CI pipeline passes locally if applicable
Actionable comments posted: 1
♻️ Duplicate comments (4)
core/bifrost.go (4)
3312-3334: ⚠️ Potential issue | 🟠 Major: Keep the old provider alive until the replacement is fully published.
By the time these rollback paths run, `oldPq` has already been signaled, waited out, and drained. If provider construction or the providers-slice swap fails, the rollback deletes `newPq` and removes the provider entirely, so a bad update becomes an outage instead of a failed update. Delay tearing down the old provider until the new one is fully constructed and swapped in, or restore `oldPq`/the old provider on failure.

Also applies to: 3345-3360
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@core/bifrost.go` around lines 3312 - 3334, Rollback currently tears down oldPq before new provider is fully published, causing outages if provider creation or slice-swap fails; change UpdateProvider so createBaseProvider and the providers-slice swap complete before signaling/tearing down oldPq (or, if any step fails, restore oldPq and the old provider entry): specifically, delay calling oldPq.signalClosing(), requestQueues.Delete(oldKey)/waitGroups.Delete(oldKey) and drainQueueWithErrors(oldPq) until after the new provider is constructed and bifrost.providers has been replaced, and in the error path for createBaseProvider/removeProviderFromSlice ensure you re-insert the original oldPq into requestQueues/waitGroups and leave it active (or properly swap it back) so the existing provider remains serving while the update either succeeds or is rolled back.
4483-4537: ⚠️ Potential issue | 🟠 Major: Re-check for a replacement queue before returning shutdown errors.
These late branches still return `"provider is shutting down"` once `oldPq` flips to closing after the initial reroute check. During hot updates, `requestQueues` can already point at a healthy replacement queue, so in-flight producers still fail instead of transparently rerouting. Re-load `requestQueues` in the `<-pq.done` and second `pq.isClosing()` paths before giving up.

Also applies to: 4758-4812
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@core/bifrost.go` around lines 4483 - 4537, The shutdown branches still return "provider is shutting down" even if requestQueues has been replaced; before releasing the message and returning in the <-pq.done branch and the second pq.isClosing() path, reload the current requestQueues mapping and check for a replacement queue for the same provider/model (use the same key logic used earlier), and if a healthy replacement exists attempt to send the msg into that new queue (or update pq and retry the send logic) instead of immediately releasing and returning; only if no replacement is available proceed to call bifrost.releaseChannelMessage and return the appropriate newBifrostErrorFromMsg or newBifrostCtxDoneError.
5231-5238: ⚠️ Potential issue | 🔴 Critical: Don't release streaming pipelines outside the finalizer.
A stream that started successfully can still flip into `bifrostError != nil` after Line 4963's `CheckFirstStreamChunkForError`, but this branch still returns the pipeline to the pool directly. If the provider/drain path later invokes `BifrostContextKeyPostHookSpanFinalizer`, the same pipeline gets released twice and can be reused concurrently. Route started-stream errors through the stored finalizer (or a shared finalize helper) instead of calling `releasePluginPipeline` here.

Suggested fix

```diff
-	// Release pipeline immediately for non-streaming requests only
-	// For streaming, the pipeline is released in the postHookSpanFinalizer after streaming completes
-	// Exception: if streaming request has an error, release immediately since finalizer won't be called
-	if pipeline != nil && (!IsStreamRequestType(req.RequestType) || bifrostError != nil) {
-		bifrost.releasePluginPipeline(pipeline)
-	}
+	// Non-streaming requests own the pipeline directly.
+	// Streaming requests must always release through the shared finalizer so
+	// the sync.Once guard covers normal completion and started-stream errors.
+	if pipeline != nil {
+		if !IsStreamRequestType(req.RequestType) {
+			bifrost.releasePluginPipeline(pipeline)
+		} else if bifrostError != nil {
+			if finalizer, ok := req.Context.Value(schemas.BifrostContextKeyPostHookSpanFinalizer).(func(context.Context)); ok {
+				finalizer(req.Context)
+			} else {
+				bifrost.releasePluginPipeline(pipeline)
+			}
+		}
+	}
```

Also applies to: 5252-5257
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@core/bifrost.go` around lines 5231 - 5238, The current code releases the streaming pipeline directly (bifrost.releasePluginPipeline) while also storing postHookSpanFinalizer (which calls pipeline.FinalizeStreamingPostHookSpans and releasePluginPipeline) into req.Context under BifrostContextKeyPostHookSpanFinalizer, risking double-release if the finalizer runs later; instead, whenever a started-stream error is detected (e.g., after CheckFirstStreamChunkForError) or any early-return path would release the pipeline, call the stored postHookSpanFinalizer (or delegate to a shared finalize helper that runs finalizerOnce) rather than calling releasePluginPipeline directly so the pipeline is returned exactly once via finalizerOnce guarding pipeline.FinalizeStreamingPostHookSpans and bifrost.releasePluginPipeline.
6150-6189: ⚠️ Potential issue | 🟠 Major: The final drain still has a post-sweep enqueue gap.
This helper exits on the first `default`, but senders can still win `pq.queue <- msg` afterward because the queue stays open and there is no sender quiescence/refcount. Those requests have no worker and no later sweep, so the caller blocks until its own context is cancelled, and indefinitely if it has no deadline.

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@core/bifrost.go` around lines 6150 - 6189, The drain still races with producers because pq.queue remains writable after workers exit; fix by adding a producer-quiesce signal (e.g. a boolean + mutex or a dedicated closed channel on ProviderQueue) and use it from tryRequest to refuse enqueues when closing: introduce/flip e.g. pq.quiesced (or pq.quiesceCh) before calling drainQueueWithErrors in signalClosing, update tryRequest to check pq.isClosing()/pq.quiesced/pq.quiesceCh and return the "provider is shutting down" error instead of attempting pq.queue <- msg, then run drainQueueWithErrors to sweep remaining messages knowing no new sends will be attempted. Ensure references: modify tryRequest, signalClosing flow that calls drainQueueWithErrors(pq), and ProviderQueue to add the quiesce flag/channel and its setter/reader.
🧹 Nitpick comments (1)
core/providers/mistral/mistral.go (1)
161-173: Move `normalizeChatRequestForConversion` to `utils.go` for provider helper consistency. Implementation is good, but this unexported helper should live in `core/providers/mistral/utils.go` per repository structure rules. As per coding guidelines, “Provider constants, base URLs, and unexported helpers belong in `utils.go`; error parsing belongs in `errors.go`.”
Verify each finding against the current code and only fix it if needed. In `@core/providers/mistral/mistral.go` around lines 161 - 173, Move the unexported helper normalizeChatRequestForConversion (receiver *MistralProvider, returning *schemas.BifrostChatRequest) out of mistral.go into a new or existing core/providers/mistral/utils.go file so it follows the provider helper convention; keep the exact signature, body (including checks for request == nil, provider.customProviderConfig, and request.Provider == schemas.Mistral), and its comment, remove the original from mistral.go, and ensure any callers still resolve (no API change since package stays the same).
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Inline comments:
In @.github/workflows/release-pipeline.yml:
- Around line 90-91: The release workflow removed automated test gates and must
be updated to require test jobs before release; modify
.github/workflows/release-pipeline.yml so the release job(s) include the test
jobs in their needs list (e.g., add needs entries for the existing job names
pr-tests, e2e-tests, snyk, scorecards or the appropriate test job IDs used in
the diff) or reintroduce a composite test job that runs automatically and add it
to needs alongside check-skip and detect-changes; alternatively, if you
intentionally accept manual testing, add a clear comment and link to documented
approval steps and ensure branch protection rules block merges without those
manual approvals.
---
Duplicate comments:
In `@core/bifrost.go`:
- Around line 3312-3334: Rollback currently tears down oldPq before new provider
is fully published, causing outages if provider creation or slice-swap fails;
change UpdateProvider so createBaseProvider and the providers-slice swap
complete before signaling/tearing down oldPq (or, if any step fails, restore
oldPq and the old provider entry): specifically, delay calling
oldPq.signalClosing(), requestQueues.Delete(oldKey)/waitGroups.Delete(oldKey)
and drainQueueWithErrors(oldPq) until after the new provider is constructed and
bifrost.providers has been replaced, and in the error path for
createBaseProvider/removeProviderFromSlice ensure you re-insert the original
oldPq into requestQueues/waitGroups and leave it active (or properly swap it
back) so the existing provider remains serving while the update either succeeds
or is rolled back.
- Around line 4483-4537: The shutdown branches still return "provider is
shutting down" even if requestQueues has been replaced; before releasing the
message and returning in the <-pq.done branch and the second pq.isClosing()
path, reload the current requestQueues mapping and check for a replacement queue
for the same provider/model (use the same key logic used earlier), and if a
healthy replacement exists attempt to send the msg into that new queue (or
update pq and retry the send logic) instead of immediately releasing and
returning; only if no replacement is available proceed to call
bifrost.releaseChannelMessage and return the appropriate newBifrostErrorFromMsg
or newBifrostCtxDoneError.
- Around line 5231-5238: The current code releases the streaming pipeline
directly (bifrost.releasePluginPipeline) while also storing
postHookSpanFinalizer (which calls pipeline.FinalizeStreamingPostHookSpans and
releasePluginPipeline) into req.Context under
BifrostContextKeyPostHookSpanFinalizer, risking double-release if the finalizer
runs later; instead, whenever a started-stream error is detected (e.g., after
CheckFirstStreamChunkForError) or any early-return path would release the
pipeline, call the stored postHookSpanFinalizer (or delegate to a shared
finalize helper that runs finalizerOnce) rather than calling
releasePluginPipeline directly so the pipeline is returned exactly once via
finalizerOnce guarding pipeline.FinalizeStreamingPostHookSpans and
bifrost.releasePluginPipeline.
- Around line 6150-6189: The drain still races with producers because pq.queue
remains writable after workers exit; fix by adding a producer-quiesce signal
(e.g. a boolean + mutex or a dedicated closed channel on ProviderQueue) and use
it from tryRequest to refuse enqueues when closing: introduce/flip e.g.
pq.quiesced (or pq.quiesceCh) before calling drainQueueWithErrors in
signalClosing, update tryRequest to check
pq.isClosing()/pq.quiesced/pq.quiesceCh and return the "provider is shutting
down" error instead of attempting pq.queue <- msg, then run drainQueueWithErrors
to sweep remaining messages knowing no new sends will be attempted. Ensure
references: modify tryRequest, signalClosing flow that calls
drainQueueWithErrors(pq), and ProviderQueue to add the quiesce flag/channel and
its setter/reader.
---
Nitpick comments:
In `@core/providers/mistral/mistral.go`:
- Around line 161-173: Move the unexported helper
normalizeChatRequestForConversion (receiver *MistralProvider, returning
*schemas.BifrostChatRequest) out of mistral.go into a new or existing
core/providers/mistral/utils.go file so it follows the provider helper
convention; keep the exact signature, body (including checks for request == nil,
provider.customProviderConfig, and request.Provider == schemas.Mistral), and its
comment, remove the original from mistral.go, and ensure any callers still
resolve (no API change since package stays the same).
ℹ️ Review info
⚙️ Run configuration
Configuration used: Organization UI
Review profile: CHILL
Plan: Pro
Run ID: cc1c1c61-46db-43bc-bb58-27a058247c4b
⛔ Files ignored due to path filters (1)
`transports/go.sum` is excluded by `!**/*.sum`
📒 Files selected for processing (7)
- `.github/workflows/release-pipeline.yml`
- `.github/workflows/scripts/setup-go-workspace.sh`
- `core/bifrost.go`
- `core/providers/bedrock/bedrock_test.go`
- `core/providers/mistral/mistral.go`
- `core/version`
- `transports/go.mod`
✅ Files skipped from review due to trivial changes (1)
- core/version
🚧 Files skipped from review as they are similar to previous changes (1)
- .github/workflows/scripts/setup-go-workspace.sh
```yaml
needs: [check-skip, detect-changes]
if: "needs.check-skip.outputs.should-skip != 'true' && needs.detect-changes.outputs.core-needs-release == 'true'"
```
There was a problem hiding this comment.
🧩 Analysis chain
🏁 Script executed:
#!/bin/bash
# Check if tests are run in other workflows before merging to main
echo "=== Checking PR test workflows ==="
rg -l 'test|Test' .github/workflows/*.yml | head -10
echo ""
echo "=== Looking for test job definitions in pr-tests.yml ==="
rg -n 'jobs:|test-' .github/workflows/pr-tests.yml 2>/dev/null | head -20 || echo "pr-tests.yml not found"

Repository: maximhq/bifrost
Length of output: 501
🏁 Script executed:
# Get context around lines 90-91 in release-pipeline.yml
sed -n '80,100p' .github/workflows/release-pipeline.yml
echo "=== Full jobs section of release-pipeline.yml ==="
rg -A 2 'jobs:' .github/workflows/release-pipeline.yml | head -30

Repository: maximhq/bifrost
Length of output: 945
🏁 Script executed:
# Check pr-tests.yml to see if it tests on PRs
head -50 .github/workflows/pr-tests.yml

Repository: maximhq/bifrost
Length of output: 1576
🏁 Script executed:
# Look for branch protection rules or merge gate configurations
if [ -f '.github/CODEOWNERS' ]; then cat .github/CODEOWNERS; fi
echo "=== Checking for branch rules ==="
find . -name 'branch*.yml' -o -name '*branch*.json' 2>/dev/null | head -5

Repository: maximhq/bifrost
Length of output: 386
🏁 Script executed:
# Check e2e-tests.yml and snyk.yml for automatic PR triggers
echo "=== e2e-tests.yml trigger config ==="
head -30 .github/workflows/e2e-tests.yml | grep -A 10 "^on:"
echo ""
echo "=== snyk.yml trigger config ==="
head -30 .github/workflows/snyk.yml | grep -A 10 "^on:"
echo ""
echo "=== scorecards.yml trigger config ==="
head -30 .github/workflows/scorecards.yml | grep -A 10 "^on:"

Repository: maximhq/bifrost
Length of output: 949
🏁 Script executed:
# Check if there are any other test definitions in release-pipeline.yml itself
echo "=== All job names in release-pipeline.yml ==="
rg -o 'test-\w+|Test-\w+' .github/workflows/release-pipeline.yml | sort -u
echo ""
echo "=== Count of job definitions ==="
rg -c '^  \w+:$' .github/workflows/release-pipeline.yml

Repository: maximhq/bifrost
Length of output: 140
🏁 Script executed:
# Look for the git history or comments indicating why tests were removed
echo "=== Comments or documentation in release-pipeline.yml about test gates ==="
rg -i 'test|gate|quality|release' .github/workflows/release-pipeline.yml | head -15

Repository: maximhq/bifrost
Length of output: 925
Critical risk: Test gates removed with no automatic testing coverage for PRs or releases.
Test job dependencies have been removed from the release pipeline. However, automatic test coverage is not present elsewhere:
- pr-tests.yml requires manual
workflow_dispatchapproval (not automatic) - e2e-tests.yml runs only on feature branch
02-27-feat_extend_e2e_ui_tests, not on main/PRs - snyk.yml and scorecards.yml are security-focused, not functional test gates
Releases can now proceed without any automated functional test validation. This removes a critical quality gate. Either:
- Restore automatic test dependencies in release jobs, or
- Implement automatic PR test gates that block merges to main, or
- Document why manual testing approval is acceptable for releases
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In @.github/workflows/release-pipeline.yml around lines 90 - 91, The release
workflow removed automated test gates and must be updated to require test jobs
before release; modify .github/workflows/release-pipeline.yml so the release
job(s) include the test jobs in their needs list (e.g., add needs entries for
the existing job names pr-tests, e2e-tests, snyk, scorecards or the appropriate
test job IDs used in the diff) or reintroduce a composite test job that runs
automatically and add it to needs alongside check-skip and detect-changes;
alternatively, if you intentionally accept manual testing, add a clear comment
and link to documented approval steps and ensure branch protection rules block
merges without those manual approvals.
## Summary Bumps the core version to `1.4.22` and rolls back dependency versions across the framework, plugins, and transports to align with a prior stable set of releases. This resolves a version inconsistency introduced by forward-referencing newer module versions that were not yet intended to be consumed by downstream packages. ## Changes - Incremented `core/version` from `1.4.21` to `1.4.22` - Downgraded `bifrost/core` from `v1.4.19` → `v1.4.17` across `framework`, `governance`, `jsonparser`, `litellmcompat`, `logging`, `maxim`, `mocker`, `otel`, `semanticcache`, and `telemetry` plugins - Downgraded `bifrost/framework` from `v1.2.38` → `v1.2.36` (or `v1.2.35` for `semanticcache`) across all dependent plugins - Downgraded `bifrost/core` in `transports` from `v1.4.20` → `v1.4.19` - Downgraded all plugin versions referenced in `transports` (governance, litellmcompat, logging, maxim, otel, semanticcache, telemetry) to their corresponding prior releases - Downgraded `go.opentelemetry.io/otel/sdk` and `go.opentelemetry.io/otel/sdk/metric` from `v1.43.0` → `v1.40.0` in affected plugins - Bumped Go toolchain version in `transports/go.mod` from `1.26.1` to `1.26.2` ## Type of change - [ ] Bug fix - [ ] Feature - [ ] Refactor - [ ] Documentation - [x] Chore/CI ## Affected areas - [x] Core (Go) - [x] Transports (HTTP) - [ ] Providers/Integrations - [x] Plugins - [ ] UI (Next.js) - [ ] Docs ## How to test ```sh go test ./... ``` Verify that all modules resolve correctly with the pinned dependency versions and that no import errors occur during build. ## Breaking changes - [ ] Yes - [x] No ## Related issues N/A ## Security considerations None. These are internal module version adjustments with no changes to auth, secrets, or data handling. 
## Checklist - [ ] I read `docs/contributing/README.md` and followed the guidelines - [ ] I added/updated tests where appropriate - [ ] I updated documentation where needed - [ ] I verified builds succeed (Go and UI) - [ ] I verified the CI pipeline passes locally if applicable
## Summary Pins the `bifrost/core` dependency in the example plugin modules to a consistent released version (`v1.4.17`), removing a local `replace` directive that was pointing to the local `core` module path. ## Changes - Replaced the local `replace` directive in `hello-world-wasm-go/go.mod` with a direct reference to `github.com/maximhq/bifrost/core v1.4.17` - Downgraded `hello-world/go.mod` from `v1.4.19` to `v1.4.17` to align both example plugins on the same released version ## Type of change - [ ] Bug fix - [ ] Feature - [ ] Refactor - [ ] Documentation - [x] Chore/CI ## Affected areas - [ ] Core (Go) - [ ] Transports (HTTP) - [ ] Providers/Integrations - [x] Plugins - [ ] UI (Next.js) - [ ] Docs ## How to test ```sh cd examples/plugins/hello-world-wasm-go go mod tidy go build ./... cd examples/plugins/hello-world go mod tidy go build ./... ``` ## Breaking changes - [ ] Yes - [x] No ## Related issues ## Security considerations No security implications. This change only affects dependency resolution for example plugin modules. ## Checklist - [ ] I read `docs/contributing/README.md` and followed the guidelines - [ ] I added/updated tests where appropriate - [ ] I updated documentation where needed - [ ] I verified builds succeed (Go and UI) - [ ] I verified the CI pipeline passes locally if applicable
Actionable comments posted: 1
🧹 Nitpick comments (1)
.github/workflows/scripts/release-bifrost-http-prep.sh (1)
82-82: Extract Go version into a variable to prevent maintenance drift. Lines 82 and 104 both hardcode `1.26.1`. Pull this into a single variable at the top of the script so future toolchain updates can't partially update this file.

♻️ Proposed refactor

```diff
 VERSION="$1"
+GO_MOD_VERSION="1.26.1"
@@
-go mod edit -go=1.26.1 -toolchain=none
+go mod edit -go="$GO_MOD_VERSION" -toolchain=none
@@
-go mod edit -go=1.26.1 -toolchain=none
+go mod edit -go="$GO_MOD_VERSION" -toolchain=none
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In @.github/workflows/scripts/release-bifrost-http-prep.sh at line 82, Extract the hardcoded Go version "1.26.1" into a single shell variable (e.g., GO_VERSION) defined near the top of the script and replace both occurrences of the literal in the commands (the go mod edit invocation shown as go mod edit -go=1.26.1 -toolchain=none and the other use at line ~104) to reference that variable (e.g., "$GO_VERSION") so future updates only require changing one value.
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Inline comments:
In @.github/workflows/scripts/release-bifrost-http-prep.sh:
- Around line 103-105: Extract the hardcoded toolchain version into a variable
(e.g., TOOLCHAIN_VERSION="1.26.1") and replace both occurrences of the literal
1.26.1 used with go mod edit so they stay in sync; wrap the network step "go mod
tidy" with the repository's existing backoff/retry helpers (imported from
go-utils.sh) so the tidy invocation is retried on transient failures (i.e., call
the provided retry/backoff function around the "go mod tidy" command instead of
invoking it directly).
---
Nitpick comments:
In @.github/workflows/scripts/release-bifrost-http-prep.sh:
- Line 82: Extract the hardcoded Go version "1.26.1" into a single shell
variable (e.g., GO_VERSION) defined near the top of the script and replace both
occurrences of the literal in the commands (the go mod edit invocation shown as
go mod edit -go=1.26.1 -toolchain=none and the other use at line ~104) to
reference that variable (e.g., "$GO_VERSION") so future updates only require
changing one value.
ℹ️ Review info
⚙️ Run configuration
Configuration used: Organization UI
Review profile: CHILL
Plan: Pro
Run ID: a677e80e-d015-4e68-811a-e936544278f6
⛔ Files ignored due to path filters (12)
The following are excluded by `!**/*.sum`:

- `examples/plugins/hello-world/go.sum`
- `framework/go.sum`
- `plugins/governance/go.sum`
- `plugins/jsonparser/go.sum`
- `plugins/litellmcompat/go.sum`
- `plugins/logging/go.sum`
- `plugins/maxim/go.sum`
- `plugins/mocker/go.sum`
- `plugins/otel/go.sum`
- `plugins/semanticcache/go.sum`
- `plugins/telemetry/go.sum`
- `transports/go.sum`
📒 Files selected for processing (15)
- `.github/workflows/scripts/release-bifrost-http-prep.sh`
- `core/version`
- `examples/plugins/hello-world-wasm-go/go.mod`
- `examples/plugins/hello-world/go.mod`
- `framework/go.mod`
- `plugins/governance/go.mod`
- `plugins/jsonparser/go.mod`
- `plugins/litellmcompat/go.mod`
- `plugins/logging/go.mod`
- `plugins/maxim/go.mod`
- `plugins/mocker/go.mod`
- `plugins/otel/go.mod`
- `plugins/semanticcache/go.mod`
- `plugins/telemetry/go.mod`
- `transports/go.mod`
✅ Files skipped from review due to trivial changes (1)
- core/version
```sh
# Re-normalize before tidy in case any edit reintroduced a toolchain line
go mod edit -go=1.26.1 -toolchain=none
go mod tidy
```
🧩 Analysis chain
🏁 Script executed:
#!/bin/bash
set -euo pipefail
# Verify where retry/backoff is currently applied in this prep flow
rg -n -C2 'go mod tidy|go mod edit|backoff|retry|go_get_with_backoff' .github/workflows/scripts/release-bifrost-http-prep.sh
rg -n -C2 'backoff|retry|go_get_with_backoff' .github/workflows/scripts/go-utils.sh

Repository: maximhq/bifrost
Length of output: 1802
Add retry/backoff around go mod tidy to reduce release flakiness.
After switching dependency bumps to go mod edit, Line 105 is now the first network-resolving step. A transient proxy/network failure will fail the whole prep job immediately. While this script sources go-utils.sh which provides backoff utilities, they are not applied to go mod tidy.
Additionally, the toolchain version 1.26.1 is duplicated on Lines 82 and 104; consider extracting it as a variable to reduce drift risk on future version bumps.
🔧 Proposed fix

```diff
 # Re-normalize before tidy in case any edit reintroduced a toolchain line
 go mod edit -go=1.26.1 -toolchain=none
-go mod tidy
+max_attempts=5
+attempt=1
+until go mod tidy; do
+  if [ "$attempt" -ge "$max_attempts" ]; then
+    echo "❌ Error: go mod tidy failed after $max_attempts attempts"
+    exit 1
+  fi
+  sleep $((2 ** attempt))
+  attempt=$((attempt + 1))
+done
```

📝 Committable suggestion
‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.
```sh
# Re-normalize before tidy in case any edit reintroduced a toolchain line
go mod edit -go=1.26.1 -toolchain=none
max_attempts=5
attempt=1
until go mod tidy; do
  if [ "$attempt" -ge "$max_attempts" ]; then
    echo "❌ Error: go mod tidy failed after $max_attempts attempts"
    exit 1
  fi
  sleep $((2 ** attempt))
  attempt=$((attempt + 1))
done
```
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In @.github/workflows/scripts/release-bifrost-http-prep.sh around lines 103 -
105, Extract the hardcoded toolchain version into a variable (e.g.,
TOOLCHAIN_VERSION="1.26.1") and replace both occurrences of the literal 1.26.1
used with go mod edit so they stay in sync; wrap the network step "go mod tidy"
with the repository's existing backoff/retry helpers (imported from go-utils.sh)
so the tidy invocation is retried on transient failures (i.e., call the provided
retry/backoff function around the "go mod tidy" command instead of invoking it
directly).
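Both recommendations can be combined in one sketch. The helper name `retry_with_backoff` and the `TOOLCHAIN_VERSION` variable are illustrative assumptions, not the repo's actual API; `go-utils.sh` may already expose an equivalent helper under a different name.

```shell
#!/usr/bin/env bash
set -euo pipefail

# Assumption: a single source of truth for the toolchain version that is
# currently duplicated on Lines 82 and 104 of the prep script.
TOOLCHAIN_VERSION="1.26.1"

# Illustrative retry helper with exponential backoff (2s, 4s, 8s, ...);
# the repo's go-utils.sh may already provide an equivalent.
retry_with_backoff() {
  local max_attempts=5 attempt=1
  until "$@"; do
    if [ "$attempt" -ge "$max_attempts" ]; then
      echo "Error: '$*' failed after $max_attempts attempts" >&2
      return 1
    fi
    sleep $((2 ** attempt))
    attempt=$((attempt + 1))
  done
}

# The prep script would then call:
#   go mod edit -go="$TOOLCHAIN_VERSION" -toolchain=none
#   retry_with_backoff go mod tidy
```

Because `go mod tidy` is the first network-resolving step, wrapping only that invocation keeps the rest of the script's failure behavior unchanged.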
Actionable comments posted: 1
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Inline comments:
In `@docs/changelogs/v1.4.23.mdx`:
- Line 44: The changelog uses two spellings of the word ("marshaling" on Line 30
and "marshalling" on Line 44); choose one spelling and make it consistent across
docs/changelogs/v1.4.23.mdx by replacing the variant at Line 44 (the entry "-
**Payload Marshalling** — Removed unnecessary marshalling of payload in the
transport path") to match the variant used on Line 30 (or vice versa), ensuring
all occurrences of "marshaling"/"marshalling" in this document use the same
spelling.
ℹ️ Review info
⚙️ Run configuration
Configuration used: Organization UI
Review profile: CHILL
Plan: Pro
Run ID: c81da04b-ac9b-4d5b-8771-ce385d040399
📒 Files selected for processing (2)
- `docs/changelogs/v1.4.23.mdx`
- `docs/docs.json`
```
- **Async Context Propagation** — Preserve context values in async requests so downstream handlers retain request-scoped data
- **Custom Providers** — Allow custom providers without a list-models endpoint to accept any model rather than restricting on virtual key registration
- **OTEL Plugin** — Default `insecure` to `true` in config.json and include fallbacks in emitted OTEL metrics
- **Payload Marshalling** — Removed unnecessary marshalling of payload in the transport path
```
Keep “marshaling/marshalling” spelling consistent in this changelog entry.
Line 30 uses “marshaling” while Line 44 uses “marshalling”; use one variant consistently in the same document.
✏️ Suggested edit

```diff
-- **Payload Marshalling** — Removed unnecessary marshalling of payload in the transport path
+- **Payload Marshaling** — Removed unnecessary marshaling of payload in the transport path
```

📝 Committable suggestion
```
- **Payload Marshaling** — Removed unnecessary marshaling of payload in the transport path
```
🧰 Tools
🪛 LanguageTool
[uncategorized] ~44-~44: Do not mix variants of the same word (‘marshalling’ and ‘marshaling’) within a single text.
Context: ...oad Marshalling** — Removed unnecessary marshalling of payload in the transport path - **He...
(EN_EXACT_COHERENCY_RULE)
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@docs/changelogs/v1.4.23.mdx` at line 44, The changelog uses two spellings of
the word ("marshaling" on Line 30 and "marshalling" on Line 44); choose one
spelling and make it consistent across docs/changelogs/v1.4.23.mdx by replacing
the variant at Line 44 (the entry "- **Payload Marshalling** — Removed
unnecessary marshalling of payload in the transport path") to match the variant
used on Line 30 (or vice versa), ensuring all occurrences of
"marshaling"/"marshalling" in this document use the same spelling.