
fix: update gemini-live model endpoints and mode to realtime#22814

Merged

Chesars merged 5 commits into BerriAI:main from Chesars:fix/gemini-live-supported-endpoints
Mar 4, 2026

Conversation

Chesars (Collaborator) commented Mar 4, 2026

Relevant issues

Supersedes #18009

Pre-Submission checklist

  • N/A - JSON config only, no code changes
  • My PR passes all unit tests on make test-unit
  • My PR's scope is as isolated as possible, it only solves 1 specific problem

Type

🐛 Bug Fix

Changes

The gemini-live-2.5-flash-preview-native-audio-09-2025 model was incorrectly configured with mode: "chat" and REST API endpoints (/v1/chat/completions, /v1/completions), but this model only works with WebSockets (Realtime API).

What changed:

  • supported_endpoints: Updated to correct realtime endpoints:
    • gemini-live-* (vertex_ai) → /vertex_ai/live
    • gemini/gemini-live-* (gemini) → /v1/realtime
  • mode: Changed from "chat" to "realtime" so health checks use _realtime_health_check() (WebSocket) instead of acompletion() (REST), which would fail for this model
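
To illustrate, the corrected entries would look roughly like this (a sketch based on the PR description; the real entries in model_prices_and_context_window.json also carry pricing and context-window fields that are omitted here):

```json
{
  "gemini-live-2.5-flash-preview-native-audio-09-2025": {
    "mode": "realtime",
    "supported_endpoints": ["/vertex_ai/live"]
  },
  "gemini/gemini-live-2.5-flash-preview-native-audio-09-2025": {
    "mode": "realtime",
    "supported_endpoints": ["/v1/realtime"]
  }
}
```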

Files changed:

  • model_prices_and_context_window.json
  • litellm/model_prices_and_context_window_backup.json

github-actions bot and others added 5 commits March 3, 2026 17:52
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>

Commit messages:

  • The gemini-live-2.5-flash-preview-native-audio-09-2025 model only works with WebSocket (Live API), not REST endpoints. Changed supported_endpoints from /v1/chat/completions to /vertex_ai/live to reflect the actual passthrough endpoint available in the LiteLLM proxy.
  • The gemini/ prefix indicates Google AI Studio, which uses the /v1/realtime endpoint (OpenAI-compatible), not /vertex_ai/live.
  • The mode field is used by health checks to determine the correct check method (WebSocket for realtime vs. REST for chat).
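
The mode-to-health-check dispatch described in that last commit message can be sketched as follows (the function bodies and the dict-based dispatch here are illustrative assumptions, not LiteLLM's actual implementation):

```python
import asyncio


async def _realtime_health_check(model: str) -> str:
    # Illustrative stand-in for a WebSocket-based probe of a realtime model.
    return f"{model}: websocket ok"


async def acompletion_health_check(model: str) -> str:
    # Illustrative stand-in for a REST acompletion() probe of a chat model.
    return f"{model}: rest ok"


# mode -> health check handler, mirroring the idea in the PR:
# "realtime" models must be probed over WebSocket; a REST call would fail.
HEALTH_CHECKS = {
    "realtime": _realtime_health_check,
    "chat": acompletion_health_check,
}


async def run_health_check(model: str, mode: str) -> str:
    # Fall back to the REST check for any unrecognized mode.
    handler = HEALTH_CHECKS.get(mode, acompletion_health_check)
    return await handler(model)


if __name__ == "__main__":
    print(asyncio.run(run_health_check(
        "gemini-live-2.5-flash-preview-native-audio-09-2025", "realtime")))
```

With the old config (mode: "chat"), the same model would have been routed to the REST check, which is exactly the failure this PR fixes.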
vercel bot commented Mar 4, 2026

The latest updates on your projects:

Project: litellm | Deployment: Error | Updated (UTC): Mar 4, 2026 10:44pm


CLA assistant check
Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you all sign our Contributor License Agreement before we can accept your contribution.
1 out of 2 committers have signed the CLA.

✅ Chesars
❌ github-actions[bot]

greptile-apps bot (Contributor) commented Mar 4, 2026

Greptile Summary

Fixes incorrect configuration for gemini-live-2.5-flash-preview-native-audio-09-2025 models, which are WebSocket-only (Realtime API) but were misconfigured with REST API settings. Changes the mode from "chat" to "realtime" so health checks use _realtime_health_check() (WebSocket) instead of acompletion() (REST), and updates supported_endpoints to the correct WebSocket routes (/vertex_ai/live for Vertex AI, /v1/realtime for Gemini API).

  • Both the primary and backup JSON files are updated consistently
  • The endpoint values match the registered WebSocket routes in litellm/proxy/proxy_server.py
  • The mode: "realtime" value correctly maps to the realtime health check handler in litellm/litellm_core_utils/health_check_helpers.py
  • No code changes required — this is a JSON config-only fix, consistent with the project's convention of storing model-specific flags in model_prices_and_context_window.json

Confidence Score: 5/5

  • This PR is safe to merge — it corrects a config-only bug in JSON metadata with no code changes.
  • The changes are minimal, well-scoped, and correct. Both JSON files are updated consistently, the endpoint values match existing registered WebSocket routes, and the mode value maps to a valid health check handler. This is a straightforward bug fix with no risk of regression.
  • No files require special attention.

Important Files Changed

  • model_prices_and_context_window.json: Corrects mode from "chat" to "realtime" and updates supported_endpoints to the proper WebSocket endpoints for both gemini-live model variants. Changes are consistent and match the proxy server's registered WebSocket routes.
  • litellm/model_prices_and_context_window_backup.json: Backup file mirrors the exact same changes as the primary JSON file, keeping both files in sync.

Flowchart

%%{init: {'theme': 'neutral'}}%%
flowchart TD
    A["gemini-live-2.5-flash model"] --> B{mode?}
    B -->|"Before: chat"| C["acompletion() REST call"]
    C --> D["❌ Fails — model is WebSocket-only"]
    B -->|"After: realtime"| E["_realtime_health_check() WebSocket call"]
    E --> F["✅ Correct health check"]

    G["Vertex AI variant"] --> H["/vertex_ai/live endpoint"]
    I["Gemini API variant"] --> J["/v1/realtime endpoint"]

Last reviewed commit: 0e1a633

@Chesars Chesars merged commit 028dd3f into BerriAI:main Mar 4, 2026
29 of 37 checks passed
@Chesars Chesars deleted the fix/gemini-live-supported-endpoints branch March 4, 2026 22:47
