Fix json_tool_call filtering: streaming support, pop->get, behavioral parity #2
Open
jquinter wants to merge 365 commits into haggai-backline:main from
Conversation
…mantic_tool_filter.py tests
This reverts commit 1e8848c.
…lease Litellm tuesday cicd release
…esday_cicd_release Revert "Litellm tuesday cicd release"
) (BerriAI#19220)" This reverts commit 72e5193.
…mantic_tool_filter.py tests
This reverts commit 1e8848c.
BerriAI#20071)" This reverts commit ef73f33.
…lease_final Litellm tuesday cicd release final
Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>
…81.7 bump litellm 1.81.7
update 02 staging PR
There is no reason to merge user messages.
adding together ai models to litellm models json
[Infra] UI - Testing: Adding Unit Testing Coverage
…uth_mh [Feature] UI - Admin Settings: Add option for Authentication for public AI Hub
[Fix] UI - Model Info Page: Fix Input and Output Labels
…resize [Fix] UI - Model Page: Column Resizing on Smaller Screens
…ation refactor: migrate Ant Design notifications to use `App.useApp()` cont…
…rvers (BerriAI#20602)
- process_mcp_request() now falls back to OAuth2 passthrough when the Authorization header contains a non-LiteLLM token (catches HTTPException and ProxyException 401/403)
- MCPClient._get_auth_headers() adds missing MCPAuth.oauth2 case
…ternal URLs in their agent cards (e.g., http://0.0.0.0:8001/) (BerriAI#20604)
* v1 card resolver fix
* fix: is_localhost_or_internal_url
* fix code
* test_fix_agent_card_url_replaces_localhost
* test restruct
* test_a2a_non_streaming
* test agnts
* add exception handling
* init errors
* add localhost retry
* add agent_testing
* test_a2a_non_streaming
* _build_streaming_logging_obj
* code qa fixes
* test_card_resolver_fallback_from_new_to_old_path
* fix linting
[Re-issue: Fix] Keys and Teams Router Setting + Allow Override of Router Settings
…tibility (BerriAI#20591) Keycloak (and similar OIDC providers) include role claims in the JWT access token but not in the UserInfo endpoint response. Previously, roles were only extracted from UserInfo, causing all SSO users to default to internal_user_view_only regardless of their actual role.
Changes:
- Extract user roles from the JWT access token in process_sso_jwt_access_token() when UserInfo doesn't provide them (tries role_mappings first, then GENERIC_USER_ROLE_ATTRIBUTE)
- Handle list-type role values in get_litellm_user_role(), since Keycloak returns roles as arrays (e.g. ["proxy_admin"] instead of "proxy_admin")
- Add 9 new unit tests covering role extraction and list handling
- Update 3 existing tests for new JWT decode behavior
Closes BerriAI#20407
…#20566) Notes: General support for Opus 4.6 was added in BerriAI#20506; however, it omitted the AU (Australian) specific instance profile used in Bedrock. This change only adds the AU id. It is copied from the US model settings, which is consistent with past additions of this regional model profile.
…CP + Agent guardrail support (BerriAI#20619)
* fix: fix styling
* fix(custom_code_guardrail.py): add http support for custom code guardrails. Allows users to call external guardrails on litellm with minimal code changes (no custom handlers); test guardrail integrations more easily
* feat(a2a/): add guardrails for agent interactions. Allows the same guardrails for llm's to be applied to agents as well
* fix(a2a/): support passing guardrails to a2a from the UI
* style(code-editor): allow editing custom code guardrails on ui + add examples of pre/post calls for custom code guardrails
* feat(mcp/): support custom code guardrails for mcp calls. Allows custom code guardrails to work on mcp input
* feat(chatui.tsx): support guardrails on mcp tool calls on playground
…erriAI#20618)
* fix(mypy): resolve missing return statements and type casting issues
* fix(pangea): use elif to prevent UnboundLocalError and handle None messages
Address Greptile review feedback:
- Make branches mutually exclusive using elif to prevent input_messages from being overwritten
- Handle case where data.get('messages') returns None to avoid passing invalid payload to Pangea API
Co-authored-by: Shin <shin@openclaw.ai>
…lable on Internet (BerriAI#20607)
* update MCPAuthenticatedUser
* add available_on_public_internet for MCPs
* update claude.md
* init IPAddressUtils
* init available_on_public_internet
* add on REST endpoints
* filter with IP
* TestIsInternalIp
* _extract_mcp_headers_from_request
* init get_mcp_client_ip
* _get_general_settings
* allowed_server_ids
* address PR comments
* get_mcp_server_by_name fix
* fix server
* fix review comments
* get_public_mcp_servers
* address _get_allowed_mcp_servers
* test fix
* fix linting
* inint ui types
* add ui for managing MCP private/public
* add ui
* fixes
* add to schema
* add types
* fix endpoint
* add endpoint
* update manager
* test mcp
* dont use external party for ip address
[Fix] /key/list user_id Empty String Edge Case
- a2a_protocol/exception_mapping_utils.py: Fix type ignore comment for None assignment
- caching/redis_cache.py: Add type ignore for async ping return type
- caching/redis_cluster_cache.py: Add type ignore for async ping return type
- llms/deprecated_providers/palm.py: Add type ignore for palm.generate_text
- proxy/auth/handle_jwt.py: Add type ignore for jwt.decode options argument
All changes add appropriate type: ignore comments to handle library typing inconsistencies.
Replace text-embedding-004 with gemini-embedding-001. The old model was deprecated and returns 404: 'models/text-embedding-004 is not found for API version v1beta' Co-authored-by: Shin <shin@openclaw.ai>
…settings [Feature] UI - Team Settings: Soft Budget + Alerting Emails
Update opus 4.6 blog with adaptive thinking
… tools (BerriAI#18384) When both `tools` and `response_format` are used with Bedrock Converse, LiteLLM adds an internal `json_tool_call` tool for structured output. Bedrock may return both this internal tool and real user tools, which breaks consumers like the OpenAI Agents SDK.
Changes:
- Extract filtering logic into a `_filter_json_mode_tools()` method
- Filter out `json_tool_call` when mixed with real tools in responses
- Add streaming support: `AWSEventStreamDecoder` now accepts `json_mode` and suppresses `json_tool_call` chunks, converting arguments to text
- Fix `optional_params.pop("json_mode")` -> `.get()` to avoid mutating the caller's dict (affects logging/retries/metrics downstream)
- Preserve original behavior of setting `tool_calls=[]` for empty lists
Summary
Addresses the 3 findings from the review of PR BerriAI#18384:
- `AWSEventStreamDecoder` now accepts `json_mode` and suppresses `json_tool_call` chunks in streaming, converting its arguments to text content. Updated all callers (`make_call`, `make_sync_call` in both `invoke_handler.py` and `converse_handler.py`) to pass `json_mode` to the decoder.
- `optional_params.pop("json_mode")` -> `.get()` (flagged by Greptile): avoids mutating the caller's dict, which could break downstream logging/retries/metrics that access `optional_params` after response transformation.
- `tool_calls` behavior: `_filter_json_mode_tools()` returns `None` only when `json_tool_call` was the sole tool (converted to content). Returns the list (including `[]`) otherwise, preserving the original behavior of always setting `tool_calls`.

Also extracts the filtering logic into `_filter_json_mode_tools()` and moves `import json` to module level per project conventions.

Test plan
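The pop -> get change can be illustrated with a minimal, self-contained example. The function names below are hypothetical; only `optional_params` and the `json_mode` key come from the PR.

```python
# Why `.pop("json_mode")` during response transformation was a bug:
# it mutates the caller's optional_params, so downstream consumers
# (logging, retries, metrics) that read the dict afterwards no longer
# see the flag. `.get()` reads the value without removing it.

def read_json_mode_with_pop(optional_params: dict) -> bool:
    return optional_params.pop("json_mode", False)  # destructive: removes the key


def read_json_mode_with_get(optional_params: dict) -> bool:
    return optional_params.get("json_mode", False)  # non-destructive


params = {"json_mode": True, "temperature": 0.2}
read_json_mode_with_pop(params)
assert "json_mode" not in params  # flag silently gone for later consumers

params = {"json_mode": True, "temperature": 0.2}
read_json_mode_with_get(params)
assert params["json_mode"] is True  # caller's dict untouched
```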
- `test_transform_response_with_both_json_tool_call_and_real_tool` - non-streaming mixed tools
- `test_transform_response_does_not_mutate_optional_params` - verifies the `.get()` fix
- `test_streaming_decoder_filters_json_tool_call` - streaming with mixed `json_tool_call` + real tool
- `test_streaming_decoder_without_json_mode_passes_all_tools` - streaming passthrough when `json_mode=False`
- `test_converse_transformation.py` tests pass
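The streaming behavior the tests above exercise can be sketched as follows. `filter_stream_chunks` and the chunk dict shape are hypothetical stand-ins for the patched `AWSEventStreamDecoder` behavior, not its real event types.

```python
from typing import Iterator


def filter_stream_chunks(chunks: Iterator[dict], json_mode: bool) -> Iterator[dict]:
    """Sketch: when json_mode is on, suppress `json_tool_call` chunks
    and re-emit their argument deltas as plain text; pass everything
    else (and everything, when json_mode is off) through unchanged."""
    for chunk in chunks:
        if json_mode and chunk.get("tool_name") == "json_tool_call":
            # Convert the internal tool's argument delta into text content.
            yield {"text": chunk.get("arguments_delta", "")}
        else:
            yield chunk
```

Consumers reading the stream then see real tool-call chunks plus ordinary text deltas, instead of a leaked internal `json_tool_call` tool.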