update 02 staging PR by Sameerlite · Pull Request #20337 · BerriAI/litellm

Sameerlite · 2026-02-03T11:37:54Z

Relevant issues

Pre-Submission checklist

Please complete all items before asking a LiteLLM maintainer to review your PR

I have Added testing in the tests/litellm/ directory, Adding at least 1 test is a hard requirement - see details
My PR passes all unit tests on make test-unit
My PR's scope is as isolated as possible, it only solves 1 specific problem

CI (LiteLLM team)

CI status guideline:

50-55 passing tests: main is stable with minor issues.

45-49 passing tests: acceptable but needs attention

<= 40 passing tests: unstable; be careful with your merges and assess the risk.

Branch creation CI run
Link:
CI run for the last commit
Link:
Merge / cherry-pick CI run
Links:

Type

🆕 New Feature
🐛 Bug Fix
🧹 Refactoring
📖 Documentation
🚄 Infrastructure
✅ Test

Changes

…drail [Fix] UI - Team Settings: Disable Global Guardrail Persistence

[Feature] Key reset_spend endpoint

…down [Feature] UI - SSO: Add Team Mappings

* init: SemanticMCPToolFilter * init: SemanticToolFilterHook * test_e2e_semantic_filter * mock tests: test_semantic_filter_basic_filtering * Update litellm/proxy/_experimental/mcp_server/semantic_tool_filter.py Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com> * refactor folder/file organization * docs fix * fix filter * fix: filter_tools * fix linting tool filrer * initialize_from_config * fix: _expand_mcp_tools * _initialize_semantic_tool_filter * working: async_post_call_response_headers_hook * clean up semantic tool filter * add _initialize_semantic_tool_filter * build_router_from_mcp_registry * _get_tools_by_names * fiix config * async_post_call_response_headers_hook --------- Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>

* init: SemanticMCPToolFilter * init: SemanticToolFilterHook * test_e2e_semantic_filter * mock tests: test_semantic_filter_basic_filtering * Update litellm/proxy/_experimental/mcp_server/semantic_tool_filter.py Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com> * refactor folder/file organization * docs fix * fix filter * fix: filter_tools * fix linting tool filrer * initialize_from_config * fix: _expand_mcp_tools * _initialize_semantic_tool_filter * working: async_post_call_response_headers_hook * clean up semantic tool filter * add _initialize_semantic_tool_filter * build_router_from_mcp_registry * _get_tools_by_names * fiix config * async_post_call_response_headers_hook * docs mcp filter * docs fix --------- Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>

…arch` with VertexAI/gemini models (#20280) * test_gemini_openai_web_search_tool_to_google_search * feat: Handle OpenAI style web search tools

Add documentation correctly for nova sonic

[Feature] UI - Default Team Settings: Migrate Default Team Settings to use Reusable Model Select

[Feature] UI - Navbar: Option to Hide Community Engagement Buttons

…19220)" This reverts commit 72e5193.

…mantic_tool_filter.py tests

This reverts commit 1e8848c.

Litellm tuesday cicd release

…icd_release Revert "Litellm tuesday cicd release"

…19220)" This reverts commit 72e5193.

…mantic_tool_filter.py tests

This reverts commit 1e8848c.

#20071)" This reverts commit ef73f33.

…inal Litellm tuesday cicd release final

bump litellm 1.81.7

vercel · 2026-02-03T11:38:04Z

The latest updates on your projects. Learn more about Vercel for GitHub.

Project	Deployment	Actions	Updated (UTC)
litellm	Ready	Preview, Comment	Feb 3, 2026 11:38am

greptile-apps · 2026-02-03T11:43:35Z

Greptile Overview

Greptile Summary

This staging PR combines several significant changes:

Major Changes:

MCP Semantic Tool Filter: New feature that semantically filters MCP tools before LLM calls using embeddings to reduce context window size. Includes pre-call hook, tool expansion logic, and response headers for filter stats.
Access Group Filtering Removal: Completely reverts the access group filtering feature (issue [Bug]: CRITICAL - Models loadbalanced outside of Model Access Group #18333) that prevented cross-team load balancing. Removes filter_deployments_by_access_groups() function, related metadata extraction, and all associated tests.
Database Migration Deletion: Removes the migration file that added indexes to LiteLLM_VerificationToken table, which could cause migration state issues in production.
Key Management: Adds new /key/{key}/reset_spend endpoint allowing proxy/team admins to reset key spend to lower values with validation.
Fallback Logic: Simplifies fallback exhaustion check by removing redundant model group length validation.
UI Refactoring: Extracts community engagement buttons (Slack/GitHub) into separate component.

Key Concerns:

The deleted migration file could break deployed instances that already ran it
Semantic filter initialization happens during startup (violates performance rule about avoiding expensive startup operations)
Removing access group filtering removes a security feature without explanation
Good test coverage for new features (700+ lines of tests for reset_spend endpoint, comprehensive MCP semantic filter tests)

Confidence Score: 3/5

This PR has some significant risks including migration deletion and feature removal that need careful review before merging
Score reflects: (1) deleted migration file that could break production deployments, (2) complete removal of access group filtering security feature without documented justification, (3) startup initialization that may cause performance issues per custom rules, (4) good test coverage for new features partially offsets concerns
Critical attention needed for migration file deletion (litellm-proxy-extras/litellm_proxy_extras/migrations/20260129103648_add_verificationtoken_indexes/migration.sql) and semantic filter startup initialization (litellm/proxy/proxy_server.py). Review removal of access group filtering (litellm/router_utils/common_utils.py, litellm/router.py)

Important Files Changed

Filename	Overview
litellm/router_utils/common_utils.py	Removed `filter_deployments_by_access_groups` function and its usage - reverting feature #18333
litellm/router.py	Removed access group filtering from router, removing cross-team load balancing protection
litellm/proxy/litellm_pre_call_utils.py	Removed access group metadata extraction for keys and teams from request processing
litellm/proxy/proxy_server.py	Added semantic tool filter initialization during proxy startup, creating Router object in startup path
litellm/proxy/hooks/mcp_semantic_filter/hook.py	New pre-call hook for semantic filtering of MCP tools using embeddings and semantic similarity
litellm/proxy/management_endpoints/key_management_endpoints.py	Added new `/key/{key}/reset_spend` endpoint with validation logic for resetting key spend to lower values
litellm/router_utils/fallback_event_handlers.py	Simplified fallback depth check by removing redundant model group length validation
litellm-proxy-extras/litellm_proxy_extras/migrations/20260129103648_add_verificationtoken_indexes/migration.sql	Deleted migration file that added database indexes - potentially breaking for deployed instances

Sequence Diagram

sequenceDiagram
    participant Client
    participant ProxyServer
    participant SemanticFilterHook
    participant SemanticMCPToolFilter
    participant Router as LiteLLM Router
    participant LLM as LLM Provider

    Note over ProxyServer: Startup Phase
    ProxyServer->>SemanticFilterHook: initialize_from_config()
    SemanticFilterHook->>SemanticMCPToolFilter: new(embedding_model, router, top_k)
    SemanticMCPToolFilter->>SemanticMCPToolFilter: build_router_from_mcp_registry()
    Note over SemanticMCPToolFilter: Fetches all MCP tools<br/>and builds semantic index
    SemanticFilterHook-->>ProxyServer: hook registered

    Note over Client,LLM: Request Phase
    Client->>ProxyServer: POST /chat/completions<br/>{messages, tools: [...]}
    ProxyServer->>SemanticFilterHook: async_pre_call_hook(data)
    
    alt MCP tools with litellm_proxy reference
        SemanticFilterHook->>SemanticFilterHook: _expand_mcp_tools()
        Note over SemanticFilterHook: Expands MCP references<br/>to full tool definitions
    end
    
    SemanticFilterHook->>SemanticFilterHook: extract_user_query(messages)
    SemanticFilterHook->>SemanticMCPToolFilter: filter_tools(query, tools)
    SemanticMCPToolFilter->>Router: aembedding(query)
    Router->>LLM: Generate embedding
    LLM-->>Router: embedding vector
    Router-->>SemanticMCPToolFilter: query embedding
    SemanticMCPToolFilter->>SemanticMCPToolFilter: Calculate similarity scores
    SemanticMCPToolFilter-->>SemanticFilterHook: filtered_tools (top_k)
    
    SemanticFilterHook->>SemanticFilterHook: Update data["tools"]<br/>Add filter stats to metadata
    SemanticFilterHook-->>ProxyServer: modified data
    
    ProxyServer->>LLM: completion(messages, filtered_tools)
    LLM-->>ProxyServer: response
    
    ProxyServer->>SemanticFilterHook: async_post_call_response_headers_hook()
    SemanticFilterHook-->>ProxyServer: headers with filter stats
    ProxyServer-->>Client: response + headers<br/>(x-litellm-semantic-filter)

greptile-apps

_{8 files reviewed, 2 comments}

_{Edit Code Review Agent Settings | Greptile}

greptile-apps · 2026-02-03T11:43:42Z

litellm/proxy/proxy_server.py

+    ## SEMANTIC TOOL FILTER ##
+    # Read litellm_settings from config for semantic filter initialization
+    try:
+        verbose_proxy_logger.debug("About to initialize semantic tool filter")
+        _config = proxy_config.get_config_state()
+        _litellm_settings = _config.get("litellm_settings", {})
+        verbose_proxy_logger.debug(f"litellm_settings keys = {list(_litellm_settings.keys())}")
+        await ProxyStartupEvent._initialize_semantic_tool_filter(
+            llm_router=llm_router,
+            litellm_settings=_litellm_settings,
+        )
+        verbose_proxy_logger.debug("After semantic tool filter initialization")
+    except Exception as e:
+        verbose_proxy_logger.error(f"Semantic filter init failed: {e}", exc_info=True)


The semantic filter initialization occurs during startup and calls build_router_from_mcp_registry() which may involve network calls to fetch tools from MCP servers. This violates the custom rule about avoiding expensive operations during startup. Consider lazy initialization on first use instead.

Context Used: Rule from dashboard - What: Avoid creating new database requests or Router objects in the critical request path.

Why: Cre... (source)

_{Note: If this suggestion doesn't match your team's coding style, reply to this and let me know. I'll remember it for next time!}

Prompt To Fix With AI

This is a comment left during a code review. Path: litellm/proxy/proxy_server.py Line: 796:809 Comment: The semantic filter initialization occurs during startup and calls `build_router_from_mcp_registry()` which may involve network calls to fetch tools from MCP servers. This violates the custom rule about avoiding expensive operations during startup. Consider lazy initialization on first use instead. **Context Used:** Rule from `dashboard` - What: Avoid creating new database requests or Router objects in the critical request path. Why: Cre... ([source](https://app.greptile.com/review/custom-context?memory=0c2a17ad-5f29-423f-a48b-371852ac4169)) <sub>Note: If this suggestion doesn't match your team's coding style, reply to this and let me know. I'll remember it for next time!</sub> How can I resolve this? If you propose a fix, please make it concise.

greptile-apps · 2026-02-03T11:43:44Z

Additional Comments (1)

litellm-proxy-extras/litellm_proxy_extras/migrations/20260129103648_add_verificationtoken_indexes/migration.sql
Deleting a migration file is dangerous for production deployments. If this migration was already applied to deployed databases, removing it from the codebase can cause migration state inconsistencies. Consider reverting the indexes with a new DOWN migration instead of deleting the file.

If these indexes were causing issues, the proper approach is:

Keep this migration file
Create a new migration that drops the indexes
Document why they were removed

Prompt To Fix With AI

This is a comment left during a code review.
Path: litellm-proxy-extras/litellm_proxy_extras/migrations/20260129103648_add_verificationtoken_indexes/migration.sql
Line: 1:8

Comment:
Deleting a migration file is dangerous for production deployments. If this migration was already applied to deployed databases, removing it from the codebase can cause migration state inconsistencies. Consider reverting the indexes with a new DOWN migration instead of deleting the file.

If these indexes were causing issues, the proper approach is:
1. Keep this migration file
2. Create a new migration that drops the indexes
3. Document why they were removed

How can I resolve this? If you propose a fix, please make it concise.

yuneng-jiang and others added 30 commits February 2, 2026 12:38

temp commit for branch switching

f1bca73

adding team mappings UI

899bafb

Adding tests

65c62ff

reset_spend endpoint

edfe239

team setting disable global guardrail fix

16f0b49

fixing tests

2645d25

Merge pull request #20307 from BerriAI/litellm_ui_disable_global_guar…

cd154c4

…drail [Fix] UI - Team Settings: Disable Global Guardrail Persistence

Merge pull request #20305 from BerriAI/litellm_reset_spend_endpoint

62993b5

[Feature] Key reset_spend endpoint

Merge pull request #20299 from BerriAI/litellm_ui_allowed_routes_drop…

8eba641

…down [Feature] UI - SSO: Add Team Mappings

option to hide community engagement buttons

32b1ff7

Add blog post: Achieving Sub-Millisecond Proxy Overhead (#20309)

1b9631d

Migrate Default Team settings to use reusable Model Select

cf734cb

fix linting

4e8c6d1

fix mypy lint

c8f9af1

Add documentation correctly for nova sonic

333419b

[Feat] /chat/completions - allow using OpenAI style tools for `web_se…

5cfcf67

…arch` with VertexAI/gemini models (#20280) * test_gemini_openai_web_search_tool_to_google_search * feat: Handle OpenAI style web search tools

Merge pull request #20320 from BerriAI/litellm_nova-sonic_doc

24a4979

Add documentation correctly for nova sonic

docs Tracing Tools

5aa8725

docs fix

7ae9804

Merge pull request #20310 from BerriAI/litellm_ui_def_team_settings

2984832

[Feature] UI - Default Team Settings: Migrate Default Team Settings to use Reusable Model Select

Merge pull request #20308 from BerriAI/litellm_ui_community_buttons

9202870

[Feature] UI - Navbar: Option to Hide Community Engagement Buttons

Revert "fix: models loadbalancing billing issue by filter (#18891) (#…

7dd0248

…19220)" This reverts commit 72e5193.

Fix litellm/tests/test_litellm/proxy/_experimental/mcp_server/test_se…

86ae627

…mantic_tool_filter.py tests

Fix code quality tests

b379fb6

Revert "add missing indexes on VerificationToken table (#20040)"

ecb6413

This reverts commit 1e8848c.

Merge pull request #20328 from BerriAI/litellm_tuesday_cicd_release

23f662e

Litellm tuesday cicd release

Revert "Litellm tuesday cicd release"

1b1854b

Merge pull request #20330 from BerriAI/revert-20328-litellm_tuesday_c…

80acd4c

…icd_release Revert "Litellm tuesday cicd release"

Sameerlite added 9 commits February 3, 2026 15:00

Revert "fix: models loadbalancing billing issue by filter (#18891) (#…

eb8f4d3

…19220)" This reverts commit 72e5193.

Fix litellm/tests/test_litellm/proxy/_experimental/mcp_server/test_se…

9a6bafe

…mantic_tool_filter.py tests

Fix code quality tests

017b78d

Revert "add missing indexes on VerificationToken table (#20040)"

fae0554

This reverts commit 1e8848c.

Revert "fix: prevent error when max_fallbacks exceeds available models (

31cdffd

#20071)" This reverts commit ef73f33.

Fix litellm_security_tests

21e95c7

Merge pull request #20333 from BerriAI/litellm_tuesday_cicd_release_f…

793a7fd

…inal Litellm tuesday cicd release final

bump litellm 1.81.7

47c5366

Merge pull request #20336 from BerriAI/litellm_bump_version_1.81.7

070d501

bump litellm 1.81.7

Sameerlite merged commit b7f0d05 into litellm_oss_staging_02_03_2026 Feb 3, 2026
58 of 67 checks passed

greptile-apps bot reviewed Feb 3, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

update 02 staging PR#20337

update 02 staging PR#20337
Sameerlite merged 39 commits intolitellm_oss_staging_02_03_2026from
main

Sameerlite commented Feb 3, 2026

Uh oh!

Uh oh!

vercel bot commented Feb 3, 2026

Uh oh!

greptile-apps bot commented Feb 3, 2026

Important Files Changed

Uh oh!

greptile-apps bot left a comment

Uh oh!

greptile-apps bot Feb 3, 2026

Uh oh!

greptile-apps bot commented Feb 3, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

Conversation

Sameerlite commented Feb 3, 2026

Relevant issues

Pre-Submission checklist

CI (LiteLLM team)

Type

Changes

Uh oh!

Uh oh!

vercel bot commented Feb 3, 2026

Uh oh!

greptile-apps bot commented Feb 3, 2026

Greptile Overview

Greptile Summary

Confidence Score: 3/5

Important Files Changed

Sequence Diagram

Uh oh!

greptile-apps bot left a comment

Choose a reason for hiding this comment

Uh oh!

greptile-apps bot Feb 3, 2026

Choose a reason for hiding this comment

Uh oh!

greptile-apps bot commented Feb 3, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants