Main by towry · Pull Request #5 · towry/litellm

towry · 2026-03-16T15:34:32Z

Relevant issues

Pre-Submission checklist

Please complete all items before asking a LiteLLM maintainer to review your PR

I have Added testing in the tests/test_litellm/ directory, Adding at least 1 test is a hard requirement - see details
My PR passes all unit tests on make test-unit
My PR's scope is as isolated as possible, it only solves 1 specific problem
I have requested a Greptile review by commenting @greptileai and received a Confidence Score of at least 4/5 before requesting a maintainer review

Delays in PR merge?

If you're seeing a delay in your PR being merged, ping the LiteLLM Team on Slack (#pr-review).

CI (LiteLLM team)

CI status guideline:

50-55 passing tests: main is stable with minor issues.

45-49 passing tests: acceptable but needs attention

<= 40 passing tests: unstable; be careful with your merges and assess the risk.

Branch creation CI run
Link:
CI run for the last commit
Link:
Merge / cherry-pick CI run
Links:

Type

🆕 New Feature
🐛 Bug Fix
🧹 Refactoring
📖 Documentation
🚄 Infrastructure
✅ Test

Changes

Ensure final finish_reason chunks retain non-OpenAI attributes from original provider chunks, including the holding_chunk flush path where delta is non-empty. Add regression tests for both final-chunk branches. Made-with: Cursor

Made-with: Cursor

Previously, stream_chunk_builder only took annotations from the first chunk that contained them, losing any annotations from later chunks. This is a problem because providers like Gemini/Vertex AI send grounding metadata (converted to annotations) in the final streaming chunk, while other providers may spread annotations across multiple chunks. Changes: - Collect and merge annotations from ALL annotation-bearing chunks instead of only using the first one

) * added the header mapping feature * added tests * final cleanup * final cleanup * added missing test and logic * fixed header sending bug * Update litellm/proxy/auth/auth_utils.py Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com> * added back init file in responses + fixed test_auth_utils.py int local_testing --------- Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>

Support missing base64 padding in managed character/video IDs so copied encoded IDs still decode to the original upstream character ID. Made-with: Cursor

Use typed character response models and video multipart helpers so /videos/characters forwards uploaded MP4 files with video/* content type. Made-with: Cursor

Avoid the temporary Any alias and use a concrete FileTypes import compatible with type checks. Made-with: Cursor

…atch Fix: Vertex ai Batch Output File Download Fails with 500

docs(blog): add WebRTC blog post link

…header_order Refactor: Filtering beta header after transformation

…al-streaming-attributes fix(streaming): preserve custom attributes on final stream chunk

… issues - Remove duplicate DecodedCharacterId TypedDict from litellm/types/videos/main.py - Remove dead LITELLM_MANAGED_VIDEO_CHARACTER_COMPLETE_STR constant from litellm/types/utils.py - Add FastAPI Form validation for name field in video_create_character endpoint Made-with: Cursor

…n video handlers Add response.raise_for_status() before transform_*_response() calls in all eight video character/edit/extension handler methods (sync and async): - video_create_character_handler / async_video_create_character_handler - video_get_character_handler / async_video_get_character_handler - video_edit_handler / async_video_edit_handler - video_extension_handler / async_video_extension_handler Without these checks, httpx does not raise on 4xx/5xx responses, so provider errors (e.g., 401 Unauthorized) pass directly to Pydantic model constructors, causing ValidationError instead of meaningful HTTP errors. The raise_for_status() ensures the exception handler receives proper HTTPStatusError for translation into actionable messages. Made-with: Cursor

…r in router-first routing Add avideo_create_character and avideo_get_character to the list of video endpoints that use router-first routing when a model is provided (either from decoded IDs or target_model_names). Previously only avideo_edit and avideo_extension were in the router-first block. This ensures both character endpoints benefit from multi-deployment load balancing and model resolution, making them consistent with the other video operations. This allows: - avideo_create_character: Router picks among multiple deployments when target_model_names is set - avideo_get_character: Router assists with multi-model environments for consistency Made-with: Cursor

- Clear examples for SDK and proxy usage - Feature highlights: router support, encoding, error handling - Best practices for character uploads and prompting - Available from LiteLLM v1.83.0+ - Troubleshooting guide for common issues Made-with: Cursor

- Add curl examples for avideo_edit and avideo_extension APIs - Explain how LiteLLM encodes/decodes managed character IDs - Show metadata included in character IDs (provider, model_id) - Detail transparent router-first routing benefits Made-with: Cursor

…tion test Add avideo_create_character, avideo_get_character, avideo_edit, and avideo_extension to the skip condition since Azure video calls don't use initialize_azure_sdk_client. Tests now properly skip with expected behavior instead of failing: - test_ensure_initialize_azure_sdk_client_always_used[avideo_create_character] ✓ - test_ensure_initialize_azure_sdk_client_always_used[avideo_get_character] ✓ - test_ensure_initialize_azure_sdk_client_always_used[avideo_edit] ✓ - test_ensure_initialize_azure_sdk_client_always_used[avideo_extension] ✓ Made-with: Cursor

@AbstractMethod

…sion methods Convert all 8 new video methods from @AbstractMethod to concrete implementations that raise NotImplementedError. This prevents breaking external third-party BaseVideoConfig subclasses at import time. Methods affected: - transform_video_create_character_request/response - transform_video_get_character_request/response - transform_video_edit_request/response - transform_video_extension_request/response External integrators can now upgrade without instantiation errors; NotImplementedError is only raised when operations are actually called on unsupported providers. This restores backward compatibility with the project's policy. Made-with: Cursor

…r-endpoint-fixes [Feat] Add create character endpoints and other new videos Endpoints

…14_2026 Litellm oss staging 03 14 2026

This reverts commit 1864fa0.

docs: Learn page updates, card links, integrations, sidebar changes

Sameerlite and others added 30 commits March 13, 2026 13:06

docs(blog): add WebRTC blog post link

8f769ef

Made-with: Cursor

Refactor: Filtering beta header after transformation

1a8f8c6

Fix downloading vertex ai files

22b333c

fix(video): decode managed character ids robustly

61519d6

Support missing base64 padding in managed character/video IDs so copied encoded IDs still decode to the original upstream character ID. Made-with: Cursor

fix(video): enforce character endpoint video MIME handling

4a7ef7b

Use typed character response models and video multipart helpers so /videos/characters forwards uploaded MP4 files with video/* content type. Made-with: Cursor

fix(types): use direct FileTypes import in video schemas

94405b6

Avoid the temporary Any alias and use a concrete FileTypes import compatible with type checks. Made-with: Cursor

Add new videos endpoints

79c787b

Add new videos endpoints

c338892

Add new videos endpoints routing and init

8dab5de

Add new videos transformation

14a691f

Add new videos docs

430f3ac

Merge branch 'main' into litellm_create-character-endpoint-fixes

9beec82

Merge pull request BerriAI#23718 from BerriAI/litellm_fix_vertex_ai_b…

ab377f3

…atch Fix: Vertex ai Batch Output File Download Fails with 500

Merge pull request BerriAI#23547 from Sameerlite/litellm_blog-webrtc

10d5475

docs(blog): add WebRTC blog post link

Merge pull request BerriAI#23715 from BerriAI/litellm_anthropic_beta_…

0bbdd2a

…header_order Refactor: Filtering beta header after transformation

Merge pull request BerriAI#23530 from Sameerlite/litellm_preserve-fin…

b796ee9

…al-streaming-attributes fix(streaming): preserve custom attributes on final stream chunk

Fix docs

32842a5

Fix docs

1255382

Merge pull request BerriAI#23737 from BerriAI/litellm_create-characte…

71dfd01

…r-endpoint-fixes [Feat] Add create character endpoints and other new videos Endpoints

Merge pull request BerriAI#23686 from BerriAI/litellm_oss_staging_03_…

3dccdde

…14_2026 Litellm oss staging 03 14 2026

towry merged commit 1864fa0 into deploy Mar 16, 2026
1 check failed

towry deleted the main branch March 16, 2026 15:35

towry restored the main branch March 16, 2026 15:36

towry added a commit that referenced this pull request Mar 16, 2026

Revert "Main (#5)"

5bc7cdb

This reverts commit 1864fa0.

towry mentioned this pull request Mar 16, 2026

Revert "Main" #6

Closed

towry pushed a commit that referenced this pull request Mar 20, 2026

Merge pull request #5 from Astrodevil/v0-docs

76c8b38

docs: Learn page updates, card links, integrations, sidebar changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Main#5

Main#5
towry merged 30 commits intodeployfrom
main

towry commented Mar 16, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

towry commented Mar 16, 2026

Relevant issues

Pre-Submission checklist

Delays in PR merge?

CI (LiteLLM team)

Type

Changes

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants