
"Fix: Subtract cached tokens from batch cost calculation"#19704

Open
priyam-that wants to merge 1 commit into BerriAI:main from priyam-that:fix/cached-token-batch-cost-calculation

Conversation

@priyam-that
Contributor

  • Fixes issue Cost calculation incorrectly charges for cached tokens #19680 where cached tokens were being charged in batch operations
  • Handles both cache_read_input_tokens (Anthropic/OpenAI) and prompt_tokens_details.cached_tokens (z.ai/Bedrock/Gemini) formats
  • Applies fix to both input_cost_per_token_batches and input_cost_per_token paths
  • Prevents overcharging users by 10x+ on requests with high cache hit rates

Relevant issues

Pre-Submission checklist

Please complete all items before asking a LiteLLM maintainer to review your PR

  • I have added testing in the tests/litellm/ directory. Adding at least 1 test is a hard requirement - see details
  • My PR passes all unit tests on make test-unit
  • My PR's scope is as isolated as possible, it only solves 1 specific problem

CI (LiteLLM team)

CI status guideline:

  • 50-55 passing tests: main is stable with minor issues.
  • 45-49 passing tests: acceptable but needs attention.
  • <= 40 passing tests: unstable; be careful with your merges and assess the risk.
  • Branch creation CI run
    Link:

  • CI run for the last commit
    Link:

  • Merge / cherry-pick CI run
    Links:

Type

🆕 New Feature
🐛 Bug Fix
🧹 Refactoring
📖 Documentation
🚄 Infrastructure
✅ Test

Changes

- Fixes issue BerriAI#19680 where cached tokens were being charged in batch operations
- Handles both cache_read_input_tokens (Anthropic/OpenAI) and prompt_tokens_details.cached_tokens (z.ai/Bedrock/Gemini) formats
- Applies fix to both input_cost_per_token_batches and input_cost_per_token paths
- Prevents overcharging users by 10x+ on requests with high cache hit rates
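The fix described above can be sketched roughly as follows. This is a hedged illustration, not LiteLLM's actual implementation: the helper names (`get_cached_tokens`, `batch_input_cost`) and the flat `usage` dict are hypothetical, but the two provider formats it checks (`cache_read_input_tokens` and `prompt_tokens_details.cached_tokens`) are the ones the PR names.

```python
# Hypothetical sketch of the cached-token subtraction described in this PR.
# Helper names and the dict-shaped usage payload are illustrative only.

def get_cached_tokens(usage: dict) -> int:
    """Return cached prompt tokens, checking both provider formats."""
    # Anthropic/OpenAI-style top-level field
    cached = usage.get("cache_read_input_tokens")
    if cached is None:
        # z.ai/Bedrock/Gemini-style nested field
        details = usage.get("prompt_tokens_details") or {}
        cached = details.get("cached_tokens")
    return cached or 0


def batch_input_cost(
    usage: dict,
    cost_per_token: float,
    cost_per_cached_token: float = 0.0,
) -> float:
    """Bill non-cached tokens at the full input rate and cached
    tokens at the (cheaper) cache-read rate, instead of charging
    every prompt token at the full rate."""
    prompt_tokens = usage.get("prompt_tokens", 0)
    cached = min(get_cached_tokens(usage), prompt_tokens)
    non_cached = prompt_tokens - cached
    return non_cached * cost_per_token + cached * cost_per_cached_token
```

With a 90% cache hit rate (e.g. `{"prompt_tokens": 1000, "cache_read_input_tokens": 900}` at a rate of 1e-5 per token and a free cache-read rate), this bills 100 tokens instead of 1000, matching the "10x+ overcharge" figure in the description.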
@vercel

vercel bot commented Jan 24, 2026

The latest updates on your projects. Learn more about Vercel for GitHub.

Project: litellm · Deployment: Ready · Review: Preview, Comment · Updated (UTC): Jan 24, 2026 6:08pm


@CLAassistant

CLA assistant check
Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you sign our Contributor License Agreement before we can accept your contribution.
You have signed the CLA already but the status is still pending? Let us recheck it.

