feat: include cache creation/read tokens for AWS Bedrock explicit caching by yuzisun · Pull Request #1721 · envoyproxy/ai-gateway

yuzisun · 2026-01-05T02:49:20Z

Description
Include cache creation and cache hit tokens to total input tokens as well as keep separate fields for cache miss/hit accounting. This is to unify the usage response to user for both implicit and explicit cache as the input tokens for gpt and gemini include the cache tokens.

Signed-off-by: Dan Sun <dsun20@bloomberg.net>

codecov-commenter · 2026-01-05T02:55:20Z

Codecov Report

❌ Patch coverage is 98.11321% with 1 line in your changes missing coverage. Please review.
✅ Project coverage is 81.10%. Comparing base (488e668) to head (89ac98c).

Files with missing lines	Patch %	Lines
internal/translator/openai_awsbedrock.go	95.45%	1 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #1721      +/-   ##
==========================================
+ Coverage   81.05%   81.10%   +0.04%     
==========================================
  Files         147      147              
  Lines       13319    13327       +8     
==========================================
+ Hits        10796    10809      +13     
+ Misses       1873     1869       -4     
+ Partials      650      649       -1

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

Signed-off-by: Dan Sun <dsun20@bloomberg.net>

…hing (#1721) **Description** Include cache creation and cache hit tokens to total input tokens as well as keep separate fields for cache miss/hit accounting. This is to unify the usage response to user for both implicit and explicit cache as the input tokens for gpt and gemini include the cache tokens. --------- Signed-off-by: Dan Sun <dsun20@bloomberg.net>

…hing (envoyproxy#1721) **Description** Include cache creation and cache hit tokens to total input tokens as well as keep separate fields for cache miss/hit accounting. This is to unify the usage response to user for both implicit and explicit cache as the input tokens for gpt and gemini include the cache tokens. --------- Signed-off-by: Dan Sun <dsun20@bloomberg.net> Signed-off-by: yxia216 <yxia216@bloomberg.net>

…hing (envoyproxy#1721) **Description** Include cache creation and cache hit tokens to total input tokens as well as keep separate fields for cache miss/hit accounting. This is to unify the usage response to user for both implicit and explicit cache as the input tokens for gpt and gemini include the cache tokens. --------- Signed-off-by: Dan Sun <dsun20@bloomberg.net>

yuzisun added 4 commits January 4, 2026 18:08

include cached token in input tokens for AWS Bedrock

618ca82

Signed-off-by: Dan Sun <dsun20@bloomberg.net>

fix tests

9794b3d

Signed-off-by: Dan Sun <dsun20@bloomberg.net>

fix: add missing SetCacheCreationInputTokens call in metrics extraction

cc7fd19

Signed-off-by: Dan Sun <dsun20@bloomberg.net>

fix cache token tests

dc664d4

Signed-off-by: Dan Sun <dsun20@bloomberg.net>

yuzisun requested a review from a team as a code owner January 5, 2026 02:49

dosubot bot added the size:L This PR changes 100-499 lines, ignoring generated files. label Jan 5, 2026

yuzisun added 2 commits January 4, 2026 22:03

fix lint issues

af6c732

Signed-off-by: Dan Sun <dsun20@bloomberg.net>

remove redis

92a310b

Signed-off-by: Dan Sun <dsun20@bloomberg.net>

mathetake approved these changes Jan 5, 2026

View reviewed changes

mathetake enabled auto-merge (squash) January 5, 2026 19:12

Merge branch 'main' into cache_point

89ac98c

mathetake merged commit bcf4cdf into envoyproxy:main Jan 5, 2026
32 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: include cache creation/read tokens for AWS Bedrock explicit caching#1721

feat: include cache creation/read tokens for AWS Bedrock explicit caching#1721
mathetake merged 7 commits intoenvoyproxy:mainfrom
yuzisun:cache_point

yuzisun commented Jan 5, 2026

Uh oh!

codecov-commenter commented Jan 5, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

yuzisun commented Jan 5, 2026

Uh oh!

codecov-commenter commented Jan 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

codecov-commenter commented Jan 5, 2026 •

edited

Loading