genai: define cached tokens attributes #3163
joaopgrassi merged 10 commits into open-telemetry:main
Conversation
@lmolkova are there any concerns here or do you think this can be merged? Would be great to get these added to the spec

@eddyerburgh see my review comment, that's a required change
force-pushed from c959bb6 to c2e4498
This PR has been labeled as stale due to lack of activity. It will be automatically closed if there is no further activity over the next 7 days.
@lmolkova could you have a look at this one before it gets automatically closed 🙏
There are merge conflicts now
force-pushed from c2e4498 to b86579e
Co-authored-by: Liudmila Molkova <neskazu@gmail.com>
force-pushed from fc9db01 to 1c01bce
@verdie-g thanks for the contribution, everything looks great. I think you need to regenerate markdown tables and it'd be good to go!
note: Add cache token attributes and provider-specific normalization guidance for GenAI usage metrics
issues: [1959]
subtext: |
  - Add `gen_ai.usage.cache_read.input_tokens` attribute for tokens served from provider cache
Should these be recorded in metrics as well? Assuming yes, what should the gen_ai.token.type be?
@stephentoub not yet, there is no metric definition for it (no type defined). There are probably some design flaws we'll need to fix to make it happen. Created https://github.com/open-telemetry/semantic-conventions/issues/3341 so we don't forget
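To make the gap concrete, here is a minimal sketch (not the semconv definition; the `Usage` shape and function name are assumptions) of how a client might build `gen_ai.client.token.usage` data points today: `gen_ai.token.type` only distinguishes `input` and `output`, so cached tokens have no defined type to be recorded under.

```typescript
type Usage = {
  inputTokens: number;
  outputTokens: number;
  cacheReadInputTokens?: number; // no gen_ai.token.type value defined for this yet
};

type DataPoint = { value: number; attributes: Record<string, string> };

// Build data points for the token-usage metric. Cached tokens are
// deliberately not emitted: there is no defined gen_ai.token.type for them,
// which is the design question tracked in issue #3341.
function toTokenUsagePoints(usage: Usage): DataPoint[] {
  return [
    { value: usage.inputTokens, attributes: { "gen_ai.token.type": "input" } },
    { value: usage.outputTokens, attributes: { "gen_ai.token.type": "output" } },
  ];
}
```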
* feat: support gen_ai.input/output.messages and parts pattern in OTEL traces
Map gen_ai.input.messages and gen_ai.output.messages (OTEL GenAI semantic
conventions) to LangWatch input/output fields. Previously these ended up
in params with input: null and output: null.
- Extract gen_ai.input/output.messages directly as chat_messages without
Zod validation to avoid stripping provider-specific content blocks
(Anthropic tool_use/tool_result, pi-ai parts, etc.)
- Support the "parts" pattern (Vercel AI SDK / pi-ai) natively in
ChatMessage type alongside "content"
- Handle content blocks with {type:"text", content:"..."} in addition
to {type:"text", text:"..."} for text extraction
- Widen LLM type detection to include CLIENT span kind (OTEL GenAI spec)
- Increase params truncation from 32KB to 128KB
- Extract system instructions from pre-existing gen_ai.input.messages
in ClickHouse pipeline
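The text-extraction rule described above can be sketched as follows; the `ChatMessage` shape and `extractText` name are illustrative assumptions, not the project's actual types. The point is tolerating both `{type:"text", text:"..."}` and `{type:"text", content:"..."}` blocks, plus a `parts` array alongside `content`.

```typescript
type TextBlock = { type: string; text?: string; content?: string };

type ChatMessage = {
  role: string;
  content?: string | TextBlock[];
  parts?: TextBlock[]; // "parts" pattern (Vercel AI SDK / pi-ai)
};

// Collect the plain text of a message, accepting string content, content
// blocks keyed by either "text" or "content", and the "parts" pattern.
function extractText(message: ChatMessage): string {
  const blocks: TextBlock[] =
    typeof message.content === "string"
      ? [{ type: "text", text: message.content }]
      : message.content ?? message.parts ?? [];
  return blocks
    .filter((b) => b.type === "text")
    .map((b) => b.text ?? b.content ?? "")
    .join("");
}
```

Non-text blocks (e.g. Anthropic `tool_use`/`tool_result`) pass through the filter untouched, which is why skipping strict schema validation matters here.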
* feat: extract cache and reasoning tokens from OTEL GenAI semantic conventions
Add support for extracting cache_read_input_tokens, cache_creation_input_tokens,
and reasoning_tokens from OTEL span attributes in both the Elasticsearch
ingestion path (otel.traces.ts) and the ClickHouse-to-ES mapper (span.mapper.ts).
- Extract `gen_ai.usage.cache_read.input_tokens` and
`gen_ai.usage.cache_creation.input_tokens` per official OTEL GenAI semconv
(open-telemetry/semantic-conventions#3163)
- Extract `gen_ai.usage.reasoning_tokens` (Traceloop/OpenLLMetry convention)
- Aggregate cache and reasoning tokens at trace level in metrics computation
- Add Elasticsearch schema fields and migration for the new token metrics
- Clean up redundant `gen_ai.usage.total_tokens` from params
- Remove leftover debug console.logs from otel.metrics.ts
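A rough sketch of the span-attribute extraction this commit describes; the attribute keys match the ones named above, but the `TokenMetrics` shape and function name are illustrative, not the project's real code.

```typescript
type SpanAttributes = Record<string, unknown>;

interface TokenMetrics {
  cacheReadInputTokens?: number;
  cacheCreationInputTokens?: number;
  reasoningTokens?: number;
}

// Accept only finite numeric attribute values; anything else is dropped.
function asCount(v: unknown): number | undefined {
  return typeof v === "number" && Number.isFinite(v) ? v : undefined;
}

function extractTokenMetrics(attrs: SpanAttributes): TokenMetrics {
  return {
    // Official OTEL GenAI semconv keys (open-telemetry/semantic-conventions#3163)
    cacheReadInputTokens: asCount(attrs["gen_ai.usage.cache_read.input_tokens"]),
    cacheCreationInputTokens: asCount(attrs["gen_ai.usage.cache_creation.input_tokens"]),
    // Traceloop/OpenLLMetry convention, not part of the OTEL semconv
    reasoningTokens: asCount(attrs["gen_ai.usage.reasoning_tokens"]),
  };
}
```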
* fix: add missing monitors.getById mock in OnlineEvaluationDrawer test helpers
The useContext mock was missing monitors.getById.invalidate, causing
three unhandled TypeError exceptions during update mutation tests.
* fix: use long type for cache token fields in ES schema and migration
The field was already mapped as long in ES (from OTEL intValue ingestion),
so the migration must use long to avoid illegal_argument_exception.
* feat: stop deleting token attrs from params and display cache tokens in drawer
- Stop deleting gen_ai.usage.* token fields from span params so they
remain visible in the trace details
- Display cache_read and cache_creation tokens in the SpanDetails drawer
alongside prompt/completion/reasoning tokens
This PR contains changes to area(s) that do not have an active SIG/project and will be auto-closed. Such changes may be rejected or put on hold until a new SIG/project is established. Please refer to the Semantic Convention Areas.
Fixes open-telemetry/semantic-conventions-genai#23
Changes
This PR adds two new GenAI attributes to represent provider-level prompt caching:
- gen_ai.usage.cache_read_input_tokens
- gen_ai.usage.cache_creation_input_tokens

It also updates the description of gen_ai.usage.input_tokens to state that it must include cached tokens. OpenAI and Vertex AI already count cached tokens in input_tokens, while Anthropic excludes them.
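A hedged sketch of the normalization the updated description implies for Anthropic-style usage payloads, whose input_tokens excludes cached tokens; the interface and function name are assumptions for illustration, not part of the spec.

```typescript
interface AnthropicStyleUsage {
  input_tokens: number; // excludes cached tokens in Anthropic responses
  cache_read_input_tokens?: number;
  cache_creation_input_tokens?: number;
}

// Value to record as gen_ai.usage.input_tokens, which per the updated
// description must include cached tokens. OpenAI / Vertex AI usage already
// counts them, so only Anthropic-style payloads need this summation.
function normalizedInputTokens(usage: AnthropicStyleUsage): number {
  return (
    usage.input_tokens +
    (usage.cache_read_input_tokens ?? 0) +
    (usage.cache_creation_input_tokens ?? 0)
  );
}
```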
Merge requirement checklist
[chore]