Skip to content

gen-ai: add judgment boundary attributes to evaluation result#3297

Closed
Nick-heo-eg wants to merge 3 commits intoopen-telemetry:mainfrom
Nick-heo-eg:genai-judgment-boundary
Closed

gen-ai: add judgment boundary attributes to evaluation result#3297
Nick-heo-eg wants to merge 3 commits intoopen-telemetry:mainfrom
Nick-heo-eg:genai-judgment-boundary

Conversation

@Nick-heo-eg
Copy link
Copy Markdown

Summary

This PR adds a small set of attributes to the existing gen_ai.evaluation.result event to improve traceability of decision boundaries
where multiple alternative outcomes were evaluated.

The design follows the direction discussed in open-telemetry/semantic-conventions-genai#72 and does not introduce new events.

Changes

  • Added 4 new attributes to model/gen-ai/registry.yaml:
    • gen_ai.evaluation.judgment.phase
    • gen_ai.evaluation.judgment.selected_path
    • gen_ai.evaluation.judgment.alternatives_evaluated
    • gen_ai.evaluation.judgment.human_in_loop
  • Connected these attributes to gen_ai.evaluation.result event in model/gen-ai/events.yaml

Motivation

Current GenAI traces capture execution outcomes but do not provide an explicit signal that alternative paths (e.g., allow vs block) were evaluated.

These attributes allow systems to demonstrate that such evaluations occurred, without exposing internal reasoning or policy logic.

Design Rationale

  • Extends existing event: Uses gen_ai.evaluation.result rather than creating a new event
  • Optional attributes: All attributes are opt-in, applying only when judgment boundaries exist
  • Minimal scope: Records only that alternatives were evaluated and which path was selected
  • No reasoning exposure: Does not capture policy logic, scoring details, or decision rationale

Note

Documentation files under docs/gen-ai/ are autogenerated from the registry and event schemas and are not edited directly in this PR.

Related Issue

@linux-foundation-easycla
Copy link
Copy Markdown

linux-foundation-easycla Bot commented Jan 17, 2026

CLA Signed

The committers listed above are authorized under a signed CLA.

This commit adds attributes to the gen_ai.evaluation.result event
to support traceability of decision boundaries where multiple
alternative outcomes were evaluated.

Implements discussion from open-telemetry/semantic-conventions#3244
@Nick-heo-eg Nick-heo-eg force-pushed the genai-judgment-boundary branch from 463cfb9 to 3459667 Compare January 17, 2026 08:33
…t intent

Adds two concrete JSON examples demonstrating judgment boundary attribute usage:
- Content safety pre-execution check with automatic decision
- Cost boundary evaluation with human escalation

Clarifies that judgment boundary attributes are intended for event-level
auditability and post-hoc inspection rather than high-cardinality metric
aggregation.
@Nick-heo-eg Nick-heo-eg force-pushed the genai-judgment-boundary branch from 187f313 to b5eb523 Compare January 19, 2026 11:08
@lmolkova lmolkova moved this from Untriaged to Blocked in Semantic Conventions Triage Jan 19, 2026
@lmolkova lmolkova moved this from Blocked to Awaiting codeowners approval in Semantic Conventions Triage Jan 19, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

Archived in project

Development

Successfully merging this pull request may close these issues.

2 participants