[processor/tailsampling] Initial support for early decisions by csmarchbanks · Pull Request #44456 · open-telemetry/opentelemetry-collector-contrib

csmarchbanks · 2025-11-21T15:46:40Z

Description

This change adds a new optional interface that samplers can implement, EarlyEvaluator. If a sampler implements EarlyEvaluator and the EarlyDecisions configuration is turned on, then for each batch of traces received TSP checks whether any spans in that batch cause the trace to be sampled. If so, TSP sends along all the trace data it has and removes it from memory. It is also possible that all policies return NotSampled, at which point TSP drops the data.

In this initial implementation, drop policies are not supported. The deprecated top-level invert_match policies are also not supported and will run into issues if they are still in use.
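
To make the flow above concrete, here is a minimal sketch of how an optional interface can be probed with a type assertion and how the per-batch results might be combined. It is not the PR's code: `Span`, `errorPolicy`, `waitPolicy`, and `earlyDecision` are hypothetical names, and the `EarlyEvaluate` signature is simplified to a slice of spans.

```go
package main

import "fmt"

// Decision is a simplified stand-in for the processor's decision type.
type Decision int

const (
	Unspecified Decision = iota // no early verdict; wait for the normal decision
	Sampled
	NotSampled
)

// Span is a hypothetical minimal span used only in this sketch.
type Span struct {
	Name    string
	IsError bool
}

// Evaluator is a simplified version of the regular policy interface.
type Evaluator interface {
	Evaluate(all []Span) Decision
}

// EarlyEvaluator is the optional interface (simplified signature).
type EarlyEvaluator interface {
	EarlyEvaluate(batch []Span) Decision
}

// errorPolicy can decide early: one error span is enough to sample.
type errorPolicy struct{}

func (errorPolicy) Evaluate(all []Span) Decision {
	for _, s := range all {
		if s.IsError {
			return Sampled
		}
	}
	return NotSampled
}

func (errorPolicy) EarlyEvaluate(batch []Span) Decision {
	for _, s := range batch {
		if s.IsError {
			return Sampled
		}
	}
	return Unspecified // an error span may still arrive later
}

// waitPolicy does not implement EarlyEvaluator.
type waitPolicy struct{}

func (waitPolicy) Evaluate([]Span) Decision { return NotSampled }

// earlyDecision returns Sampled if any policy samples early, NotSampled only
// if every policy returned NotSampled early, and Unspecified otherwise.
func earlyDecision(policies []Evaluator, batch []Span) Decision {
	allNotSampled := len(policies) > 0
	for _, p := range policies {
		early, ok := p.(EarlyEvaluator)
		if !ok {
			allNotSampled = false // this policy must wait for the full trace
			continue
		}
		switch early.EarlyEvaluate(batch) {
		case Sampled:
			return Sampled
		case NotSampled:
			// keep checking the remaining policies
		default:
			allNotSampled = false
		}
	}
	if allNotSampled {
		return NotSampled
	}
	return Unspecified
}

func main() {
	policies := []Evaluator{errorPolicy{}, waitPolicy{}}
	fmt.Println(earlyDecision(policies, []Span{{Name: "db", IsError: true}}) == Sampled)
	fmt.Println(earlyDecision(policies, []Span{{Name: "db"}}) == Unspecified)
}
```

In this sketch a policy without early support simply prevents an early NotSampled, since it may still sample once the full trace is available.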

Link to tracking issue

Part of #43876

Testing

I have added some tests and also ran this in some collectors in our infrastructure.

Documentation

TODO: I need to add more documentation than is currently present.

yvrhdn

I like the new interface! I think it's a good balance between giving policies the opportunity to decide fast without making it too heavy. Left a few minor comments.

yvrhdn · 2025-11-21T20:45:05Z

+
+		decision, err := earlyEval.EarlyEvaluate(tsp.ctx, id, currentSpans, trace)
+		if err != nil {
+			tsp.telemetry.ProcessorTailSamplingSamplingPolicyEvaluationError.Add(tsp.ctx, 1)


minor: might be nice to add an attribute indicating where the error happened? Before this PR errors could only happen in samplingPolicyOnTick.

Logiraptor

Sending a partial review while sitting in a hotel 😅, don't block on me while I'm OOO!

Logiraptor · 2025-11-22T17:42:25Z

+	// time this function is called. It is included for implementations that
+	// wait for a trigger, such as the parent span being received, before
+	// looking across all the received spans.
+	EarlyEvaluate(ctx context.Context, traceID pcommon.TraceID, newData ptrace.ResourceSpans, allData *TraceData) (Decision, error)


Am I understanding correctly that allData actually includes newData? I'm looking at how the incoming spans are appended before checking early decisions in processTrace. I wonder if there's a smaller interface we could require here that doesn't have this duplication or the need to warn about iterating ReceivedBatches?

csmarchbanks · 2025-11-25

Yeah, this is a tradeoff I made intentionally, but I am open to discussion. I wanted to support some sort of trigger span inside newData that then causes the early decision code to look over all trace data. A concrete example: when the root span is received, it may be desirable to look over the trace as a whole, instead of just the batch, to see how spans are connected.

It also allows an extension to mutate the data if desired, similar to adding the sampling policy attribute (which we already do in some of our extensions).

As far as duplication goes, newData is currently always the last resource span entry in allData, but that feels like too much of an implementation detail to expose to users.
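
The trigger-span idea can be sketched as follows. This is an illustration under simplified assumptions, not the PR's implementation: `Span`, `TraceData`, and the "sample once the trace touches enough services" rule are all hypothetical, and the real method takes `ptrace.ResourceSpans` rather than plain slices.

```go
package main

import "fmt"

// Decision is a simplified stand-in for the processor's decision type.
type Decision int

const (
	Unspecified Decision = iota
	Sampled
	NotSampled
)

// Span is a hypothetical minimal span; an empty ParentID marks a root span.
type Span struct {
	SpanID   string
	ParentID string
	Service  string
}

// TraceData is a hypothetical stand-in holding every batch received so far.
type TraceData struct {
	ReceivedBatches [][]Span
}

// earlyEvaluate inspects only newData on most calls. It walks allData once,
// when a root span shows up in the new batch.
func earlyEvaluate(newData []Span, allData *TraceData, minServices int) Decision {
	rootSeen := false
	for _, s := range newData {
		if s.ParentID == "" {
			rootSeen = true
			break
		}
	}
	if !rootSeen {
		return Unspecified // cheap path: no trigger, do not touch allData
	}

	// Trigger fired: look across the whole trace.
	services := map[string]struct{}{}
	for _, batch := range allData.ReceivedBatches {
		for _, s := range batch {
			services[s.Service] = struct{}{}
		}
	}
	if len(services) >= minServices {
		return Sampled
	}
	return Unspecified
}

func main() {
	child := []Span{{SpanID: "b", ParentID: "a", Service: "db"}}
	root := []Span{{SpanID: "a", Service: "api"}}
	all := &TraceData{ReceivedBatches: [][]Span{child}}

	fmt.Println(earlyEvaluate(child, all, 2) == Unspecified)

	// Mirrors the behavior discussed above: the new batch is appended
	// to allData before the early evaluation runs.
	all.ReceivedBatches = append(all.ReceivedBatches, root)
	fmt.Println(earlyEvaluate(root, all, 2) == Sampled)
}
```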

Logiraptor · 2025-11-22T18:43:56Z

+
+	// We do not support early evaluation when drop policies are present yet.
+	if len(dropPolicies) > 0 {
+		earlyEvaluationPossible = false


I'd recommend some monitoring here (and elsewhere) so we can clearly see why early decisions aren't possible across the cluster. That should help prioritize investment going forward.

csmarchbanks · 2025-11-25

Yep, for sure, monitoring is still on my TODO list. I am hoping we can remove this restriction in the future, but drop policies (and composite policies) look to involve tracking a bitset of early evaluation results that I didn't want to include in this PR.
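
One possible shape for that bitset, shown purely as a hypothetical sketch since the PR does not include it: each trace keeps one bit per drop policy, and an early Sampled decision is released only once every drop policy has been cleared. The names `earlyState`, `clearDrop`, and `canSampleEarly` are assumptions, and the sketch assumes at most 64 drop policies with indexes below `numDrop`.

```go
package main

import (
	"fmt"
	"math/bits"
)

// earlyState is hypothetical per-trace state for early evaluation.
type earlyState struct {
	sampledSeen bool   // some sampling policy already returned Sampled early
	dropCleared uint64 // bit i set: drop policy i can no longer drop this trace
}

// clearDrop records that drop policy i has ruled itself out for this trace.
func (s *earlyState) clearDrop(i int) {
	s.dropCleared |= 1 << uint(i)
}

// canSampleEarly reports whether an early Sampled decision is safe: a policy
// wants the trace and all numDrop drop policies have been cleared.
func (s *earlyState) canSampleEarly(numDrop int) bool {
	return s.sampledSeen && bits.OnesCount64(s.dropCleared) == numDrop
}

func main() {
	st := &earlyState{sampledSeen: true}
	st.clearDrop(0)
	fmt.Println(st.canSampleEarly(2)) // one drop policy is still undecided
	st.clearDrop(1)
	fmt.Println(st.canSampleEarly(2)) // every drop policy has been cleared
}
```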

Allow the processor to run early evaluations at batch ingest time and make sampling decisions based on the result. This will drop or forward traces before the entire batch is ready, reducing the memory required to hold all trace data until the end of the decision wait. To begin with, only "basic" samplers are implemented: those that do not support invert or nest other policies. This still provides some gains while keeping the change manageable. Drop policies specifically will take more work to implement, as we cannot make an early Sampled decision until all drop policies have been evaluated, which will require some state to be maintained.

Combine the two interfaces previously created to simplify the code and avoid casts throughout. A sampler that does not support early decisions can just return samplingpolicy.Unspecified.
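
A rough sketch of what the combined interface looks like from a policy author's side, with simplified signatures. The embeddable `noEarly` helper and `spanCountPolicy` are hypothetical and only illustrate "just return Unspecified"; the PR itself does not necessarily provide such a helper.

```go
package main

import "fmt"

// Decision is a simplified stand-in for the processor's decision type.
type Decision int

const (
	Unspecified Decision = iota
	Sampled
	NotSampled
)

// Evaluator (simplified) carries both methods, so callers need no type assertion.
type Evaluator interface {
	Evaluate(spanNames []string) Decision
	EarlyEvaluate(newSpanNames []string) Decision
}

// noEarly is a hypothetical helper for policies without early support.
type noEarly struct{}

func (noEarly) EarlyEvaluate([]string) Decision { return Unspecified }

// spanCountPolicy samples traces with at least minSpans spans. It can only
// decide at the end of the decision wait, so it embeds noEarly.
type spanCountPolicy struct {
	noEarly
	minSpans int
}

func (p spanCountPolicy) Evaluate(spanNames []string) Decision {
	if len(spanNames) >= p.minSpans {
		return Sampled
	}
	return NotSampled
}

func main() {
	var e Evaluator = spanCountPolicy{minSpans: 2}
	fmt.Println(e.EarlyEvaluate([]string{"a"}) == Unspecified)
	fmt.Println(e.Evaluate([]string{"a", "b"}) == Sampled)
}
```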

csmarchbanks · 2025-12-12T16:26:18Z

Closing this one for now as #44878 looks to have more impact in our environments. I may come back to it in the future.

…s received (#44878)

#### Description

When testing the work done to provide early decisions (#44456) the impact was very limited in some scenarios, specifically when policies existed that look for long traces or if an error is ever present. It is not possible to know if an error span will come along in the future so almost all traces end up waiting until the decision wait causing the savings to only be a couple of percent in some environments.

What I found myself wishing for was a way to run all decisions more quickly for most cases, and what worked fairly well is to base that decision on if a root span has been received or not. This change implements a second decision wait to collect any straggler spans that might be present for a trace (e.g. a second service with a different batch timer before sending), but still allow it to be much faster than the base decision wait.

A good way of thinking about the two options is that decision_wait is the maximum amount of time we will wait for a span to arrive, and decision_wait_after_root_received is the minimum amount of time we will wait for additional spans to come in.

The downside of this approach is that heavily asynchronous traces may not be sampled as expected, however we find that those do not work very well as it is since they commonly last longer than the decision wait anyway. The behavior is opt in so no changes are needed for any users.

I wanted to keep the changes in this PR relatively small, but in the future we could re-implement id batcher to support moving traces between batches, or possibly a priority queue where we pop values until some threshold, rather than having two batchers.

#### Link to tracking issue

Part of #43876

#### Testing

Added smoke test for the new functionality

#### Documentation

Added documentation explaining the new configuration variable.

yvrhdn reviewed Nov 21, 2025

Logiraptor reviewed Nov 22, 2025

atoulme added the processor/tailsampling Tail sampling processor label Dec 8, 2025

csmarchbanks added 8 commits December 9, 2025 09:56

Add parent span to sampling TraceData

448ea7e

Clarify that allData should not be used on each request

2635bd9

Fix latency upper threshold behavior

d6342f7

Fix review feedback

e4b424d

Add EarlyEvaluate to the Evaluator interface

c7ce6de

Fix and test NotSampled early decision case

f686d23

Fix lint errors

0dd0a1c

csmarchbanks force-pushed the tsp-early-decisions branch from fff8c62 to 0dd0a1c · December 9, 2025 17:02

Fix lint failures post rebase

5352840

csmarchbanks force-pushed the tsp-early-decisions branch from e38fb6a to e20f50f · December 9, 2025 19:42

Add more early vs normal attributes for debugging

a6efff5

csmarchbanks force-pushed the tsp-early-decisions branch from e20f50f to a6efff5 · December 9, 2025 19:45

Add changelog entry

aa88196

csmarchbanks marked this pull request as ready for review December 9, 2025 23:03

csmarchbanks requested a review from a team as a code owner December 9, 2025 23:03

csmarchbanks requested a review from ArthurSens December 9, 2025 23:03

github-actions Bot assigned ArthurSens Dec 9, 2025

github-actions Bot requested a review from portertech December 9, 2025 23:03

csmarchbanks mentioned this pull request Dec 10, 2025

[processor/tailsampling] Allow faster decisions after the root span is received #44878

Merged

csmarchbanks closed this Dec 12, 2025

csmarchbanks mentioned this pull request Apr 8, 2026

[processor/tailsampling] add sampling_strategy config with trace-complete and span-ingest modes #46762

Merged

csmarchbanks wants to merge 11 commits into open-telemetry:main from csmarchbanks:tsp-early-decisions
