[Draft] Sampling milestones blog post by jmacd · Pull Request #7735 · open-telemetry/opentelemetry.io

jmacd · 2025-09-08T21:59:29Z

Work-in-progress to share with the Sampling SIG before asking for editorial help.

jpkrohling

A blog post on this was long overdue, thank you very much, @jmacd !!

jpkrohling · 2025-09-09T10:03:49Z

+
+## Intro
+
+The OpenTelemetry sampling project promotes features and


Suggested change

The OpenTelemetry sampling project promotes features and

The OpenTelemetry Sampling SIG promotes features and

jpkrohling · 2025-09-09T10:06:45Z

+cSpell:ignore:
+---
+
+## Intro


I feel like there could be an "intro intro" paragraph. As a reader, why should I care? Is it for me? Like:

The OTel Sampling SIG promotes ... . In this blog post, we'll share the progress we've made over the past ... months, as well as provide a peek into the future.

Users look to OTel to provide ...

jpkrohling · 2025-09-09T10:10:29Z

+score. Adjusted count is the mathematical reciprocal of selection
+probability. Here are a few examples of the term in use:
+
+- _25% probability sampling is communicated by `ot=th:c`, corresponding with an adjusted count of 4 per item._


I know we've introduced "ot" and "tc" before, but perhaps we could spell out that "c" means 25% probability sampling?

jpkrohling · 2025-09-09T10:12:36Z

+- _An adjusted count of N means we would expect to see N-1 similar items had we collected all of the data._
+
+Our goal is that OpenTelemetry users can lower telemetry data
+collection costs through sampling, while preserving adjusted count


Suggested change

collection costs through sampling, while preserving adjusted count

collection volume through sampling, while preserving adjusted count

It's not always about costs: the operational requirements (network bandwidth, for instance) might be a constraint in some scenarios.

jpkrohling · 2025-09-09T10:15:50Z

+- SDKs will record the tracestate field as part of the OTLP span record
+- Collectors and backends will be able to count using adjusted counts, enabling acculate metrics calculated from sampled data.
+
+We have supplemental guidelines for OpenTelemetry collectors in case


"OpenTelemetry collectors" -- are you talking about people configuring OpenTelemetry Collector instances?

jpkrohling · 2025-09-09T10:26:20Z

@@ -0,0 +1,258 @@
+---
+title: OpenTelemetry Sampling update


I like this topic, I followed the development of this spec somewhat closely, and I believe the blog post portraits the work that has been done. That said, I'm not sure what's the audience for this.

If we are trying to give the community of users an update about the sampling features that are coming, then I'd reframe this blog post, so that it starts with a problem statement, followed perhaps by a concrete use-case (real or not), and then what's being done to solve that. There's no need to get into the details of how things are calculated, just that the sampling threshold is propagated through regular trace context level 2, "coming soon to an SDK near you".

If we are trying to get maintainers to implement this, I'd make it very clear at the very beginning, and also start with a clear problem statement, to convince them that they should implement this in their SDKs.

I believe I still know the math behind this, and the blog post was a good refresher for me. I'm afraid readers not familiar with sampling (especially probabilistic) might get lost quickly though. Perhaps we could have a call somewhere like: "and if you are interested in knowing how this magic works or have an interest in statistics or probability, look at this doc. We'd love to have you with us!"

jpkrohling · 2025-09-09T10:32:21Z

+probability. Here are a few examples of the term in use:
+
+- _25% probability sampling is communicated by `ot=th:c`, corresponding with an adjusted count of 4 per item._
+- _An adjusted count of N means we would expect to see N-1 similar items had we collected all of the data._


There's one small thing that bothered me a bit: "similar items" is subjective here. By sampling, we are effectively throwing away data. If we are solely sampling based on the trace ID, then we might not be sampling enough of the rare events. Even if we are getting 1% of the rare events, the attributes within the events might not be representative (am I getting 1% of client=vip?).

I know it's a nit for the article, but I feel like users shouldn't be led to think that they will be able to sample 1% of their data and correctly extrapolate to 100% from that. They can't.

[Draft] Sampling milestones blog post

dc4e310

github-project-automation Bot added this to SIG Comms: PRs & Issues Sep 8, 2025

opentelemetrybot requested a review from a team September 8, 2025 21:59

github-actions Bot added the docs:blog An issue requesting a blog post, or a PR for a new blog post label Sep 8, 2025

jmacd mentioned this pull request Sep 8, 2025

Stratified Sampling Policy for Tailsampling processor open-telemetry/opentelemetry-collector-contrib#41877

Closed

jmacd added 2 commits September 8, 2025 16:34

edit

3115e76

lint

c5fde2d

jpkrohling reviewed Sep 9, 2025

View reviewed changes

Rewrite!

355a782

jmacd closed this Oct 1, 2025

github-project-automation Bot moved this to Done in SIG Comms: PRs & Issues Oct 1, 2025

jmacd mentioned this pull request Oct 1, 2025

Sampling milestones blog post #7967

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Draft] Sampling milestones blog post#7735

[Draft] Sampling milestones blog post#7735
jmacd wants to merge 4 commits into
open-telemetry:mainfrom
jmacd:jmacd/sampling_milestone_blog

jmacd commented Sep 8, 2025

Uh oh!

jpkrohling left a comment

Uh oh!

jpkrohling Sep 9, 2025

Uh oh!

jpkrohling Sep 9, 2025

Uh oh!

jpkrohling Sep 9, 2025

Uh oh!

jpkrohling Sep 9, 2025

Uh oh!

jpkrohling Sep 9, 2025

Uh oh!

jpkrohling Sep 9, 2025

Uh oh!

jpkrohling Sep 9, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants


		## Intro

		The OpenTelemetry sampling project promotes features and

	The OpenTelemetry sampling project promotes features and
	The OpenTelemetry Sampling SIG promotes features and

	collection costs through sampling, while preserving adjusted count
	collection volume through sampling, while preserving adjusted count

Conversation

jmacd commented Sep 8, 2025

Uh oh!

jpkrohling left a comment

Choose a reason for hiding this comment

Uh oh!

jpkrohling Sep 9, 2025

Choose a reason for hiding this comment

Uh oh!

jpkrohling Sep 9, 2025

Choose a reason for hiding this comment

Uh oh!

jpkrohling Sep 9, 2025

Choose a reason for hiding this comment

Uh oh!

jpkrohling Sep 9, 2025

Choose a reason for hiding this comment

Uh oh!

jpkrohling Sep 9, 2025

Choose a reason for hiding this comment

Uh oh!

jpkrohling Sep 9, 2025

Choose a reason for hiding this comment

Uh oh!

jpkrohling Sep 9, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants