Skip to content

[chore] add guidelines for telemetry stability levels#14280

Merged
codeboten merged 3 commits intoopen-telemetry:mainfrom
ChrsMark:document_stability_levels
Dec 11, 2025
Merged

[chore] add guidelines for telemetry stability levels#14280
codeboten merged 3 commits intoopen-telemetry:mainfrom
ChrsMark:document_stability_levels

Conversation

@ChrsMark
Copy link
Copy Markdown
Member

Description

This PR adds a guideline section about the stability levels the produced components' telemetry should follow.

Link to tracking issue

Related to open-telemetry/opentelemetry.io#8558 and #13910.

Signed-off-by: ChrsMark <chrismarkou92@gmail.com>
@codecov
Copy link
Copy Markdown

codecov Bot commented Dec 10, 2025

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 92.14%. Comparing base (8f51a17) to head (8715a42).
⚠️ Report is 6 commits behind head on main.

Additional details and impacted files
@@            Coverage Diff             @@
##             main   #14280      +/-   ##
==========================================
- Coverage   92.17%   92.14%   -0.03%     
==========================================
  Files         668      668              
  Lines       41510    41513       +3     
==========================================
- Hits        38262    38254       -8     
- Misses       2215     2221       +6     
- Partials     1033     1038       +5     

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:
  • ❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

Comment thread docs/coding-guidelines.md Outdated

### Metrics

Metrics emitted by Collector components follow the same stability levels documented by
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is this about internal metrics, or metrics emitted by receivers/scrapers? The reference "Internal Telemetry Stability" implies the former, but the reference in the scraper document suggests the latter. I think this should be made extra clear.

Copy link
Copy Markdown
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm not sure if we have a specific definition to refer to the non-internal metrics. Tried to clarify that and added an example metric system.cpu.time. Any other suggestions on how this could be further clarified would be welcome.

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think changing the section titles to "Internal Telemetry Stability Levels" and "Internal Metrics" would help.

I would also completely remove the reference in the scraping receivers docs; "metrics" in that doc seems to systematically refer to the metrics the scraper outputs into the pipeline, so non-internal.

Copy link
Copy Markdown
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

"metrics" in that doc seems to systematically refer to the metrics the scraper outputs into the pipeline, so non-internal.

That's what I want to cover here :), metrics the scraper outputs into the pipeline. Not the internal ones.

Copy link
Copy Markdown
Contributor

@jade-guiton-dd jade-guiton-dd Dec 10, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Oh. In that case, it's the reference to the "Internal Telemetry Stability" page that's confusing, since that page is specifically about internal telemetry. If I'm not mistaken, the changes made in #14070 were also about internal metrics 🤔 I think there may have been some confusion somewhere

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If it really is just receiver/scraper metrics, could we directly refer to the SemConv document / OTEP that this change is inspired from?

Copy link
Copy Markdown
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The idea is that both internal telemetry and non-internal telemetry should follow the same stability levels.

Copy link
Copy Markdown
Contributor

@jade-guiton-dd jade-guiton-dd Dec 10, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Okay. In that case, in terms of wording I would recommend something like: "Metrics emitted by Collector scrapers/receivers (e.g. system.cpu.time) follow the same stability levels as the Collector's internal metrics (e.g. otelcol_process_cpu_seconds), as documented in [Internal Telemetry Stability]"

Signed-off-by: ChrsMark <chrismarkou92@gmail.com>
@codspeed-hq

This comment was marked as outdated.

Signed-off-by: ChrsMark <chrismarkou92@gmail.com>
@codeboten codeboten added this pull request to the merge queue Dec 11, 2025
Merged via the queue into open-telemetry:main with commit 605123a Dec 11, 2025
61 of 62 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants