feat: Add a metric ingestion time SM sanitization #15222

cstyan · 2024-12-03T04:12:39Z

This adds a per-tenant metric that is incremented when a tenant has structured metadata (SM) sanitized at ingestion time. A metric is less spammy than a log line, and while the tenant label could in principle be high cardinality, that is unlikely in practice: the share of users who use SM and send SM with invalid characters is likely low.

Some of the discussion around #15113 has been that query time sanitization of the SM name and value is expensive (value sanitization would be added in that PR; name sanitization is already in place), and we'd like to move that functionality to a per-tenant basis.

The goal here is to tell us which users have been sending invalid SM, so that we can change the query time SM sanitization code to run only for tenants that have it enabled through per-tenant overrides. With this metric in place we would know, ahead of the switchover, which users have been sending invalid SM, and could set the per-tenant config in advance of the rollout.
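A rough sketch of what per-tenant gating of query time sanitization could look like. Everything here is hypothetical and for illustration only: `tenantOverrides`, its `sanitizeSMAtQuery` field, and `queryTimeName` are not Loki identifiers, and `normalizeName` is a simplified stand-in for the real label normalization.

```go
package main

import (
	"fmt"
	"strings"
)

// tenantOverrides is a hypothetical stand-in for per-tenant overrides.
// The field name is an assumption, not an actual Loki setting.
type tenantOverrides struct {
	sanitizeSMAtQuery map[string]bool
}

// sanitizeAtQueryTime reports whether the tenant has opted in.
// Tenants missing from the map default to false.
func (o tenantOverrides) sanitizeAtQueryTime(tenantID string) bool {
	return o.sanitizeSMAtQuery[tenantID]
}

// normalizeName is a simplified stand-in for label-name normalization:
// every character outside [a-zA-Z0-9_] becomes an underscore.
func normalizeName(name string) string {
	return strings.Map(func(r rune) rune {
		switch {
		case r >= 'a' && r <= 'z', r >= 'A' && r <= 'Z', r >= '0' && r <= '9', r == '_':
			return r
		}
		return '_'
	}, name)
}

// queryTimeName pays the sanitization cost only for opted-in tenants.
func queryTimeName(o tenantOverrides, tenantID, name string) string {
	if !o.sanitizeAtQueryTime(tenantID) {
		return name
	}
	return normalizeName(name)
}

func main() {
	o := tenantOverrides{sanitizeSMAtQuery: map[string]bool{"tenant-a": true}}
	fmt.Println(queryTimeName(o, "tenant-a", "trace.id")) // sanitized
	fmt.Println(queryTimeName(o, "tenant-b", "trace.id")) // left untouched
}
```

The ingestion time metric would tell us which tenants need to be in that opt-in set before the rollout.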

ashwanthgoli · 2024-12-03T04:16:59Z

pkg/distributor/distributor.go

+					normalized = otlptranslate.NormalizeLabel(structuredMetadata[i].Name)
+					if normalized != structuredMetadata[i].Name {
+						structuredMetadata[i].Name = normalized
+						d.tenantPushSanitizedStructuredMetadata.WithLabelValues(tenantID).Inc()


should we also add this for value sanitisation?

Yep, this just got missed in my `git add --patch`. I saw the test failing as you posted your comment, and have now pushed the value `Inc()` call.


ashwanthgoli

lgtm!


cyriltovena

LGTM

Not sure we need a metric for that, but why not.

cstyan requested a review from a team as a code owner December 3, 2024 04:12

pull-request-size bot added the size/S label Dec 3, 2024

ashwanthgoli reviewed Dec 3, 2024


Add a metric for when a tenant has structured metadata sanitized at ingestion time.

91cabf9

Signed-off-by: Callum Styan <[email protected]>

cstyan force-pushed the callum-sm-sanitize-metric branch from 56f2cf7 to 91cabf9 Compare December 3, 2024 04:19

ashwanthgoli approved these changes Dec 3, 2024


pkg/distributor/distributor_test.go (outdated, resolved)

lint

bcbe1b4

Signed-off-by: Callum Styan <[email protected]>

pull-request-size bot added size/M and removed size/S labels Dec 3, 2024

cyriltovena approved these changes Dec 3, 2024


cstyan merged commit e9d0c3e into main Dec 4, 2024
60 checks passed

cstyan deleted the callum-sm-sanitize-metric branch December 4, 2024 03:36

This was referenced Dec 23, 2024

chore(k234): release 3.4.0 #15536

Open

chore(k235): release 3.4.0 #15555

Open

loki-gh-app bot mentioned this pull request Jan 6, 2025

chore(k236): release 3.4.0 #15595

Open

loki-gh-app bot mentioned this pull request Jan 13, 2025

chore(k237): release 3.4.0 #15705

Open
