[common][router][WIP] cache dimensions for otel #1532

m-nagarajan · 2025-02-13T21:45:09Z

Summary

Problem:
Currently, every metric.record() call creates a new Attributes object with all the dimensions needed for that metric. As it also happens on the happy path, it will lead to high rate of object churn and potentially affecting GC.

Solution:
This PR aims at reducing this object churn.
Considered 2 approaches and end up choosing approach 2 to cache the dimensions:

Pre create all possible dimensions: This gets complicated as store name and cluster name are also part of the dimensions and it can lead to creating so many Attributes object and we need to craft a key during runtime to get to the precreated dimensions. Precreating everything is not possible as there can be new stores coming into the picture after bootstrap.
Cache the dimensions: Rather than pre creating all the dimensions we can create them as and when needed and then cache it for future uses. This will be more dynamic without the need for precreating all combinations. Similar key is needed to access the cache.

Implementation details:

using a ThreadLocal<Map<VeniceMetricsDimensions, String>> to pass in the dimension and its values rather than building an object everytime or pass using varargs or writing custom methods for each metrics
using a VeniceConcurrentHashMap<String, Attributes> to cache the unique Attributes
key (of type String)to access this cache is the combination of all dimension names and values. Eg: "DIMENSION1NAMEdimension1valueDIMENSION2NAMEdimension2value..."
Modified dimensionsList in MetricEntity from Set to a SortedSet to help in creating consistent keys.
Every RouterHttpRequestStats (or potentially any stats object class) will create its own VeniceOpenTelemetryDimensionsCache to take advantage of the base dimensions.
For the key, I originally was pre-creating a pattern for each MetricEntity like "DIMENSION1NAME%sDIMENSION2NAME%s..." but that needs using String.format(), so ended up using a StringBuilder and creating the full string during runtime instead.

How was this PR tested?

NA

Does this PR introduce any user-facing changes?

No. You can skip the rest of this section.
Yes. Make sure to explain your proposed changes and call out the behavior change.

lluwm · 2025-02-18T17:43:36Z

...lient-common/src/main/java/com/linkedin/venice/stats/VeniceOpenTelemetryDimensionsCache.java

+        String dimensionValue = reusableDimensionsMap.get(dimension);
+        if (dimensionValue == null) {
+          // TODO: this is not a comprehensive check as this thread local map is not cleared after use
+          throw new VeniceException("Dimension value cannot be null for " + dimension);


Why we can assert dimensionValue is always in reusableDimensionsMap? Is it because we always filling in the content before calling checkCacheAndGetDimensions or something else? Would it be intuitive to move the codes inside this function, I mean the codes that inserts dimension values into the cache. If we could do that, then probably we don't need to expose the getThreadLocalReusableDimensionsMap to public.

If TODO comments is still valid and we never clear the cache after use, is there any reasonable limit value that we can cap the size of this cache?

lluwm · 2025-02-18T18:06:02Z

...lient-common/src/main/java/com/linkedin/venice/stats/VeniceOpenTelemetryDimensionsCache.java

+  private final String baseMetricDimensionsKey;
+
+  /** used to pass in the dimension and its values to create {@link Attributes} and avoid creating temp maps/arrays */
+  private static final ThreadLocal<Map<VeniceMetricsDimensions, String>> threadLocalReusableDimensionsMap =


Is it correct that, ThreadLocal requires every router thread servicing a client query have to create a copy of this map? If it is the case, do we know the upper limit of how large this number could be?

FelixGV

Left a high level comment... Please take a look and LMK what you think.

FelixGV · 2025-02-18T20:08:59Z

...ces/venice-router/src/main/java/com/linkedin/venice/router/stats/RouterHttpRequestStats.java

-      dimensions = Attributes.builder()
-          .putAll(commonMetricDimensions)
-          .put(getDimensionName(VENICE_REQUEST_RETRY_TYPE), retryType.getRetryType())
-          .build();
+      Map<VeniceMetricsDimensions, String> reusableDimensionsMap = getThreadLocalReusableDimensionsMap();
+      reusableDimensionsMap.put(VENICE_REQUEST_RETRY_TYPE, retryType.getRetryType());
+      dimensions = otelDimensionsCache.checkCacheAndGetDimensions(retryCountMetric.getMetricEntity());


This logic looks extremely complex to me, both in terms of my ability to understand it, but also in terms of time complexity (so many map operations, string building, etc...).

Since the MetricEntityState retryCountMetric is a private property of this class, and this class has the per-store scope that we're interested in, I don't understand, why not cache the Attributes dimensions inside of the MetricEntityState object itself? Then this whole function can become simply retryCountMetric.record(1); and we let the dimension-passing be completely handled on the inside, by simply taking it from some private final Attributes dimensions property of the MetricEntityState... No map lookups, no string building, no threadlocal state... all of that disappears completely. And by hiding the OTel complexity in this way, we should greatly simplify the migration from Tehuti to OTel.

WDYT?

Thanks @FelixGV for the comment. Packing the cache inside MetricEntityState by relying on the existing class with the per-store scope sounds good as well, but it doesn't eliminate the string building (for cache key) completely as each state can have multiple Attributes keyed by one/more of the varying dimensions. For instance,

retryCountMetric can have multiple Attributes based on values of RequestRetryType

healthyRequestMetric can have multiple Attributes based on values of HttpResponseStatus and HttpResponseStatusCodeCategory.
Also, when we move away from tehuti we will regroup some of the current MetricEntityStates into 1 (for instance: healthy_request, unhealthy_request, tardy_request, etc to just 1) which is going to increase this cardinality further. We can continue keeping each of these combinations to be a separate MetricEntityState, but I feel like its too much denormalizing and we have to further denormalize it to keep things 1:1. In other alternative routes, we will need some form of key to access the cache (global or per RouterHttpRequestStats or per MetricEntityState ). The content of the key gets smaller as we move further away from global cache, the easier being just 1 dimension like in retryCountMetric where a cache per RouterHttpRequestStats like below would work.

VeniceConcurrentHashMap<RequestRetryType, Attributes> otelDimensionCacheForRequestRetryType;

If not, if we have combinations of two or more dimensions as keys, we need to construct some form of key for the cache. What do you think?

m-nagarajan added 2 commits February 13, 2025 13:15

Introduce caching the dimensions for otel

aa989b1

minor cleanup

b6f4d69

lluwm reviewed Feb 18, 2025

View reviewed changes

FelixGV reviewed Feb 18, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[common][router][WIP] cache dimensions for otel #1532

[common][router][WIP] cache dimensions for otel #1532

m-nagarajan commented Feb 13, 2025 •

edited

Loading

lluwm Feb 18, 2025

lluwm Feb 18, 2025 •

edited

Loading

FelixGV left a comment

FelixGV Feb 18, 2025

m-nagarajan Feb 19, 2025

[common][router][WIP] cache dimensions for otel #1532

Are you sure you want to change the base?

[common][router][WIP] cache dimensions for otel #1532

Conversation

m-nagarajan commented Feb 13, 2025 • edited Loading

Summary

How was this PR tested?

Does this PR introduce any user-facing changes?

lluwm Feb 18, 2025

Choose a reason for hiding this comment

lluwm Feb 18, 2025 • edited Loading

Choose a reason for hiding this comment

FelixGV left a comment

Choose a reason for hiding this comment

FelixGV Feb 18, 2025

Choose a reason for hiding this comment

m-nagarajan Feb 19, 2025

Choose a reason for hiding this comment

m-nagarajan commented Feb 13, 2025 •

edited

Loading

lluwm Feb 18, 2025 •

edited

Loading