Change the inclusivity of exponential histogram bounds #2633

jmacd · 2022-06-28T00:36:39Z

Fixes #2611 (for the most part)

Changes

The OpenTelemetry OTLP v0.11 exponential histogram defined lower-inclusive histogram buckets. The rationale for this decision is that the computations involved are simpler than for upper-inclusive histogram buckets, relatively speaking, due to the nature of IEEE 754's floating point representation.

The Prometheus project is strongly opposed to the use of lower-inclusive boundaries for histogram buckets because of historical precedent. In brief, there are existing modes of querying (existing) histogram data that are defined with upper-inclusive boundaries => it is possible to make exact inferences using existing histogram data => future histograms should preserve this mode of inference.

The Prometheus project has (with limits) adopted the base-2 exponential boundary structure defined in OTLP v0.11, so the only major difference between theirs and OpenTelemetry's is this issue. The proposed changes that will bring harmony in this space are: (1) this PR, (2) a demonstration of the aggregator change (OTel-Go), and (3) a comments-only change in the protocol.

The counter argument to making this change can be summarized:

It is slightly simpler to use lower-inclusive definitions
Exactness, even for power of two queries, is unwelcome overhead
Exactness is not possible for non-power-of-two queries
Negative-range numbers have opposite inclusivity in both proposals, counter to the claim of consistency.

The argument in favor of this proposal can be summarized:

Prometheus will be happy (even without exactness)
OTel never required exactness, so it's not a breaking change
Checking for exact for powers-of-two is cheap
~~Table-lookup implementations become less relatively-complex.~~

Please see the changes to the mapping functions prototyped in open-telemetry/opentelemetry-go#2982.

CC: @gouthamve @beorn7 @RichiH.

oertl · 2022-06-28T11:00:50Z

@jmacd

Table-lookup implementations become less relatively-complex.

Isn't it rather the opposite? The floating-point representations of values 1.0 and 1.00001 have the same exponent, therefore it is easier to map them to the same bucket than to put them in different buckets.

reyang

LGTM.

jmacd · 2022-06-28T17:09:04Z

Isn't it rather the opposite? The floating-point representations of values 1.0 and 1.00001 have the same exponent, therefore it is easier to map them to the same bucket than to put them in different buckets.

I think it's established that lower-inclusive boundaries have less "absolute" complexity, that's demonstrated in these changes. I suspect your point is that even a table-lookup implementation has to do more work here, because it requires a new test for significand==0--it's the same in every one of these methods, we're adding an explicit test for power-of-two values and subtracting 1 from the calculated index.

As I recall, @oertl your O(1) table-lookup implementation used 2 comparisons per call. I believe your point is that now it will require 3 comparisons per call?

I was trying to find a silver lining here: the complexity difference between table-lookup and log()-based methods is shrinking, even as they become more complex.

oertl · 2022-06-28T18:27:16Z

@jmacd

As I recall, @oertl your O(1) table-lookup implementation used 2 comparisons per call. I believe your point is that now it will require 3 comparisons per call?

Yes, but maybe it is possible to avoid additional branching for powers of two by decrementing the floating-point input value first (e.g., using Math.nextDown in Java). Anyway, I do not see how table-lookup implementations would become "less relatively-complex" with exclusive lower bounds, which is one of your arguments in favor of this proposal.

jmacd · 2022-06-30T19:21:16Z

@oertl I struck that sentence about relative complexity. (It didn't land well!)

You're right that nextAfter can change the boundary condition between one and another inclusivity (Prometheus made this suggestion, too, but it looks like equal complexity to me--it costs more than one additional comparison at least.)

I looked back at the table lookup mapping function I had prototyped, which I derived from yours (included in this draft PR). To get a rough idea, the program to generate the table of constants is about the same size as the library that performs the lookup and the reverse mapping function. The library to perform the lookup/reverse-mapping is about the same size of, but already more complex than the logarithm-based mapping function. To me the change you would make in a table lookup function will leave an implementation with about the same complexity as before. On the other hand, the change to be made in a logarithm or exponent-based lookup function leaves behind substantially more complexity than it had before, and this is what I meant by relative complexity. The logarithm implementation suffers a lot, the table lookup implementation suffers a little.

jack-berg

Working on the implementation but left some comments.

specification/metrics/data-model.md

beorn7 · 2022-07-05T11:37:50Z

Sorry for the separate comments rather than doing one review.

I think this looks fine from the Prometheus perspective. Many thanks for doing this!

specification/metrics/data-model.md

jmacd · 2022-07-05T23:01:06Z

~~As explained in #2633 (comment), I no longer think this is a good idea.~~

…ication into jmacd/histobounds

jmacd · 2022-07-07T19:53:04Z

As explained in #2611 (comment), I still consider this change acceptable, but the text had some errors. I will re-open this after correcting the mistakes.

… scales; clarify why two different expositions; typos

Co-authored-by: David Ashpole <[email protected]>

specification/metrics/data-model.md

…ication into jmacd/histobounds

…ecification into jmacd/histobounds

…ication into jmacd/histobounds

specification/metrics/data-model.md

reyang

LGTM. I've prototyped this in OpenTelemetry .NET, as I can tell, it is working very well. @jmacd thank you for great work! 👏

Co-authored-by: Reiley Yang <[email protected]>

jmacd · 2022-08-05T16:59:15Z

@open-telemetry/specs-approvers Please approve this PR that was requested by the Prometheus developers as it will improve collaboration between these groups.

…ication into jmacd/histobounds

…ecification into jmacd/histobounds

gouthamve

LGTM!

dashpole

LGTM

…y#2633)

Change the exponential histogram boundary condition

a90cbd7

jmacd requested review from a team June 28, 2022 00:36

github-actions bot assigned carlosalberto Jun 28, 2022

reyang approved these changes Jun 28, 2022

View reviewed changes

reyang added area:sdk Related to the SDK spec:metrics Related to the specification/metrics directory labels Jun 28, 2022

jack-berg approved these changes Jul 2, 2022

View reviewed changes

beorn7 reviewed Jul 5, 2022

View reviewed changes

specification/metrics/data-model.md Show resolved Hide resolved

beorn7 reviewed Jul 5, 2022

View reviewed changes

specification/metrics/data-model.md Outdated Show resolved Hide resolved

beorn7 reviewed Jul 5, 2022

View reviewed changes

specification/metrics/data-model.md Outdated Show resolved Hide resolved

jack-berg reviewed Jul 5, 2022

View reviewed changes

specification/metrics/data-model.md Show resolved Hide resolved

jmacd closed this Jul 5, 2022

Merge branch 'main' of github.com:open-telemetry/opentelemetry-specif…

ca0b140

…ication into jmacd/histobounds

jmacd mentioned this pull request Jul 7, 2022

Prometheus and OTel High Resolution Histograms incompatibilities #2611

Closed

Merge branch 'main' of github.com:open-telemetry/opentelemetry-specif…

409ba17

…ication into jmacd/histobounds

Joshua MacDonald added 3 commits July 7, 2022 14:22

remove lookup table; untabify; derive the scaling factor for positive…

f9f1b17

… scales; clarify why two different expositions; typos

typo

e982a80

remove mention of table lookup

76125e7

jmacd reopened this Jul 7, 2022

Joshua MacDonald added 2 commits July 7, 2022 14:53

format subscripts

53d26d6

deformat

d65a076

Update specification/metrics/data-model.md

8a37196

Co-authored-by: David Ashpole <[email protected]>

reyang reviewed Jul 25, 2022

View reviewed changes

specification/metrics/data-model.md Outdated Show resolved Hide resolved

reyang reviewed Jul 25, 2022

View reviewed changes

specification/metrics/data-model.md Outdated Show resolved Hide resolved

Joshua MacDonald added 6 commits July 27, 2022 11:15

Merge branch 'main' of github.com:open-telemetry/opentelemetry-specif…

e68e7bf

…ication into jmacd/histobounds

reiley's fixes

8215aaf

Merge branch 'jmacd/histobounds' of github.com:jmacd/opentelemetry-sp…

b5d24a7

…ecification into jmacd/histobounds

lengthen the explanation

87577de

lint

68c75e2

Merge branch 'main' of github.com:open-telemetry/opentelemetry-specif…

aa45c6d

…ication into jmacd/histobounds

reyang reviewed Jul 28, 2022

View reviewed changes

specification/metrics/data-model.md Outdated Show resolved Hide resolved

reyang approved these changes Jul 28, 2022

View reviewed changes

reyang mentioned this pull request Jul 28, 2022

Exponential Bucket Histogram - part 1 open-telemetry/opentelemetry-dotnet#3462

Merged

jmacd and others added 2 commits July 28, 2022 12:52

Update specification/metrics/data-model.md

b0014b0

Co-authored-by: Reiley Yang <[email protected]>

Merge branch 'main' into jmacd/histobounds

5af9404

jmacd and others added 3 commits August 5, 2022 11:28

Merge branch 'main' into jmacd/histobounds

b9b24a6

Merge branch 'main' of github.com:open-telemetry/opentelemetry-specif…

cc95f75

…ication into jmacd/histobounds

Merge branch 'jmacd/histobounds' of github.com:jmacd/opentelemetry-sp…

bc00417

…ecification into jmacd/histobounds

gouthamve approved these changes Aug 9, 2022

View reviewed changes

dashpole approved these changes Aug 9, 2022

View reviewed changes

Merge branch 'main' into jmacd/histobounds

9e47e1f

cijothomas approved these changes Aug 12, 2022

View reviewed changes

reyang merged commit ea01715 into open-telemetry:main Aug 12, 2022

jack-berg mentioned this pull request Aug 18, 2022

Lower exclusive exponential histogram bounds open-telemetry/opentelemetry-java#4700

Merged

MadVikingGod mentioned this pull request Sep 9, 2022

Prerelease v1.10.0 open-telemetry/opentelemetry-go#3158

Merged

ahayworth mentioned this pull request Oct 7, 2022

Audit for 1.13.0 spec compliance open-telemetry/opentelemetry-ruby#1384

Closed

carlosalberto pushed a commit to carlosalberto/opentelemetry-specification that referenced this pull request Oct 31, 2024

Change the inclusivity of exponential histogram bounds (open-telemetr…

b492c56

…y#2633)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Change the inclusivity of exponential histogram bounds #2633

Change the inclusivity of exponential histogram bounds #2633

jmacd commented Jun 28, 2022 •

edited

Loading

oertl commented Jun 28, 2022

reyang left a comment

jmacd commented Jun 28, 2022

oertl commented Jun 28, 2022

jmacd commented Jun 30, 2022

jack-berg left a comment

beorn7 commented Jul 5, 2022

jmacd commented Jul 5, 2022 •

edited

Loading

jmacd commented Jul 7, 2022

reyang left a comment

jmacd commented Aug 5, 2022

gouthamve left a comment

dashpole left a comment

Change the inclusivity of exponential histogram bounds #2633

Change the inclusivity of exponential histogram bounds #2633

Conversation

jmacd commented Jun 28, 2022 • edited Loading

Changes

oertl commented Jun 28, 2022

reyang left a comment

Choose a reason for hiding this comment

jmacd commented Jun 28, 2022

oertl commented Jun 28, 2022

jmacd commented Jun 30, 2022

jack-berg left a comment

Choose a reason for hiding this comment

beorn7 commented Jul 5, 2022

jmacd commented Jul 5, 2022 • edited Loading

jmacd commented Jul 7, 2022

reyang left a comment

Choose a reason for hiding this comment

jmacd commented Aug 5, 2022

gouthamve left a comment

Choose a reason for hiding this comment

dashpole left a comment

Choose a reason for hiding this comment

jmacd commented Jun 28, 2022 •

edited

Loading

jmacd commented Jul 5, 2022 •

edited

Loading