Fix vertx otel sdk reload in devmode #28792

brunobat · 2022-10-24T13:57:04Z

Fixes OpenTelemetry instrumentation stops working for resteasy-reactive resources and JDBC after dev mode reload #29645

Part of #26444
Currently OTel will not live reload, working only on first start.

This PR includes:

Centralise instrumentation related classes in the Instrumentation processor and recorder.
Create an OpenTelemetryWapper that will detect when the global OTel SDK has changed in development mode.

Caveat:

On REST requests, the first instrumentation call after reload doesn't send span because OTEL is reloaded after the return of the call. The REST call starts with the old OTel SDK object in the vertx instrumentation and ends with the new one.

radcortez

I think we need to avoid the check on each tracer method because this would only be useful for DEV mode, but we still are doing the check for runtime with no gain.

My recommendation is to produce a CDI Bean of VertxTracer, so we have updated Instrumenters with the correct OpenTelemetry instance. Then in the OpenTelemetryVertxTracingFactory, we just have a delegate to the CDI VertTracer and keep a reference to the one we pass to the factory, so we can update it by replacing the new CDI Bean on reload.

To make the span also works on the reload request, I think we can recreate the start of the span on VertxHttpHotReplacementSetup before the root handler dispatches the current request. We have the instance of VertxTracer, so we can call the receiveRequest method, and we should have everything available for it.

The tricky part is that the current request holds the state of the Trace object (created initially by the request that triggered the reload). In this case, we would need to access the request internal state directly to replace the Trace object. There are no API's for that, but I think that some bytecode transformations for this particular case (and only for DEV mode) are not going to hurt.

...kus/opentelemetry/runtime/tracing/intrumentation/vertx/OpenTelemetryVertxTracingFactory.java

.../opentelemetry/runtime/tracing/intrumentation/vertx/wrapper/DevModeOpenTelemetryWrapper.java

...quarkus/opentelemetry/runtime/tracing/intrumentation/vertx/wrapper/OpenTelemetryWrapper.java

stuartwdouglas · 2022-11-22T03:26:19Z

I think we need to avoid the check on each tracer method because this would only be useful for DEV mode, but we still are doing the check for runtime with no gain.

My recommendation is to produce a CDI Bean of VertxTracer, so we have updated Instrumenters with the correct OpenTelemetry instance. Then in the OpenTelemetryVertxTracingFactory, we just have a delegate to the CDI VertTracer and keep a reference to the one we pass to the factory, so we can update it by replacing the new CDI Bean on reload.

The current check is basically just a volatile read. The CDI approach you are talking about will almost certainly be way more expensive per request.

brunobat · 2022-11-22T10:55:55Z

I think we need to avoid the check on each tracer method because this would only be useful for DEV mode, but we still are doing the check for runtime with no gain.
My recommendation is to produce a CDI Bean of VertxTracer, so we have updated Instrumenters with the correct OpenTelemetry instance. Then in the OpenTelemetryVertxTracingFactory, we just have a delegate to the CDI VertTracer and keep a reference to the one we pass to the factory, so we can update it by replacing the new CDI Bean on reload.

The current check is basically just a volatile read. The CDI approach you are talking about will almost certainly be way more expensive per request.

I would prefer to avoid anything CDI related, if possible and keep the current strategy... Maybe simplify the detection to setup instrumenters into helper method.
The resetIfChanged() in prod mode is a constant anyway... I bet this will get optimised.

radcortez · 2022-11-22T13:55:17Z

But you can completely avoid that check. And it is not only about the check. Since you need to add it to every tracing method, we must remember to always add that check to all implementations moving forward. Also, I believe instruments shouldn't have to deal with code that is only relevant for Dev mode.

The issue is that since Vert.x is always the same instance from restart to restart, the VertxTracer which contains the original instrumenters has to be updated. I think it is easier and more straightforward to just replace the current VertxTracer with a new one originating from the restart operation. You avoid changing the instrumenters and additional wrappers for the OpenTelemetry instance. This could be done in a way only to be applied to Dev mode, and regular Runtime would keep its current implementation.

As for CDI, the issue is that we should remove calls to GlobalOpenTelemetry.get, which is no longer recommended. Yes a different issue, but somehow related to how we are starting things.

brunobat · 2022-11-22T18:10:18Z

@stuartwdouglas , in the end, @radcortez proposal about creating delegates on the the Tracer Factory makes more sense because it will not require future changes in the instrumenters themselves.
I avoided CDI altogether and I'm using a simplified factory for production code.
Roberto helped with the code and he has also a solution for the first request after reload, but given its complexity, it will be done in a future PR.

brunobat · 2022-12-05T10:21:26Z

@gsmet any prediction on when this can be merged?

radcortez · 2022-12-05T10:54:26Z

I'll merge it.

This comment was marked as resolved.

Sign in to view

quarkus-bot bot added the area/tracing label Oct 24, 2022

brunobat changed the title ~~fix vertx otel sdk reload in devmode~~ Fix vertx otel sdk reload in devmode Oct 24, 2022

brunobat force-pushed the otel-autoconfigure-3 branch 4 times, most recently from c72884c to dfd650b Compare October 28, 2022 10:13

brunobat marked this pull request as ready for review October 28, 2022 10:15

brunobat requested review from radcortez and ozangunalp October 28, 2022 10:15

brunobat mentioned this pull request Oct 28, 2022

OpenTelemetry SDK Autoconfigure #26444

Closed

5 tasks

brunobat force-pushed the otel-autoconfigure-3 branch from dfd650b to 1b35191 Compare November 15, 2022 14:14

brunobat requested a review from stuartwdouglas November 15, 2022 14:15

This comment has been minimized.

Sign in to view

radcortez requested changes Nov 22, 2022

View reviewed changes

...kus/opentelemetry/runtime/tracing/intrumentation/vertx/OpenTelemetryVertxTracingFactory.java Show resolved Hide resolved

stuartwdouglas requested changes Nov 22, 2022

View reviewed changes

.../opentelemetry/runtime/tracing/intrumentation/vertx/wrapper/DevModeOpenTelemetryWrapper.java Outdated Show resolved Hide resolved

...quarkus/opentelemetry/runtime/tracing/intrumentation/vertx/wrapper/OpenTelemetryWrapper.java Outdated Show resolved Hide resolved

Fix vertx otel sdk reload in devmode with Delegate on tracer factory

71c6f7d

brunobat force-pushed the otel-autoconfigure-3 branch from 1b35191 to 71c6f7d Compare November 22, 2022 18:05

brunobat requested review from stuartwdouglas and radcortez November 23, 2022 10:28

radcortez approved these changes Nov 23, 2022

View reviewed changes

stuartwdouglas approved these changes Nov 24, 2022

View reviewed changes

brunobat added the triage/backport? label Nov 30, 2022

ozangunalp approved these changes Nov 30, 2022

View reviewed changes

radcortez mentioned this pull request Dec 5, 2022

OpenTelemetry instrumentation stops working for resteasy-reactive resources and JDBC after dev mode reload #29645

Closed

radcortez merged commit be10ea3 into quarkusio:main Dec 5, 2022

quarkus-bot bot added this to the 2.16 - main milestone Dec 5, 2022

quarkus-bot bot added the kind/bugfix label Dec 5, 2022

gsmet modified the milestones: 2.16 - main, 2.15.0.Final Dec 6, 2022

gsmet removed the triage/backport? label Dec 6, 2022

brunobat mentioned this pull request Dec 14, 2022

OpenTelemetry JDBC instrumentation stops working after dev mode reload #29849

Closed

michalvavrik mentioned this pull request May 11, 2023

Quarkus 3 OpenTelemetry hardening quarkus-qe/quarkus-test-suite#1222

Merged

9 tasks

brunobat deleted the otel-autoconfigure-3 branch August 21, 2024 10:35

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix vertx otel sdk reload in devmode #28792

Fix vertx otel sdk reload in devmode #28792

brunobat commented Oct 24, 2022 •

edited by radcortez

Loading

This comment was marked as resolved.

This comment has been minimized.

radcortez left a comment

stuartwdouglas commented Nov 22, 2022

brunobat commented Nov 22, 2022 •

edited

Loading

radcortez commented Nov 22, 2022

brunobat commented Nov 22, 2022 •

edited

Loading

brunobat commented Dec 5, 2022

radcortez commented Dec 5, 2022

Fix vertx otel sdk reload in devmode #28792

Fix vertx otel sdk reload in devmode #28792

Conversation

brunobat commented Oct 24, 2022 • edited by radcortez Loading

This comment was marked as resolved.

This comment has been minimized.

radcortez left a comment

Choose a reason for hiding this comment

stuartwdouglas commented Nov 22, 2022

brunobat commented Nov 22, 2022 • edited Loading

radcortez commented Nov 22, 2022

brunobat commented Nov 22, 2022 • edited Loading

brunobat commented Dec 5, 2022

radcortez commented Dec 5, 2022

brunobat commented Oct 24, 2022 •

edited by radcortez

Loading

brunobat commented Nov 22, 2022 •

edited

Loading

brunobat commented Nov 22, 2022 •

edited

Loading