feat: add tracing to worker and proxy #1014

SantiagoPittella · 2024-12-11T18:22:42Z

this PR is part of #1004

bin/tx-prover/src/commands/proxy.rs

bobbinth · 2024-12-13T01:56:50Z

What is left to do on this PR? I would probably try to finish this one first, then address #1008, and only after that try to tackle metrics.

SantiagoPittella · 2024-12-13T12:15:06Z

What is left to do on this PR?

This PR is missing a cleanup, some configuration options and documentation.

I would probably try to finish this one first, then address #1008, and only after that try to tackle metrics.

Ok! Sounds good.

bobbinth

Looks good! Thank you! I left some comments inline - most doc-related, but would be good for @igamigo and @Mirko-von-Leipzig to take a look as well.

bobbinth · 2024-12-13T20:30:53Z

bin/tx-prover/src/proxy/mod.rs

+#[derive(Debug)]
 pub struct LoadBalancer(pub Arc<LoadBalancerState>);


Not from this PR, but we should update section header on line 338 above.

bobbinth · 2024-12-13T20:32:05Z

bin/tx-prover/src/proxy/mod.rs

+    // The following methods are a copy of the default implementation defined in the trait, but
+    // with tracing instrumentation.


Could we add a brief explanation of why we need these methods implemented?

bobbinth · 2024-12-13T20:34:39Z

bin/tx-prover/src/proxy/mod.rs

            worker: None,
+            parent_span: info_span!("proxy:new_request", request_id = request_id.to_string()),


Question: why don't we need to specify target both here and in other places in this file?

bobbinth · 2024-12-13T20:42:48Z

bin/tx-prover/src/proxy/mod.rs

+        let server_session = session.as_mut();
+        let code = match e.etype() {
+            HTTPStatus(code) => *code,
+            _ => {
+                match e.esource() {
+                    ErrorSource::Upstream => 502,
+                    ErrorSource::Downstream => {
+                        match e.etype() {
+                            WriteError | ReadError | ConnectionClosed => {
+                                /* conn already dead */
+                                0
+                            },
+                            _ => 400,
+                        }
+                    },
+                    ErrorSource::Internal | ErrorSource::Unset => 500,
+                }
+            },
+        };
+        if code > 0 {
+            server_session.respond_error(code).await
+        }
+        code


I'm assuming that this is just a copy of a default implementation, right?

bobbinth · 2024-12-13T20:46:34Z

bin/tx-prover/src/utils.rs

+// Construct TracerProvider for OpenTelemetryLayer
+pub(crate) fn init_tracer_provider() -> TracerProvider {


nit: let's use /// for doc comments (here and in other places in this file).

Also, could we add more details about how we configure the tracing provider? For example, why do we need to add ID generator, what does with_sampler() do etc.

bobbinth · 2024-12-13T20:49:05Z

bin/tx-prover/src/utils.rs

+// Setup tracing subscriber
+pub(crate) fn setup_tracing(provider: TracerProvider) -> Result<(), String> {


Similar to the previous comment - could we add more details about what this function does?

Also, what's the motivation for have this and init_tracer_provider() as two separate functions? It seems like one is called right after the other. Should we combine them into one function?

bobbinth · 2024-12-13T20:49:37Z

bin/tx-prover/src/utils.rs

+use tracing::Level;
+use tracing_subscriber::{layer::SubscriberExt, Registry};
+
+pub const TRACING_TARGET_NAME: &str = "miden-tx-prover";


nit: I maybe would still call it MIDEN_TX_PROVER.

bobbinth · 2024-12-13T20:50:14Z

bin/tx-prover/Cargo.toml

+opentelemetry-semantic-conventions = "0.27.0"
+opentelemetry-jaeger = "0.22.0"


nit: I would get rid of the patch versions.

bobbinth · 2024-12-13T20:52:59Z

bin/tx-prover/README.md

@@ -114,6 +114,16 @@ The proxy service uses this health check to determine if a worker is available t

 Both the worker and the proxy will use the `info` log level by default, but it can be changed by setting the `RUST_LOG` environment variable.

+## Traces


I would maybe combine this and logging into one section (or make this a sub-section of logging?).

Also, I would add more details here. For example, it is not clear where tracing/logging info is written to. Is it stdout? Is it some logging file? Somewhere else? It is also not clear whether there is a way to not use Jaeger (or maybe use something else) to view the logs.

Basically, a bit more context about how tracing/logging works would be helpful.

bobbinth · 2024-12-13T20:54:26Z

bin/tx-prover/README.md

+The service uses the `tracing` crate for structured logging and tracing. Traces are enabled by default, and uses opentelemetry to export traces to a Jaeger instance. The traces can be visualized using the Jaeger UI, which can be used by running:
+
+```bash
+docker run -d -p4317:4317 -p16686:16686 jaegertracing/all-in-one:latest


This assumes that we have docker installed on the machine, right? If so, I would mention this.

Also, are there alternative ways to do this? We don't need to describe them but if there is a link to how to do it w/o Docker, I'd include it.

SantiagoPittella force-pushed the santiagopittella-add-tracing-to-worker-proxy branch 2 times, most recently from a06eda5 to d68170d Compare December 12, 2024 19:13

bobbinth reviewed Dec 13, 2024

View reviewed changes

bin/tx-prover/src/commands/proxy.rs Outdated Show resolved Hide resolved

feat: add tracing for proxy and worker

e09374f

SantiagoPittella force-pushed the santiagopittella-add-tracing-to-worker-proxy branch from 9ba6f50 to e09374f Compare December 13, 2024 16:55

SantiagoPittella changed the title ~~wip: add tracing to worker and proxy~~ feat: add tracing to worker and proxy Dec 13, 2024

SantiagoPittella marked this pull request as ready for review December 13, 2024 16:55

SantiagoPittella requested review from bobbinth and igamigo December 13, 2024 16:56

bobbinth reviewed Dec 13, 2024

View reviewed changes

bobbinth requested a review from Mirko-von-Leipzig December 13, 2024 20:55

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add tracing to worker and proxy #1014

feat: add tracing to worker and proxy #1014

SantiagoPittella commented Dec 11, 2024 •

edited

Loading

bobbinth commented Dec 13, 2024

SantiagoPittella commented Dec 13, 2024

bobbinth left a comment

bobbinth Dec 13, 2024

bobbinth Dec 13, 2024

bobbinth Dec 13, 2024

bobbinth Dec 13, 2024

bobbinth Dec 13, 2024

bobbinth Dec 13, 2024

bobbinth Dec 13, 2024

bobbinth Dec 13, 2024

bobbinth Dec 13, 2024

bobbinth Dec 13, 2024

		#[derive(Debug)]
		pub struct LoadBalancer(pub Arc<LoadBalancerState>);

		// The following methods are a copy of the default implementation defined in the trait, but
		// with tracing instrumentation.

		worker: None,
		parent_span: info_span!("proxy:new_request", request_id = request_id.to_string()),

		// Construct TracerProvider for OpenTelemetryLayer
		pub(crate) fn init_tracer_provider() -> TracerProvider {

		// Setup tracing subscriber
		pub(crate) fn setup_tracing(provider: TracerProvider) -> Result<(), String> {

		opentelemetry-semantic-conventions = "0.27.0"
		opentelemetry-jaeger = "0.22.0"

		@@ -114,6 +114,16 @@ The proxy service uses this health check to determine if a worker is available t

		Both the worker and the proxy will use the `info` log level by default, but it can be changed by setting the `RUST_LOG` environment variable.

		## Traces

feat: add tracing to worker and proxy #1014

Are you sure you want to change the base?

feat: add tracing to worker and proxy #1014

Conversation

SantiagoPittella commented Dec 11, 2024 • edited Loading

bobbinth commented Dec 13, 2024

SantiagoPittella commented Dec 13, 2024

bobbinth left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

SantiagoPittella commented Dec 11, 2024 •

edited

Loading