Need guidance for creating message-based tracing system using Apache Pulsar #1945

devinbost · 2019-11-27T21:01:32Z

Requirement - Integrating Jaeger into Apache Pulsar for message-based tracing

Apache Pulsar is like a next-generation Kafka that supports functions.
Tracing integrations have already been built for Kafka. They have not yet been built for Pulsar.
Moreover, we wish to build a message-based implementation, which is a little different from other architectures that we've seen so far.

Problem - how to create message-based Spans

For example, let's say that we receive several different messages that all share a commonID and that represent different parts of a sequence of operations.
e.g.
message1 -> function1 -> message2 -> function2 -> message3 -> function3

In this case, we don't have the ability to make code changes to the functions, but we can access the messages in a different way. Each message contains the same commonID, which we can use to associate the messages with each other. The question is whether we can use this commonID to link the messages into a single span.

Question details

Do we need to have access to all of the messages simultaneously to put them into the same trace? That would require us to join the messages and then manually construct the spans.
It would be ideal if we could instead create the spans as the messages arrive in a way that would include them all in the same span.
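One way to picture the "create spans as messages arrive" option is to derive the trace ID deterministically from the commonID, so that spans built independently still land in the same trace. The following is only a minimal sketch under assumptions: the function names, the dict-based span records, and the choice of SHA-256 are hypothetical and not part of any Jaeger client API. Whether a given Jaeger client lets you supply your own trace ID would need to be checked separately.

```python
import hashlib
import secrets


def trace_id_from_common_id(common_id: str) -> str:
    """Map a commonID to a stable 128-bit trace ID (32 hex characters).

    Hypothetical helper: the same commonID always yields the same trace ID.
    """
    return hashlib.sha256(common_id.encode("utf-8")).hexdigest()[:32]


def new_span_id() -> str:
    """Return a random 64-bit span ID (16 hex characters)."""
    return secrets.token_hex(8)


def span_for_message(common_id: str, operation: str) -> dict:
    """Build a span record for one message without seeing any other message."""
    return {
        "trace_id": trace_id_from_common_id(common_id),
        "span_id": new_span_id(),
        "operation": operation,
    }


# Three messages arriving at different times, all carrying the same commonID.
spans = [span_for_message("order-42", f"function{i}") for i in (1, 2, 3)]
```

Note that this only groups the spans under one trace. It does not establish parent/child relationships between them, which still requires each span to know the ID of the span before it.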

The concern

My concern is that it appears that we would need to have access to their contexts in order to link them as parts of the same span.
In the book Mastering Distributed Tracing (which is a great book, @yurishkuro), it wasn't clear to me how the inject and extract methods work, or whether I can use them on messages that arrive in sequence (i.e. whether the Jaeger collector can somehow put the spans together) rather than having to join the messages first to create a span that includes all of them.
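For readers with the same question, here is a rough illustration of what inject and extract do conceptually: inject serializes a span context into a carrier (such as message properties), and extract reads it back on the other side. This is a hand-rolled sketch, not the OpenTracing or Jaeger client API. The `SpanContext` class and both functions are hypothetical; the `uber-trace-id` key and its colon-separated `trace-id:span-id:parent-span-id:flags` layout follow Jaeger's text propagation format.

```python
from dataclasses import dataclass
from typing import Optional

HEADER = "uber-trace-id"  # key used by Jaeger's text propagation format


@dataclass(frozen=True)
class SpanContext:
    """Hypothetical, simplified stand-in for a tracer's span context."""
    trace_id: str
    span_id: str
    parent_id: str = "0"
    flags: int = 1  # 1 means "sampled"


def inject(ctx: SpanContext, carrier: dict) -> None:
    """Write the context into a carrier, e.g. a message's properties map."""
    carrier[HEADER] = f"{ctx.trace_id}:{ctx.span_id}:{ctx.parent_id}:{ctx.flags:x}"


def extract(carrier: dict) -> Optional[SpanContext]:
    """Read the context back from a carrier, or return None if it is absent."""
    raw = carrier.get(HEADER)
    if raw is None:
        return None
    trace_id, span_id, parent_id, flags = raw.split(":")
    return SpanContext(trace_id, span_id, parent_id, int(flags, 16))


# Round trip: the producer injects, the consumer extracts.
properties = {}
inject(SpanContext(trace_id="abc123", span_id="def456"), properties)
restored = extract(properties)
```

The point of the sketch is that the context travels with each message, so the receiving side only needs the one message in front of it to continue the trace.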

Is this explanation of the problem sufficiently clear?

objectiser · 2019-11-28T12:01:48Z

@devinbost Sorry, it is not clear to me.

e.g. message1 -> function1 -> message2 -> function2 -> message3 -> function3

This seems to imply that the output of function1 is message2 which is then input into function2, etc. Is that correct?

If so then why would you want a single span to represent all three messages, rather than having a span per function call?

devinbost · 2019-12-02T17:59:27Z

Sorry, I misspoke. We want a single trace to represent all three messages. We want a span for each function call. In the diagram below, in the Jaeger Sink, it may be more correct to represent the parent span as a trace.

Since we're dealing with a distributed messaging system, we don't necessarily have the ability to make code changes to the functions in the system. (e.g. Imagine that you were a service provider like AWS and needed to support tracing for customers who upload Lambda functions, without requiring them to make code changes.)
It seems that the key is for us to be able to control how we construct the UUIDs that are consumed by the Jaeger collectors. Since OpenTracing doesn't define a wire protocol, we're wondering if there's another way that we can construct the spans.

Does that answer your question?

devinbost · 2019-12-02T19:51:58Z

I think this is actually related to this issue: opentracing/specification#81

objectiser · 2019-12-03T10:10:42Z

@devinbost Can't the tap function extract the span context, create the span and then inject the new context back into the message?
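A rough sketch of that extract / create span / inject flow, written without any tracing library so that it runs on its own. Everything here is an assumption for illustration: the `tap` function, the dict-shaped message with a `properties` map, and the `reported` list standing in for a reporter to the Jaeger collector are all hypothetical. The `uber-trace-id` key and its `trace-id:span-id:parent-span-id:flags` layout follow Jaeger's text propagation format.

```python
import secrets

HEADER = "uber-trace-id"  # key used by Jaeger's text propagation format


def tap(message: dict, operation: str, reported: list) -> dict:
    """Extract the incoming context, record a child span, inject the new context."""
    props = message.setdefault("properties", {})
    raw = props.get(HEADER)
    if raw:
        # Continue the existing trace; the incoming span becomes our parent.
        trace_id, parent_span_id, _, flags = raw.split(":")
    else:
        # First hop: start a new trace with no parent.
        trace_id, parent_span_id, flags = secrets.token_hex(16), "0", "1"

    span_id = secrets.token_hex(8)
    reported.append({
        "trace_id": trace_id,
        "span_id": span_id,
        "parent_span_id": parent_span_id,
        "operation": operation,
    })

    # Overwrite the context so the next hop sees this span as its parent.
    props[HEADER] = f"{trace_id}:{span_id}:{parent_span_id}:{flags}"
    return message


# message1 -> function1 -> message2 -> function2 -> message3 -> function3
reported = []
msg = {"payload": b"hello", "properties": {}}
for name in ("function1", "function2", "function3"):
    msg = tap(msg, name, reported)
```

In this sketch the functions themselves are untouched; only the tap around them reads and rewrites the message properties, and each message is handled on its own without a join.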

devinbost · 2019-12-06T03:22:51Z

We decided to do something similar. We ended up using Flink to join the messages.

devinbost closed this as completed Dec 6, 2019
