Proposal: remove Span #68

yurishkuro · 2019-11-18T05:00:21Z

This is obviously controversial, but it would be nice if we removed the concept of the Span from tracing API, and replace it with methods on the Tracer, such as:

tracer.SetSpanAttribute(ctx, key, value)
tracer.RecordSpanEvent(ctx, event)

There are two reasons for that:

When Context API is moved into an independent layer below all other layers, the way extractors might work is like this: ctx = extractor.extract(ctx, request). Because extract() cannot start a new span, it must store span context in the returned ctx. With this proposal, it will always keep the span context only in the context, never the Span.
Not giving users references to Span objects simplifies memory management. In OpenTracing it was pretty difficult to pool span objects because user can keep a reference to the span even after calling span.finish(). The tracer can keep a buffer of active spans indexed by immutable span contexts that are kept by the user code. When span is finished the tracer can reuse the span object for another trace. if a later call from user code comes with span context not in the table, trace can either ignore the call, or capture that data by other means (if the backend supports merging of spans).

The text was updated successfully, but these errors were encountered:

Oberon00 · 2019-11-18T13:23:29Z

First, I think this issue belongs more to the spec repository than the OTEP repository (even though an OTEP may come out of it).

Second, as I see it the main change here is that for each Span operation you now have the Tracer as an additional input (you could just implement Span as a reference to SpanContext and a Tracer and have the public methods of Span delegate to Tracer methods of the form you suggested). Is that correct?

This has implications for the SpanContext too, because we probably would want some additional property like int activeSpanIndex otherwise we force something like a HashMap access for each Span operation. Then the question is who has (read or write) access to this property? Should it be defined in the SDK or API? If in the SDK, we are back to #58. And so on.

jmacd · 2019-11-20T21:17:41Z

This feels like a Go-specific proposal to me, and it's one that's been discussed in the past I'm sure many times. The question is, even if every operation on a Span is equivalent to some Tracer operation on a context, should we still have a Span type? I can be convinced not, in this case, but there's a specific line in the Golang Context documentation that causes some concern:

Do not store Contexts inside a struct type; instead, pass a Context explicitly to each function that needs it.

I personally disagree with the advice not to store Contexts, but as a compromise to some of my co-workers, whenever I've done so I embed it in another struct and clearly document the behavior of the stored context, i.e., never store a bare Context, only store contexts in derived Context structures with a clear intent. Then, we can remove the Span type.

yurishkuro · 2019-11-21T06:12:42Z

@Oberon00

you could just implement Span as a reference to SpanContext

Yes, but that's the point - if it can be done, why have the Span object in the API?

I agree that we'd need to revisit #58.

@jmacd I didn't follow your train of though. How is this related to storing Context anywhere? Context is a propagation mechanism, which is independent of Span vs. SpanContext duality. The code examples I had are Go specific, but in another language you can just replace ctx with spanCtx.

carlosalberto · 2019-11-21T15:43:47Z

I remember that we mentioned back in OpenTracing having an abstraction layer (working on top of the OT api) that effectively would hide these details, so you would always work with the current Span, as depicted in your example. I feel this approach would be great, as it would simplify things for 'end users', but wouldn't probably help with optimizations like the Span pooling you mentioned ;)

tedsuo · 2019-11-25T06:48:12Z

@yurishkuro Josh means that some gophers refuse to store a reference to a Context. So if you take away their Span handle, they have nothing to store but the Context, and they are sad. This comes from a literal reading of the following advice from https://golang.org/pkg/context/

Do not store Contexts inside a struct type; instead, pass a Context explicitly to each function that needs it. The Context should be the first parameter, typically named ctx

I won't go into it here, but there are enough edge cases that a literal interpretation of this advice is silly. Occasionally, you have to store a context for use later.

@carlosalberto this is the old example from OpenTracing Java. It's easy to see how this change would make the instrumentation more readable, in just about every language: https://docs.google.com/presentation/d/1FKnzs19iuJyDIy0cHffgC8H1B4y_U4sXDw7N_YswnQw/edit

jmacd · 2019-11-26T18:39:42Z

I fully support this proposal, also I'm prepared to stand before any and all gophers and state that storing Contexts is the right thing to do. I appreciated the related discussion here: https://github.com/open-telemetry/oteps/pull/66/files#r347198758

bogdandrutu · 2019-11-26T18:46:34Z

There may be cases where it is necessary to have a Span object, for example an interceptor pattern like this:

Interceptor {
  onStart(Headers);

  Context beforeExecutedHandler(Context);

  onEnd();
}

The Interceptor implementation needs to create a Span in the onStart and install the Span in the Context later. With the proposed API this is not possible because it is not easy to "merge" contexts in the beforeExecutedHandler.

So I would definitely not rush a decision like this before v1.

lmolkova · 2019-11-26T18:56:09Z

Is this proposal specific for Go and if not, how it could work for other languages?

It seems to imply that all spans have to be active and that language allows to rely on current spans in all cases. I don't think this is the case for Java or C++. Even for languages with full async support, it's not always possible to use the current span reliably.

tsloughter · 2019-11-26T19:09:56Z

I'm a +1 on this. It at least applies to Erlang/Elixir, so not only Go.

It is also basically how the Erlang impl already works. We provide an additional interface that sits on top of the official API, a module named otel.

We also don't allow modifications to a span after it has been finished. A finished span is moved to a new buffer and out of the active spans buffer. I'm guessing this is related to what @yurishkuro is talking about with simplifying memory management.

Actually, yes:

The tracer can keep a buffer of active spans indexed by immutable span contexts that are kept by the user code.

This is exactly how the Erlang impl works. And when end_span is called the span is removed from the active spans and moved to a separate buffer that is used by the batch processor (assuming the batch processor has been added as a span processor).

jmacd · 2019-12-05T23:55:51Z

There was some discussion about this in a call about Context Propagation today. I would be satisfied with a specification update stating that Span objects are logical constructs which refer to a span context. The notion of "CurrentSpan" returning a span is really just semantic sugar for a type which encapsulates a span context. I believe this addresses your concern, without needing to actually remove Span from the specification. It would allow a valid implementation to be stateless, it allows a meaningful interpretation for span events that happen after Spans finish, and so on.

tedsuo · 2019-12-06T18:38:10Z

@lmolkova I believe it implies that we can simply when adding a context object to every language. Even if this context object is automatically propagated most of the time, there are cases where the context object must also be instantiated. So, the proposal is that all operations work directly on this context object, rather than first extracting a span from the context.

A pseudocode example, with no automated context propagation:

// operate on the context object directly
SetSpanAttribute(context, "foo", "bar")

// rather than first grab a span handle
span = GetSpan(context)
span.SetAttribute("foo", "bar")

When automated context propagation is available, then the context object is implied.

// use the context directly
SetCurrentSpanAttribute("foo", "bar")

// rather than grab a span handle
span = GetCurrentSpan()
span.SetAttribute("foo", "bar")

Is that a helpful description? I'm not proposing it would look exactly like this in C#, but I do think the result would be a more declarative API, that hides the span object as an implementation detail. I think this leaves more room for improvement in the future, as the API now exposes fewer details. I believe .Net uses activities as the equivalent of "context" in this case.

BTW, it is this OTEP (#66) proposing a separate context layer which is bringing this discussion up. I agree with @bogdandrutu that we have to confirm that edge cases like moving contexts still work. I actually do not think it is complicated to move/merge a span from one context to another, but we would need to create tests and examples for all of these cases first. So I would not want to rush a big change. Perhaps in some cases, this is just sugar to make the API more declarative and take up fewer lines of code.

lmolkova · 2019-12-06T18:52:00Z

So my understanding is that this simply eliminates Span interface:

we have some kind of handler for the span.
we keep all the Span interface APIs to enrich spans, but expose them through different class (Tracer or whatever)
We keep readable Span in some shape for exporters

With .NET team we are entertaining the idea of providing all Span capabilities in the platform (which simplifies dependency management enormously). So this is definitely an interesting proposal that helps with this idea.

At the same time, it seems we need something in the SDK to store all active spans (which is not trivial to do right) AND we still ask users to carry handler around AND we keep all the same methods on the different interface.

If we want to do this in v1, I hope to see some real benefits for this choice. I believe there are some, but I'm not sure what is ROI. But after v1, Span interface would need to stay for a while and it might not make sense to do anything about it.

tedsuo · 2019-12-06T19:00:09Z

@lmolkova yes I really like the way that .NET is going with this, I hope other language runtimes eventually catch up. :)

I'm not convinced that this would be a major change to the current implementations in other languages, even though it changes the API significantly.

For each language, it would need to be done as a very complete PR, with the edge cases handled and benefit shown; and not dribbled out. I would not want to block the main line of SDK work with this.

jmacd · 2019-12-06T20:53:35Z

I will vote to keep the Span type but change the specification for Span to define it as semantic sugar, making it in every way equivalent to the proposal in this issue but without removing the Span interface. The original Go "streaming" SDK was implemented this way. A Span was effectively just a context and all operations recorded an event with the context and operation details.

https://github.com/open-telemetry/opentelemetry-go/blob/4172bdfbf9def7962191c955228b7f9e4561466c/experimental/streaming/sdk/span.go#L29

IOW I believe a valid SDK can already avoid the memory management problem -- there is no requirement for an SDK to maintain state in memory to implement a Span. I still believe the specification should be firmed up to say that Span is semantically equivalent to a context.

jmacd · 2019-12-09T22:05:08Z

I believe this is a very important topic. Please see the related/opposed issue #73.

I believe it's an ergonomic loss to remove the Span API. One of the points of confusion that could arise, were Span to be removed, is that it becomes easier to mis-use the API by recording a single Span to multiple Tracers (i.e., you could use one Tracer for the Start event, one for the End event).

lmolkova · 2019-12-09T22:09:55Z

basically it requires to expose Span APIs on the span context/handler, not the tracer.

There is already enough confusion around current span on the tracer open-telemetry/opentelemetry-specification#108 (is it current for this tracer or current globally).

tsloughter · 2019-12-14T16:41:42Z

There is already enough confusion around current span on the tracer open-telemetry/opentelemetry-specification#108 (is it current for this tracer or current globally).

And with this change it is current for the current context, right?

I'd never heard of current span being tied to a tracer or globally, this may be why I've always been so confused in discussions about current/active spans?

I wanted to reiterate that I'm a big +1 on this and really hope the spec goes this way. It is already basically how the Erlang/Elixir implementation works, only basically because to keep the API's for tracer and span modules roughly the same as the spec we have an additional module that acts as a layer above them to combine their APIs.

Oberon00 · 2020-03-18T17:02:05Z

I think this proposal should receive some serious consideration again now that OTEP 66 (Context OTEP) is merged. Although now, Span should arguably not be replaced by SpanContext (we can remove that too, #73), but OTEP 66's Context + operations on the Tracer.

Oberon00 · 2020-09-04T07:13:30Z

While we would have had the chance to do this with OTEP66, I think it's too late now. Can we close this?

yurishkuro mentioned this issue Nov 18, 2019

Proposal: Separate Layer for Context Propagation #66

Merged

jmacd mentioned this issue Dec 7, 2019

Proposal: Span Context == Span #73

Closed

jmacd mentioned this issue Dec 11, 2019

WIP: Context Propagation @ OTEP 66 open-telemetry/opentelemetry-go#381

Closed

mwear mentioned this issue Jan 9, 2020

Context prop combined open-telemetry/opentelemetry-ruby#147

Merged

jmacd mentioned this issue Jan 11, 2020

Proposal: Resource Scope and Namespace API #78

Closed

mwear mentioned this issue Feb 29, 2020

Span vs Context open-telemetry/opentelemetry-ruby#201

Closed

jmacd mentioned this issue Mar 18, 2020

add finishing the currently active span to tracer operations open-telemetry/opentelemetry-specification#516

Closed

tsloughter mentioned this issue Mar 18, 2020

distinguish starting an active span and creating an inactive span open-telemetry/opentelemetry-specification#485

Closed

Oberon00 mentioned this issue Mar 18, 2020

Consider using Context as the unique way to specify parenthood open-telemetry/opentelemetry-specification#510

Closed

Oberon00 mentioned this issue Mar 29, 2020

How should other forms of trace/span ids be handled? open-telemetry/opentelemetry-specification#525

Open

Oberon00 mentioned this issue Aug 6, 2020

W3C TraceState underspecified open-telemetry/opentelemetry-specification#759

Closed

Oberon00 mentioned this issue Sep 4, 2020

Spans's parent must be passed as Context instead of Span(Context)s. open-telemetry/opentelemetry-specification#875

Merged

mwear mentioned this issue Sep 28, 2020

Context interaction in different packages like trace/baggage is unclear open-telemetry/opentelemetry-specification#1019

Closed

jmacd mentioned this issue Sep 28, 2020

Allow SpanContext to be exposed directly on Span. open-telemetry/opentelemetry-specification#1022

Closed

andrewhsu added Future Discussions Things to look at for future improvements release:after-ga Not required before GA release, and not going to work on before GA labels Oct 23, 2020

Oberon00 closed this as completed May 20, 2022

Oberon00 closed this as not planned Won't fix, can't repro, duplicate, stale May 20, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Proposal: remove Span #68

Proposal: remove Span #68

yurishkuro commented Nov 18, 2019

Oberon00 commented Nov 18, 2019

jmacd commented Nov 20, 2019

yurishkuro commented Nov 21, 2019

carlosalberto commented Nov 21, 2019

tedsuo commented Nov 25, 2019

jmacd commented Nov 26, 2019

bogdandrutu commented Nov 26, 2019

lmolkova commented Nov 26, 2019

tsloughter commented Nov 26, 2019

jmacd commented Dec 5, 2019

tedsuo commented Dec 6, 2019 •

edited

Loading

lmolkova commented Dec 6, 2019

tedsuo commented Dec 6, 2019

jmacd commented Dec 6, 2019

jmacd commented Dec 9, 2019

lmolkova commented Dec 9, 2019

tsloughter commented Dec 14, 2019

Oberon00 commented Mar 18, 2020 •

edited

Loading

Oberon00 commented Sep 4, 2020

Proposal: remove Span #68

Proposal: remove Span #68

Comments

yurishkuro commented Nov 18, 2019

Oberon00 commented Nov 18, 2019

jmacd commented Nov 20, 2019

yurishkuro commented Nov 21, 2019

carlosalberto commented Nov 21, 2019

tedsuo commented Nov 25, 2019

jmacd commented Nov 26, 2019

bogdandrutu commented Nov 26, 2019

lmolkova commented Nov 26, 2019

tsloughter commented Nov 26, 2019

jmacd commented Dec 5, 2019

tedsuo commented Dec 6, 2019 • edited Loading

lmolkova commented Dec 6, 2019

tedsuo commented Dec 6, 2019

jmacd commented Dec 6, 2019

jmacd commented Dec 9, 2019

lmolkova commented Dec 9, 2019

tsloughter commented Dec 14, 2019

Oberon00 commented Mar 18, 2020 • edited Loading

Oberon00 commented Sep 4, 2020

tedsuo commented Dec 6, 2019 •

edited

Loading

Oberon00 commented Mar 18, 2020 •

edited

Loading