distinguish starting an active span and creating an inactive span #485

tsloughter · 2020-02-22T15:59:02Z

Includes a note about async callbacks which are a common use case
for creating an inactive span.

Closes #469

Oberon00 · 2020-02-22T16:31:08Z

Are you suggesting to introduce the possibility of creating unstarted Spans? I'm strongly against that tbh, I think it makes many things more complicated.

specification/api-tracing.md

yurishkuro · 2020-02-22T16:34:13Z

specification/api-tracing.md

@@ -122,24 +122,29 @@ mechanism, for instance the `ServiceLoader` class in Java.

 The `Tracer` MUST provide functions to:

- Create a new `Span`
+- Start a new active `Span`
+- Create a new inactive `Span`


"inactive" reads very confusing. I would separate "starting a span" and "making a span active" (even though both can be done by a single function).

yurishkuro · 2020-02-22T16:37:15Z

specification/api-tracing.md


 The `Tracer` SHOULD provide methods to:

 - Get the currently active `Span`
 - Make a given `Span` as active

 The `Tracer` MUST internally leverage the `Context` in order to get and set the
-current `Span` state and how `Span`s are passed across process boundaries.
+currently active `Span` and how `Span`s are passed across process boundaries. A
+`Span` that is created, as opposed to started, is not tracked in the `Context`


created, as opposed to started

-1, this sounds like a whole new dimension of behavior. If you want to discuss this, please create an OTEP.

It isn't. The created span still has a start timestamp.

specification/api-tracing.md

yurishkuro · 2020-02-22T16:40:01Z

I would recommend to do away with a duality of "creating" vs. "starting" a span. They are the same thing in all prior art, and I would prefer we use a single verb "starting a span" to avoid confusion.

tsloughter · 2020-02-22T16:47:46Z

@yurishkuro currently we have started spans and active spans (which are started). I'm attempting to make a clear distinction between functions to create inactive spans (but have start timestamps) and starting active spans.

Right now the API only says you can start an inactive span.

tsloughter · 2020-02-22T16:49:18Z

@Oberon00 no, created spans still have start timestamps, they simply aren't made active. This is the behaviour currently defined in the API. The change is to specify that start_span function creates a span and makes it active. Which I believe is the default behaviour in actual implementations? So it differs between how the API is written and how implementations I've seen work.

tsloughter · 2020-02-22T16:57:08Z

To maybe make it more clear why I'm using start and create.

The existing APIs and SDKs use start to create an active span. In Go Start results in the span being set in the context https://github.com/open-telemetry/opentelemetry-go/blob/master/sdk/trace/tracer.go#L66

The API specifies it should not be set active in the context. I wanted to keep the naming the same as what is done today (Start) while adding a new function that acts like the API specifies that "start" is supposed to work (but doesn't in implementations as far as I can tell) so added create.

yurishkuro · 2020-02-22T17:45:22Z

I don't think it's a good idea to be using 3 words (start, create, active) to describe 2 orthogonal behaviors (starting a span and making it active). In OT-Java we had to roll-back "making span active" from start_span function, because it was causing issues with try/catch.

Have we considered NOT having tracer make the span active at all? It's merely a convenience, which I personally don't believe is worth it, and it works on shielding the developer from understanding that there is such a thing as context propagation, which has nothing to do with managing spans. For example, in Go:

ctx := tracer.StartSpan(ctx, ...)

is just a convenience over more clear separation of:

span := tracer.StartSpan(...)
ctx := otel.ActiveSpan(ctx, span)

One important benefit of the latter is that starting a span is an implementation-specific action, thus it requires a tracer. Activating the span is NOT implementation specific, all tracers will share the propagation mechanism. The one-liner can always be implemented as a helper function:

ctx := otel.StartSpan(ctx, tracer, ...)

tsloughter · 2020-02-22T17:56:47Z

I think it would be very confusing to force devs to start having to manage the context explicitly.

Also, the API does specify that it is not made active, so what you are suggesting is the API spec currently.

If the Tracer is responsible for tracking the active span in the context then I think it needs to handle it when the user starts a span unless they specify otherwise, and based on existing implementations of both OpenTelemetry and OpenTracing/Census this is clearly what the user expects.

If the Tracer were not responsible for this then I could see an argument that some other module handles the context, like in your example where the Tracer returns the span and then otel is used to activate it, unrelated to the Tracer.

But that is a much larger change. Today the API spec is confusing to me because it both has the Tracer as responsible for the span being activated and doesn't offer a way to start+activate -- although the implementations do just that for Start.

I think we need both and would suggest, since the use of start and create clearly causes confusion over whether create sets a start time:

start_active: Tracer operation to start a new span and update the active span it is tracking.
start: Tracer operation to start a span but not touch the tracked active span.

tsloughter · 2020-02-22T21:08:28Z

I didn't want to touch it in this PR to keep it as small and focused as possible but a related confusion I have is that the spec specifies:

When an active Span is made inactive, the previously-active Span SHOULD be made active.

I agree with this API but it doesn't seem to match that End is now moved to the Span, and I see many implementations do use an End function on the Span.

If the Tracer is responsible for tracking the active Span and for making the previously-active Span active again when deactivating the current active span it seems clear that End should be a Tracer operation.

tsloughter · 2020-02-23T15:01:53Z

otep-66 states:

StartSpan(context, options) -> context When a span is started, a new context is returned, with the new span set as the current span.

It calls this "not final", so maybe there was other conversation on why this shouldn't be the case? otep-66 also uses Tracer::EndSpan to end and deactivate the current span, which appears to only partially be defined in the API spec (it says the tracer should set the previously active span as active again when deactivating a span, but I don't know how that can work if ending a span is an operation done directly on a Span instead of through Tracer).

specification/api-tracing.md

arminru · 2020-02-24T14:22:59Z

specification/api-tracing.md


 The `Tracer` SHOULD provide methods to:

 - Get the currently active `Span`
- Make a given `Span` as active
+- Make a given `Span` active


If providing a method to start an inactive span is a MUST, this should also be one, right?

Hm, good question. I suppose so.

Me too! So please add it to the MUST section then.

specification/api-tracing.md

Oberon00 · 2020-02-24T14:51:10Z

A fundamental question: After OTEP 66, is there even such a thing as an inactive span now? See https://github.com/open-telemetry/oteps/blob/49316bc20167a0a6e2214bbf5806e0e7d763b2d0/text/0066-separate-context-propagation.md#observability-api

StartSpan(context, options) -> context When a span is started, a new context is returned, with the new span set as the current span.

It seems the description of StartSpan in the spec is simply wrong now.

tsloughter · 2020-02-24T14:55:46Z

Even with otep66 there has to be a concept of an inactive span, it is just to useful and in use. But I do think that the default action of a tracer's startspan should make the span active, like the otep has.

tsloughter · 2020-02-24T14:58:09Z

This might be better discussed tomorrow in the morning meeting. Then I can rework the PR based on the outcome there and discussion can pick up again on the PR.

bogdandrutu · 2020-03-10T15:13:56Z

@open-telemetry/specs-approvers please review this. We need to make progress.

arminru

Please update the PR title so that it matches the content (starting an active vs. inactive span) since you removed the separate create without starting option in the meantime.

arminru · 2020-03-11T17:18:25Z

specification/api-tracing.md

@@ -120,31 +120,45 @@ mechanism, for instance the `ServiceLoader` class in Java.

 ### Tracer operations

+The currently active `Span` is the one that is tracked in the current `Context`
+by the `Tracer`. An inactive `Span` is not currently tracked in any `Context`.


An inactive span can still be tracked in a non-current context, right?

no, it would make it active (unless you track via some private mechanism, which would be out of scope here)

Actually in Java (and most likely in Python) you can have a Span in a Context that is not the active one:

// ctx contains span1 but ctx itself is not active. Context ctx = TracingContextUtils.withSpan(span1, Context.current()); // *Now* it is. try (Scope scope = ContextUtils.withScopedContext(ctx)) { }

Because of this I suggest changing the second sentence to: "A Span is considered inactive when it is not tracked in the currently active Context." or something along that.

I'd consider that span to be active for that context, but the context itself is not active.

We could have a different term for it though, "current span" to replace how I've been using "active span". So a span can be "current" to a context without being active because the context is not active.

I suggest, for the sake of moving on with this PR, to not change the wording between active/current, and use whatever is used in the given section (we could do a follow up this in another PR). Likewise, let's use active/current only when the Span is associated with the current Context (else, you'd say its associated with a given Context).

yurishkuro · 2020-03-11T17:35:44Z

specification/api-tracing.md

@@ -120,31 +120,45 @@ mechanism, for instance the `ServiceLoader` class in Java.

 ### Tracer operations

+The currently active `Span` is the one that is tracked in the current `Context`
+by the `Tracer`. An inactive `Span` is not currently tracked in any `Context`.


no, it would make it active (unless you track via some private mechanism, which would be out of scope here)

yurishkuro · 2020-03-11T17:38:45Z

specification/api-tracing.md

 The `Tracer` MUST provide functions to:

- Create a new `Span`
+- Start a new active `Span`


This sounds like a requirement on the API, but, for example, in OpenTracing we explicitly removed such ability, in favor of starting inactive span and them manually activating it via Scope. What are the implications of this change?

NB: this kind of change sounds to me like it should go through OTEP with some code examples.

IIRC there was agreement to clearly separate these two operations ("start active Span" and "start Span"), but "start active Span" MUST be optional (in Java, as you mentioned, we can't implement this one correctly).

Let's go with separate operations as decided in the SIG call, but I'd suggest changing this to:

Start a new Span

Start a new Span as the current instance (optional operation).

2 is optional as, for example, Go already handles this explicitly in 1) depending on any specified Context, and also because in Java we won't implement it.

@carlosalberto is this what you are thinking:

The `Tracer` MUST provide functions to: - Start a new active `Span` The `Tracer` SHOULD provide methods to: - Get the currently active `Span` - Start a new inactive `Span` - Make a given `Span` active

Or wait, you wanted the start inactive span to be the MUST?

OK, switched it in the PR.

yurishkuro · 2020-03-11T17:39:52Z

specification/api-tracing.md


 The `Tracer` SHOULD provide methods to:

 - Get the currently active `Span`
- Make a given `Span` as active


it looks odd that making span active is in MUST section, but getting active span is in SHOULD. I would think they should go in the same category.

I think #527 might actually hit this one, so I'd suggest we fix that based on that one ;)

yurishkuro · 2020-03-11T17:41:00Z

specification/api-tracing.md

-current `Span` state and how `Span`s are passed across process boundaries.
+currently active `Span` and how `Span`s are passed across process boundaries. A
+`Span` that is started but inactive is not tracked in the `Context` by the
+`Tracer`, but it still MUST have a start timestamp set at the time of creation.


why is the point about the timestamp bundled into the paragraph about context?

Same feeling here, I think we don't need to mention timestamp. Having two separated operations for start-span and start-span-as-current should make this clear enough.

Ok, I was thinking that "inactive" may give the impression to someone that it could not have a timestamp. But since it is still called start and this is defined elsewhere it can be removed.

specification/api-tracing.md

yurishkuro · 2020-03-11T17:42:59Z

specification/api-tracing.md

-selected, or the current active `Span` is invalid.
+SHOULD create each new `Span` as a child of its active `Span` unless an explicit
+parent is provided or the option to create a span without a parent is selected,
+or the current active `Span` is invalid. Last the `Tracer` would check if


"or the current active Span is invalid" doesn't seem to belong here (can be in #determining-the-parent-span-from-a-context).

yurishkuro · 2020-03-11T17:43:35Z

specification/api-tracing.md

+SHOULD create each new `Span` as a child of its active `Span` unless an explicit
+parent is provided or the option to create a span without a parent is selected,
+or the current active `Span` is invalid. Last the `Tracer` would check if
+`Context` has an extracted `SpanContext`. See [Determining the Parent Span from


Last the Tracer would check if Context has an extracted SpanContext

I don't follow why this is here.

The paragraph is going through the ways a span's parent is set when a span is created. If that part is removed I think the whole thing should be removed and simply link to "Determining the Parent Span".

specification/api-tracing.md

yurishkuro · 2020-03-11T17:48:00Z

specification/api-tracing.md

-as a separate operation.
-
-The API MUST accept the following parameters:
+The API functions for starting a `Span` MUST accept the following parameters:

 - The span name. This is a required parameter.
 - The parent `Span` or a `Context` containing a parent `Span` or `SpanContext`,


this bullet is confusing. I think it's mutually exclusive to accept parent Span OR an indicator to create a new root.

That is outside the changes of this PR. Though I agree there doesn't appear to be a reason to have to explicitly state if a span is a root span.

I think this will also be covered/clarified/improved by #510 so I'd suggest we work on this part there, so we can move on with this PR.

Else, track this clarification in its own issue ;)

carlosalberto · 2020-03-11T18:15:21Z

specification/api-tracing.md

-SHOULD create each new `Span` as a child of its active `Span` unless an
-explicit parent is provided or the option to create a span without a parent is
-selected, or the current active `Span` is invalid.
+SHOULD create each new `Span` as a child of its active `Span` unless an explicit


"a child of THE active" Span? Otherwise, it sounds like each Tracer can have its own active Span, which just adds excessive (and not needed) complexity.

jmacd · 2020-03-18T16:34:33Z

As stated in #516, I see "starting an inactive span" as the Tracer's responsibility, and "making a span active" the context library's responsibility.

tsloughter · 2020-03-18T16:52:13Z

I guess I go the other way since I think Span should go away and the Tracer take the context to do everything, like in open-telemetry/oteps#68 :)

Includes a note about async callbacks which are a common use case for creating an inactive span.

Co-Authored-By: Armin Ruech <[email protected]>

Co-Authored-By: Yuri Shkuro <[email protected]>

yurishkuro · 2020-04-28T16:50:17Z

specification/trace/api.md

 - Get the currently active `Span`
- Make a given `Span` as active
+- Make a given `Span` active


I think there is still some vagueness here. I have users at Uber who are absolutely opposed to thread-local based context propagation, and instead insist that in their services all context propagation is done explicitly, similar to Go. That means that while they may still use the same Context implementation, the three SHOULD operations here are against the explicitly passed Context object, rather than building an assumption that a thread-local mechanism is being used and understood by the tracer. In other words, the "Get the currently active Span" is equivalent to this function in OpenTracing Go:

func SpanFromContext(ctx context.Context) Span { val := ctx.Value(activeSpanKey) if sp, ok := val.(Span); ok { return sp } return nil }

Which opens up another trail of questions: why does this need to be a functionality of the Tracer? Tracer is an API that can be implemented differently. Accessing spans in the Context is not related to specific tracer implementation, it's a common behavior at the API level.

I consider it part of the Tracer functionality because the Tracer is what knows where its data is stored in the context.

In your example from OpenTracing that would be activeSpanKey.

In OpenTracing it's not part of Tracer interface, it's a shared static function. If it was part of Tracer, then every tracer implementation would have to implement it identically.

Assuming Tracers from multiple Tracer Providers in a single application are meant to be able to read traces from the same contexts, yes they'd have to use the same key.

Actually, wouldn't we want tracers from different providers to be able to act separate and not clash?

Observe this will probably go away with #527 - Maybe hold on till that one is resolved?

It doesn't really change this PR much since it is mainly about starting the spans and #527 isn't. Having re-read it I'm not opposed to #527 anymore either, I had thought it also moved starting a span to "utils".

The change in this PR is adding an additional function for starting a span, the "make span active" already exists in the spec and is not changed by this PR. So I think this should be considered separate.

And don't think the function to make a span active should hold up this PR since it already exists in the spec and is not being changed here.

tsloughter · 2020-05-05T21:51:39Z

I saw that Python has gone with start_as_current_span: https://github.com/open-telemetry/opentelemetry-python/blob/090b6640495bcb994a9467ab23e2b1abe7fc4af9/opentelemetry-api/src/opentelemetry/trace/__init__.py#L563

tsloughter · 2020-07-17T19:23:37Z

So what is going on with this PR?

Right now the spec says the API call must only start inactive spans and I don't think that is what implementations are doing -- instead they are providing both like in this PR.

I'm going to update it to resolve the conflicts in a bit, hopefully it can get merged after that?

tsloughter · 2020-07-18T00:11:19Z

Nevermind. It now says:

Span creation MUST NOT set the newly created Span as the currently
active Span by default, but this functionality MAY be offered additionally
as a separate operation.

I think it should be the other way around (as in start_span set it active and have an alternative function start_inactive_span) but this should be good enough.

tsloughter requested review from arminru, bogdandrutu, c24t, carlosalberto, iredelmeier, jmacd, reyang, SergeyKanzhelev, tedsuo, tigrannajaryan and yurishkuro as code owners February 22, 2020 15:59

tsloughter force-pushed the start-active-span branch 2 times, most recently from 76a8524 to 0f0e9ad Compare February 22, 2020 16:02

yurishkuro reviewed Feb 22, 2020

View reviewed changes

specification/api-tracing.md Outdated Show resolved Hide resolved

yurishkuro reviewed Feb 22, 2020

View reviewed changes

specification/api-tracing.md Outdated Show resolved Hide resolved

arminru reviewed Feb 24, 2020

View reviewed changes

tsloughter force-pushed the start-active-span branch from 913aee4 to 5b96bda Compare March 10, 2020 15:22

jmacd approved these changes Mar 10, 2020

View reviewed changes

arminru approved these changes Mar 11, 2020

View reviewed changes

yurishkuro reviewed Mar 11, 2020

View reviewed changes

carlosalberto reviewed Mar 11, 2020

View reviewed changes

yurishkuro mentioned this pull request Mar 11, 2020

Rename span.end() to span.finish() #514

Closed

tsloughter mentioned this pull request Mar 14, 2020

add finishing the currently active span to tracer operations #516

Closed

tsloughter and others added 8 commits April 28, 2020 07:39

distinguish starting an active span and creating an inactive span

240aeb7

Includes a note about async callbacks which are a common use case for creating an inactive span.

remove use of 'created span' and simply call it 'start inactive'

bb813e8

Update specification/api-tracing.md

f026458

Co-Authored-By: Armin Ruech <[email protected]>

remove talk of creating a span, only start which is active or inactive

a48c99a

move paragraph on use of async callbacks to Tracer operations

91a7830

move 'make given span active' operation to a MUST

c3e88b2

Update specification/api-tracing.md

d436555

Co-Authored-By: Yuri Shkuro <[email protected]>

update based on yuri and carlos suggestions

82cef8c

tsloughter force-pushed the start-active-span branch from 21d8da6 to 82cef8c Compare April 28, 2020 13:53

make start inactive span the required operation

bec2d01

yurishkuro reviewed Apr 28, 2020

View reviewed changes

trask mentioned this pull request May 8, 2020

More user-friendly Context util classes open-telemetry/opentelemetry-java#1189

Closed

carlosalberto added spec:trace Related to the specification/trace directory area:api Cross language API specification issue labels Jun 12, 2020

reyang added the release:required-for-ga Must be resolved before GA release, or nice to have before GA label Jul 10, 2020

tsloughter closed this Jul 18, 2020

tsloughter mentioned this pull request Aug 25, 2020

Clarifying definition of the concepts of "started" vs "active" needed #469

Closed

distinguish starting an active span and creating an inactive span #485

distinguish starting an active span and creating an inactive span #485

Conversation

tsloughter commented Feb 22, 2020

Oberon00 commented Feb 22, 2020 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

yurishkuro commented Feb 22, 2020

tsloughter commented Feb 22, 2020

tsloughter commented Feb 22, 2020

tsloughter commented Feb 22, 2020

yurishkuro commented Feb 22, 2020 • edited Loading

tsloughter commented Feb 22, 2020

tsloughter commented Feb 22, 2020

tsloughter commented Feb 23, 2020 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Oberon00 commented Feb 24, 2020

tsloughter commented Feb 24, 2020

tsloughter commented Feb 24, 2020

bogdandrutu commented Mar 10, 2020

arminru left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jmacd commented Mar 18, 2020

tsloughter commented Mar 18, 2020

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

tsloughter commented May 5, 2020

tsloughter commented Jul 17, 2020

tsloughter commented Jul 18, 2020

Oberon00 commented Feb 22, 2020 •

edited

Loading

yurishkuro commented Feb 22, 2020 •

edited

Loading

tsloughter commented Feb 23, 2020 •

edited

Loading