
Conversation

@AstraBert (Member)

Description

Added an integration for OpenTelemetry with a custom EventHandler that can trace all events and log their details. The interface is highly customizable yet easy to set up.
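For context, a minimal sketch of what a custom event handler looks like against llama-index's instrumentation API, which this integration builds on (the handler name and the print call are illustrative stand-ins for the PR's actual OpenTelemetry logic):

from typing import Any

import llama_index.core.instrumentation as instrument
from llama_index.core.instrumentation.event_handlers import BaseEventHandler
from llama_index.core.instrumentation.events import BaseEvent


class PrintEventHandler(BaseEventHandler):
    """Toy handler: report every instrumentation event as it fires."""

    @classmethod
    def class_name(cls) -> str:
        return "PrintEventHandler"

    def handle(self, event: BaseEvent, **kwargs: Any) -> None:
        print(event.class_name(), event.dict())


# attach to the root dispatcher so every llama-index event flows through it
instrument.get_dispatcher().add_event_handler(PrintEventHandler())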

New Package?

Did I fill in the tool.llamahub section in the pyproject.toml and provide a detailed README.md for my new integration or package?

  • Yes
  • No

Version Bump?

Did I bump the version in the pyproject.toml file of the package I am updating? (Except for the llama-index-core package)

  • Yes
  • No

Type of Change

Please delete options that are not relevant.

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to not work as expected)
  • This change requires a documentation update

How Has This Been Tested?

Your pull request will likely not be merged unless it is covered by some form of impactful unit testing.

  • I added new unit tests to cover this change
  • I believe this change is already covered by existing unit tests

Suggested Checklist:

  • I have performed a self-review of my own code
  • I have commented my code, particularly in hard-to-understand areas
  • I have made corresponding changes to the documentation
  • I have added Google Colab support for the newly added notebooks.
  • My changes generate no new warnings
  • I have added tests that prove my fix is effective or that my feature works
  • New and existing unit tests pass locally with my changes
  • I ran uv run make format; uv run make lint to appease the lint gods

@AstraBert self-assigned this on May 15, 2025
@AstraBert added the enhancement (New feature or request) label on May 15, 2025
@dosubot (bot) added the size:L (This PR changes 100-499 lines, ignoring generated files) label on May 15, 2025
@AstraBert (Member, Author)

Resolving the conflicts; this will be ready to test soon :)

@logan-markewich (Collaborator) left a comment:

This works! But, I think we can make it better 💪🏻

Right now, we create a new span for each event, so you get something like this:

[screenshot]

I think if we combine a span handler and event handler, we can get more accurate traces that better represent the execution paths inside llama-index

Thoughts? Do you think it's possible?

@AstraBert (Member, Author)

> This works! But, I think we can make it better 💪🏻
>
> Right now, we create a new span for each event, so you get something like this: [screenshot]
>
> I think if we combine a span handler and event handler, we can get more accurate traces that better represent the execution paths inside llama-index
>
> Thoughts? Do you think it's possible?

@logan-markewich I think it is possible to do this, I'll have a look later :))

@dosubot (bot) added the size:XL (This PR changes 500-999 lines, ignoring generated files) label and removed the size:L label on May 16, 2025
@AstraBert (Member, Author)

Hey @logan-markewich, I added some more logic so that spans emitted through OpenTelemetry can be traced back to one another (we have parent-child relationships now!)... Let me know if this aligns with what you had in mind :)

from llama_index.core import SimpleDirectoryReader, VectorStoreIndex

span_handler = OpenTelemetrySpanHandler()
event_handler = OpenTelemetryEventHandler(span_handler=span_handler)
@logan-markewich (Collaborator) commented on May 16, 2025:

Just for ergonomics, I wonder if we should instead define some parent instrument_otel function that automatically sets up the span and event handlers

Some helper function like:

import llama_index.core.instrumentation as instrument  # for get_dispatcher()

def instrument_otel(tracer_operator=None, dispatcher=None):
    # default to the root llama-index dispatcher
    dispatcher = dispatcher or instrument.get_dispatcher()

    # wire the event handler to the span handler so events land on spans
    span_handler = OpenTelemetrySpanHandler(tracer_operator=tracer_operator)
    event_handler = OpenTelemetryEventHandler(span_handler=span_handler)

    dispatcher.add_event_handler(event_handler)
    dispatcher.add_span_handler(span_handler)

@AstraBert (Member, Author) replied:

Yes, I am working exactly on that😁

@AstraBert (Member, Author)

Ok, with this version you can create your own LlamaIndexOpenTelemetry instrumentation class, start listening for and recording events, and then turn those events into an OpenTelemetry span whenever you want!

This is more or less the code:

from llama_index.observability.otel import LlamaIndexOpenTelemetry
from llama_index.core import SimpleDirectoryReader, VectorStoreIndex
from opentelemetry.exporter.otlp.proto.http.trace_exporter import (
    OTLPSpanExporter,
)

# define a custom span exporter
span_exporter = OTLPSpanExporter("http://0.0.0.0:4318/v1/traces")

# initialize the instrumentation object
instrumentor = LlamaIndexOpenTelemetry(
    service_name_or_resource="my.otel.service", span_exporter=span_exporter
)

if __name__ == "__main__":
    # start listening!
    instrumentor.start_registering()
    # register events
    documents = SimpleDirectoryReader(
        input_dir="./data/paul_graham/"
    ).load_data()
    index = VectorStoreIndex.from_documents(documents)
    query_engine = index.as_query_engine()
    query_result = query_engine.query("Who is Paul?")
    # turn the events into a span and stream them to OpenTelemetry
    instrumentor.to_otel_traces()
    # register another batch of events
    query_result_one = query_engine.query("What did Paul do?")
    # turn the events into another span and stream them to OpenTelemetry
    instrumentor.to_otel_traces()

And with this you will have two spans containing 15 and 13 events, respectively 😁

@logan-markewich (Collaborator)

@AstraBert this is closer! But I think we can do even better and make it even more automatic 💪🏻

Think of it this way

  1. With the span handler, we know when each span starts and stops. Inside the framework, we are constantly making new spans, tracking the parent span ID, etc. Each span is mapped to a function call
  2. With the event handler, we know each event as it happens
  3. Put those two together, and we know
    a. Which event belongs to which span
    b. The entire hierarchy of spans

I think once a user runs instrumentor.start_registering(), every span llama-index creates after that should be sent automatically over OTel.

Basically, I think you can do something like

def new_span(...):
    ...
    otel_span = self._tracer.start_span(span_name, context=ctx)
    ...

# <similar handling for exit/drop_span>

And in the event handler

from opentelemetry import trace

def handle(...):
    ...
    current_span = trace.get_current_span()
    current_span.add_event(...)
    ...

In both cases, there is a lot of metadata we can attach to the otel spans and otel events. For example on the spans, we can set attributes for the function name and args. And for events, we can attach the event data (which we already do 💪🏻)

I hope this makes sense!
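A hedged sketch putting those hooks together (the class name, the id-to-span map, and the parent_span_id argument are illustrative, not this PR's exact API):

from typing import Dict, Optional

from opentelemetry import trace


class OTelSpanHandlerSketch:
    """Illustrative only: mirror llama-index spans as live OTel spans."""

    def __init__(self, tracer: trace.Tracer) -> None:
        self._tracer = tracer
        self._otel_spans: Dict[str, trace.Span] = {}  # llama-index span id -> otel span

    def new_span(self, id_: str, parent_span_id: Optional[str] = None) -> None:
        # parent the new otel span under the otel span of the llama-index parent
        ctx = None
        if parent_span_id is not None and parent_span_id in self._otel_spans:
            ctx = trace.set_span_in_context(self._otel_spans[parent_span_id])
        # llama-index span ids typically look like "<qualified name>-<uuid>"
        name = id_.partition("-")[0] or id_
        self._otel_spans[id_] = self._tracer.start_span(name, context=ctx)

    def prepare_to_exit_span(self, id_: str) -> None:
        # end the mirrored otel span when the llama-index span exits
        span = self._otel_spans.pop(id_, None)
        if span is not None:
            span.end()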

@AstraBert (Member, Author)

Hey @logan-markewich, this should now finally work as we wanted ;)

You can try out this code, which pipes the traces into Jaeger:

from llama_index.observability.otel import LlamaIndexOpenTelemetry
from llama_index.core import SimpleDirectoryReader, VectorStoreIndex
from opentelemetry.exporter.otlp.proto.http.trace_exporter import (
    OTLPSpanExporter,
)
from llama_index.core.llms import MockLLM
from llama_index.core.embeddings import MockEmbedding
from llama_index.core import Settings

# define a custom span exporter
span_exporter = OTLPSpanExporter("http://0.0.0.0:4318/v1/traces")

# initialize the instrumentation object
instrumentor = LlamaIndexOpenTelemetry(
    service_name="my.test.service.1",
    span_exporter=span_exporter,
    debug=True,
    dispatcher_name="my.dispatcher.name",
)

if __name__ == "__main__":
    embed_model = MockEmbedding(embed_dim=256)
    llm = MockLLM()
    Settings.embed_model = embed_model
    # start listening!
    instrumentor.start_registering()
    # register events
    documents = SimpleDirectoryReader(
        input_dir="./data/paul_graham/"
    ).load_data()
    index = VectorStoreIndex.from_documents(documents)
    query_engine = index.as_query_engine(llm=llm)
    query_result = query_engine.query("Who is Paul?")
    query_result_one = query_engine.query("What did Paul do?")

span = self.all_spans[id_]
for event in self.all_events:
    span.add_event(name=event.name, attributes=event.attributes)
self.all_events.clear()
@logan-markewich (Collaborator) commented:

hmm, I guess one issue with this approach is if I do something like await asyncio.gather(index.aquery(...), index.aquery(...)) -- events from both queries would get mixed together (I think, I couldn't quite test this, see other comment 👍🏻)
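One hedged way around that interleaving, sketched with illustrative hook names (it assumes each llama-index event exposes the span_id it fired under, which BaseEvent does):

from collections import defaultdict

# illustrative: bucket events by the span they fired under, instead of
# keeping one shared list, so concurrent queries cannot mix their events
events_by_span = defaultdict(list)

def on_event(event):
    # BaseEvent carries the id of the span it was emitted under
    events_by_span[event.span_id].append(event)

def on_span_exit(id_, otel_span):
    # attach only this span's events, then drop the bucket
    for event in events_by_span.pop(id_, []):
        otel_span.add_event(name=event.class_name(), attributes={"data": event.json()})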

@logan-markewich (Collaborator) commented on May 19, 2025:

@AstraBert hmm, looks like I still get traces with only one span

Here's the code

from opentelemetry.exporter.otlp.proto.http.trace_exporter import (
    OTLPSpanExporter,
)
from llama_index.observability.otel import LlamaIndexOpenTelemetry
from llama_index.core import SimpleDirectoryReader, VectorStoreIndex

# define a custom span exporter
span_exporter = OTLPSpanExporter("http://0.0.0.0:4318/v1/traces")

# initialize the instrumentation object
instrumentor = LlamaIndexOpenTelemetry(
    service_name_or_resource="my.otel.service.testv2", 
    span_exporter=span_exporter
)

if __name__ == "__main__":
    # try it out with a simple RAG example!
    instrumentor.start_registering()

    documents = SimpleDirectoryReader(
        input_dir="./docs/docs/examples/data/paul_graham/"
    ).load_data()
    index = VectorStoreIndex.from_documents(documents)

    query_engine = index.as_query_engine()
    query_result = query_engine.query("Who is Paul?")
    print(query_result)

From this code, I think I would expect only two top-level traces

  • one trace for VectorStoreIndex.from_documents(...) -- this would contain some splitting and embedding events
  • one trace for query_engine.query() -- this would include events from retrieval, embedding, synthesis, calling the llm, etc. (quite a few here)

Where each trace has the hierarchy of spans

Here's what I currently get (let me know if you get something else locally! I hope I installed this branch properly lol)
[screenshot: traces each containing only one span]

@AstraBert (Member, Author)

So actually I got similar results, but:

  • could you please check that the spans are correctly labelled as "ok"? Because from the screenshot it doesn't seem so :(
  • most of these spans are empty, maybe it would be easier if we only registered non-empty spans

I'll work on it :)
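For illustration, one way to export only non-empty spans could be a filtering span processor (a sketch under assumptions, not what this PR ships; dropping a parent span would orphan its children in the trace view, so a real implementation needs more care):

from opentelemetry.sdk.trace import ReadableSpan
from opentelemetry.sdk.trace.export import BatchSpanProcessor


class NonEmptySpanProcessor(BatchSpanProcessor):
    """Forward only spans that actually carry events to the exporter."""

    def on_end(self, span: ReadableSpan) -> None:
        if span.events:
            super().on_end(span)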

@logan-markewich (Collaborator)

@AstraBert It works! 🎉
[screenshot]

I think the only remaining thing is that the events don't quite get attached to the proper spans?

[screenshot]

I would expect LLM.apredict to have LLMPredictStart/End, while the OpenAI.chat call would have LLMChatStart/End 🤔 Will take a peek at the code and see if there is an easy fix

@AstraBert (Member, Author)

Yeah, that's actually weird and unexpected, but I might have a hint: the way we handle events is simply to append them to a list inside the SpanHandler. When a span is preparing to exit, all the events in the list are added to that span and the span is ended; the list is then cleared and refilled by new events, which go into the following span, and so on... This might be too simplistic for the way we emit spans/events, and that could be why they end up mixed.
I can also take a deeper look into it tomorrow, if needed :))

@logan-markewich (Collaborator)

@AstraBert I fixed it! Yes, your suspicion was right! There was also a hidden issue where ChatResponse.raw was sometimes hitting a serialization error (it's apparently a well-known problem when using OpenAI's SDK; the way they use pydantic is janky).

I fixed both issues :)
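For illustration, the defensive serialization for such a fix might look like this (a hedged sketch with a hypothetical helper name, not the actual change that landed):

import json

def safe_event_payload(obj) -> str:
    """Best-effort serialization for payloads whose raw objects may not serialize."""
    try:
        return obj.json()  # pydantic models usually serialize cleanly
    except Exception:
        try:
            return json.dumps(obj, default=str)  # fall back to stringified fields
        except Exception:
            return str(obj)  # last resort: lossy, but never raises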

@logan-markewich merged commit 632b0d3 into main on May 21, 2025 (6 of 10 checks passed)
@logan-markewich deleted the clelia/opentelemetry-integration branch on May 21, 2025 at 18:19
@colca mentioned this pull request on Jun 9, 2025