
🐛 Bug Report: Trace-ID Injection Missing Between OpenAI and vLLM Spans #2291

Open
ronensc opened this issue Nov 13, 2024 · 3 comments
Labels
bug Something isn't working

Comments

ronensc (Contributor) commented Nov 13, 2024

Which component is this bug for?

OpenAI Instrumentation

📜 Description

This bug is related to #1983. Similar to what's described there, the OpenAIInstrumentor doesn't propagate the trace context of its exported span with the outgoing LLM request, so the server-side span cannot be linked to the client's trace.

A fix here would also benefit other frameworks built on top of the OpenAI client library, such as CrewAI.
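
Propagating the context here means sending the W3C Trace Context header with the outgoing HTTP request so the server can parent its span to the client's span. For reference, the header carries the trace ID, parent span ID, and flags (example value from the W3C spec):

traceparent: 00-4bf92f3577b34da6a3ce929d0e0e4736-00f067aa0ba902b7-01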

👟 Reproduction steps

  1. Run Jaeger as the trace collector:
docker run --rm --name jaeger \
    -e COLLECTOR_ZIPKIN_HOST_PORT=:9411 \
    -p 6831:6831/udp \
    -p 6832:6832/udp \
    -p 5778:5778 \
    -p 16686:16686 \
    -p 4317:4317 \
    -p 4318:4318 \
    -p 14250:14250 \
    -p 14268:14268 \
    -p 14269:14269 \
    -p 9411:9411 \
    jaegertracing/all-in-one:1.57
  2. Run vLLM:
export OTEL_SERVICE_NAME="vllm-server"
export OTEL_EXPORTER_OTLP_TRACES_INSECURE=true
vllm serve meta-llama/Llama-3.2-1B-Instruct --otlp-traces-endpoint "grpc://localhost:4317"
  3. Run the OpenAI client with OpenAIInstrumentor:
from opentelemetry.exporter.otlp.proto.grpc.trace_exporter import OTLPSpanExporter
from opentelemetry.sdk.trace import TracerProvider
from opentelemetry.sdk.resources import Resource
from opentelemetry.sdk.trace.export import BatchSpanProcessor, ConsoleSpanExporter
from opentelemetry.trace import set_tracer_provider


trace_provider = TracerProvider(resource=Resource(attributes={"service.name": "OpenAI-client"}))
trace_provider.add_span_processor(BatchSpanProcessor(OTLPSpanExporter(endpoint="grpc://localhost:4317", insecure=True)))
trace_provider.add_span_processor(BatchSpanProcessor(ConsoleSpanExporter()))
set_tracer_provider(trace_provider)


from opentelemetry.instrumentation.openai import OpenAIInstrumentor
OpenAIInstrumentor().instrument()

from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:8000/v1",
    api_key="EMPTY",
)

chat_completion = client.chat.completions.create(
    messages=[
        {
            "role": "user",
            "content": "Say this is a test",
        }
    ],
    model="meta-llama/Llama-3.2-1B-Instruct",
)
print(chat_completion)
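
As a point of comparison, manually injecting the active span's context into the request headers does stitch the spans together. A minimal sketch of that workaround (not the instrumentor's fix), assuming vLLM reads the W3C traceparent header from incoming requests:

from opentelemetry import trace
from opentelemetry.propagate import inject

tracer = trace.get_tracer(__name__)

# Open a client-side span and copy its W3C trace context ("traceparent")
# into a carrier dict, then hand that dict to the OpenAI client via
# extra_headers so the server-side span joins the same trace.
with tracer.start_as_current_span("chat"):
    headers = {}
    inject(headers)
    chat_completion = client.chat.completions.create(
        messages=[{"role": "user", "content": "Say this is a test"}],
        model="meta-llama/Llama-3.2-1B-Instruct",
        extra_headers=headers,
    )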

👍 Expected behavior

I should see one trace with 2 spans (the client span and the vLLM server span) in the Jaeger web UI.

👎 Actual Behavior with Screenshots

I see 2 traces, each with a single span.

http://localhost:16686/

[Screenshots: the Jaeger UI shows two separate traces, each containing a single span]

🤖 Python Version

No response

📃 Provide any additional context for the Bug.

No response

👀 Have you spent some time to check if this bug has been raised before?

  • I checked and didn't find a similar issue

Are you willing to submit PR?

Yes I am willing to submit a PR!


dosubot bot commented Nov 13, 2024

The issue you're experiencing, where the OpenAIInstrumentor doesn't propagate the trace context between the OpenAI and vLLM spans, could be related to how the trace context is handled in the instrumentor. The OpenAIInstrumentor is supposed to propagate trace context by including it in the HTTP request headers, as verified by tests that check for the presence of TRACEPARENT_HEADER_NAME in the request headers [1].

To address this, make sure the trace context is correctly injected into and propagated through the request headers. The _OpenAITracingWrapper class is responsible for injecting tracing information into OpenAI client requests by modifying the extra_headers of the request [2]. Verify that this mechanism is functioning as expected in your setup.
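
A minimal sketch of that idea (hypothetical wrapper name and wrapt-style signature, not the actual Traceloop implementation):

from opentelemetry.propagate import inject

def _traced_create(wrapped, instance, args, kwargs):
    # Copy the active span's W3C trace context into the request's
    # extra_headers before delegating to the wrapped OpenAI method.
    headers = kwargs.get("extra_headers") or {}
    inject(headers)
    kwargs["extra_headers"] = headers
    return wrapped(*args, **kwargs)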

Additionally, verify that the TracerProvider and SpanProcessor are correctly configured: the TracerProvider should be set globally, and spans should be processed and exported properly [3].

If the issue persists, check for discrepancies in the OpenAI client library version or in other dependencies that might affect trace context propagation [4].


dosubot bot added the bug label Nov 13, 2024
nirga (Member) commented Nov 13, 2024

Thanks @ronensc! I imagine you plan on doing something similar to what you've done for Langchain?

ronensc (Contributor, Author) commented Nov 13, 2024

Right. I believe the implementation for OpenAIInstrumentor would be simpler than the Langchain one. I'll work on a PR and submit it once it's ready.
