[Prototype] Pipeline Components - Extension Support by gouslu · Pull Request #2141 · open-telemetry/otel-arrow

gouslu · 2026-03-01T21:50:05Z

Pipeline Components - Extension Support Proposal

Extensions are pipeline components alongside receivers, processors, and exporters -- but they do not participate in data-path connections (no PData channels, control messages only).

Extensions can be implemented as local (!Send futures, single-threaded LocalSet) or shared (Send futures, multi-threaded), giving extension authors flexibility with some caveats based on the Send only extension traits in this design.

Extensions can optionally implement extension traits (e.g., BearerTokenProvider) defined in the engine crate to expose capabilities to other components, or opt out and serve purely as background tasks.

Extension traits require Send + Clone + 'static. If an extension publishes any trait, the concrete type implementing that trait must satisfy these bounds -- meaning shared state must be managed via Arc (or similar) by the extension author.

The extension's lifecycle method (start) takes Box<Self> by move -- the extension instance is consumed, not cloned. What is cloned is the extension struct itself during extension_traits(): the macro clones self into each TraitRegistration, and those clones are inserted into the ExtensionRegistry. After trait collection, the original extension is consumed by start(). Each consumer (receiver/exporter) receives a clone of the registry, and calling registry.get::<dyn Trait>(name) returns a fresh clone of the stored trait object.

Extensions are configured as a sibling to nodes in the pipeline YAML (dedicated extensions: section), not inside nodes.

Extension trait types are sealed -- new trait types can only be added inside the engine crate; external crates can implement existing traits but cannot define new ones.

Extensions are started first, before exporters, processors, and receivers, so their capabilities are available at component initialization.

Extension registry is passed via start methods of exporters and receivers. Can be added to "process" as well if needed. Considered putting it into the EffectHandler, but it didn't feel like the right place for that.

Created a separate control message channel for Extensions based on the changes that were done by @utpilla 's PR #2141.

Alternatives Considered

Arc-based registry -- true single instance with Arc clones, but enforces a Sync boundary on all extensions.
Rc-based registry -- only works for local components. It might be acceptable to say that shared components don't have extension support, but this limits flexibility.
All other alternatives seem to require at least one deep clone or multiple instances and never achieve a true single instance. For this reason I opted for a design where deep clone is accepted and shared extension state is the responsibility of the extension author via an explicit Clone requirement.

Why Local and Shared Variants

Local and shared extension variants exist for two reasons:

Consistency -- receivers and exporters already have local/shared variants; extensions follow the same pattern.
Event-loop optimizations -- despite extension traits requiring Send (because the registry is Send), the start() async body of a local extension can use !Send types (Rc, RefCell, LocalSet spawning, etc.). This means a local extension that publishes Send traits can still use non-thread-safe optimizations inside its own event loop, even though its struct fields must be Send + Clone for trait registration.

- Add extensions as a dedicated section in PipelineConfig (sibling to nodes) - Introduce ExtensionWrapper (Local/Shared), ExtensionRegistry, sealed ExtensionTrait, and extension_traits! macro - Add BearerTokenProvider trait for authentication extensions - Implement AzureIdentityAuthExtension (managed identity + dev credentials) - Refactor Azure Monitor exporter to consume auth via extension registry - Pass ExtensionRegistry to receiver/exporter start() signatures - Generate EXTENSION_FACTORIES distributed slice via engine-macros - Reject extension URNs placed in the nodes section with a clear error - Add extension-system.md documentation

codecov · 2026-03-01T21:53:06Z

Codecov Report

❌ Patch coverage is 59.79381% with 585 lines in your changes missing coverage. Please review.
✅ Project coverage is 87.22%. Comparing base (fc73f05) to head (4e8aa37).

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #2141      +/-   ##
==========================================
- Coverage   87.44%   87.22%   -0.23%     
==========================================
  Files         558      566       +8     
  Lines      185764   186872    +1108     
==========================================
+ Hits       162447   163003     +556     
- Misses      22791    23343     +552     
  Partials      526      526

Components	Coverage Δ
otap-dataflow	`89.32% <59.79%> (-0.33%)`	⬇️
query_abstraction	`80.61% <ø> (ø)`
query_engine	`90.30% <ø> (ø)`
syslog_cef_receivers	`∅ <ø> (∅)`
otel-arrow-go	`52.44% <ø> (ø)`
quiver	`91.83% <ø> (ø)`

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

Based on utpilla's insight in open-telemetry#2113 that extensions never touch pipeline data.

jmacd

I reviewed extension-system.md. Looks good!

I think we can add more detail and flexibility in the future. We should focus on the configuration model, especially threads-vs-cores sharing questions, and the minimum required for our Azure auth core to serve as an extension for both azmon and parquet+object_store exporters.

As a nice-to-have, I think we should consider adding a core component that does some extremely-basic form of the extension we're adding, like a basicauth extension (receivers), like a headersetter extension (exporters).

jmacd · 2026-03-05T18:46:30Z

+   Processors do not receive the registry (they don't need cross-cutting
+   capabilities directly).


nit: Eventually, we will find ways to use processor extensions. In the Collector, we find cross cutting concerns like memory limiters and persistent key/value stores.

jmacd · 2026-03-05T18:49:07Z

+   data-path components initialize.
+
+2. **PData-free.** Extensions are completely decoupled from the pipeline data
+   type (`PData`). They receive their own `ExtensionControlMsg` messages


Note: the admin component is in a similar position, needing to be decoupled from the PData type of the engine, yet interoperating with the engine.

# Change Summary This PR adds a design proposal describing the extension system for the **OTel Dataflow Engine**. The document introduces a capability-based extension architecture allowing receivers, processors, and exporters to access non-pdata functionality through well-defined capability interfaces maintained in the engine core. The proposal covers: * core concepts such as **capabilities**, **extension providers**, and **extension instances** * integration of extensions into the **existing configuration model** * the **user experience** for declaring extensions and binding capabilities * the **developer experience** for implementing extension providers * the **runtime architecture** for resolving and instantiating extensions * the **execution models** supported by extensions (local vs shared) * comparison with the **Go Collector extension model** * a **phased evolution plan** (native extensions → hierarchical placement → WASM extensions) * implementation recommendations for building **high-performance extensions aligned with the engine's thread-per-core design** The goal of this document is to provide maintainers with a clear architectural proposal to review before implementing the extension system. ## What issue does this PR close? * Related to #2267, #2230, #2141, #2113 ## How are these changes tested? This PR introduces **documentation only** and does not modify runtime code. ## Are there any user-facing changes? Yes. This proposal describes a **future extension system** that will introduce new configuration capabilities such as: * an `extensions` section in pipeline configurations * a `capabilities` section in node definitions These changes are not implemented yet but outline the intended user-facing configuration model for extensions. --------- Co-authored-by: Joshua MacDonald <jmacd@users.noreply.github.com>

github-project-automation Bot added this to OTel-Arrow Mar 1, 2026

github-actions Bot added the rust Pull requests that update Rust code label Mar 1, 2026

gouslu mentioned this pull request Mar 1, 2026

Add extension support to the pipeline engine #2113

Closed

gouslu force-pushed the gouslu/extension-system branch 2 times, most recently from 7b8e00e to b9bf187 Compare March 1, 2026 22:19

gouslu changed the title ~~[DRAFT] Extension Support Prototype~~ [Prototype] Pipeline Components - Extension Support Mar 1, 2026

gouslu marked this pull request as ready for review March 1, 2026 22:25

gouslu requested a review from a team as a code owner March 1, 2026 22:25

gouslu force-pushed the gouslu/extension-system branch from 7bf7e21 to 5f970e0 Compare March 1, 2026 23:18

Merge branch 'main' into gouslu/extension-system

f71275d

gouslu force-pushed the gouslu/extension-system branch from 5f970e0 to f71275d Compare March 2, 2026 01:37

gouslu added 7 commits March 1, 2026 19:07

Make extension system PData-free

565bb7a

Based on utpilla's insight in open-telemetry#2113 that extensions never touch pipeline data.

Merge branch 'main' into gouslu/extension-system

5e4b1dd

Clean up stale comments, fix doc accuracy, tighten encapsulation

4aec4eb

Merge branch 'main' into gouslu/extension-system

82ddd04

Merge branch 'main' into gouslu/extension-system

5da6271

fix merge issues

44eb441

fixes

32b1ee0

jmacd mentioned this pull request Mar 5, 2026

[DRAFT] OTAP Dataflow extension support #1958

Closed

jmacd reviewed Mar 5, 2026

View reviewed changes

gouslu added 2 commits March 5, 2026 14:14

Merge branch 'main' into gouslu/extension-system

d48bc1f

added a revision of ext system arch doc

d861dba

gouslu mentioned this pull request Mar 7, 2026

docs: add extension system architecture document #2230

Closed

gouslu added 5 commits March 6, 2026 16:10

polish arch doc and remove old one

288dc70

Merge branch 'main' into gouslu/extension-system

ffda830

merge main

4e8aa37

extensions provide capabilities

b40aa4f

fix issues, update docs

cdce4e9

fix spacing

97a702c

utpilla mentioned this pull request Mar 11, 2026

example extension for comparison based on utpilla's design #2267

Closed

lquerel mentioned this pull request Mar 12, 2026

Design proposal for extension system in the OTel Dataflow Engine #2293

Merged

update docs

6ecf7cd

gouslu closed this Mar 19, 2026

github-project-automation Bot moved this to Done in OTel-Arrow Mar 19, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Prototype] Pipeline Components - Extension Support #2141

[Prototype] Pipeline Components - Extension Support #2141
gouslu wants to merge 18 commits into
open-telemetry:mainfrom
gouslu:gouslu/extension-system

gouslu commented Mar 1, 2026 •

edited

Loading

Uh oh!

codecov Bot commented Mar 1, 2026 •

edited

Loading

Uh oh!

jmacd left a comment

Uh oh!

jmacd Mar 5, 2026

Uh oh!

jmacd Mar 5, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		Processors do not receive the registry (they don't need cross-cutting
		capabilities directly).

Conversation

gouslu commented Mar 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Pipeline Components - Extension Support Proposal

Alternatives Considered

Why Local and Shared Variants

Uh oh!

codecov Bot commented Mar 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

jmacd left a comment

Choose a reason for hiding this comment

Uh oh!

jmacd Mar 5, 2026

Choose a reason for hiding this comment

Uh oh!

jmacd Mar 5, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

gouslu commented Mar 1, 2026 •

edited

Loading

codecov Bot commented Mar 1, 2026 •

edited

Loading