Skip to content

CORE-324 | chore: bump otel collector to v1.44/v0.138.0#3669

Merged
RonFed merged 3 commits into
odigos-io:mainfrom
RonFed:collecotr_0_138_and_1_44
Oct 26, 2025
Merged

CORE-324 | chore: bump otel collector to v1.44/v0.138.0#3669
RonFed merged 3 commits into
odigos-io:mainfrom
RonFed:collecotr_0_138_and_1_44

Conversation

@RonFed
Copy link
Copy Markdown
Collaborator

@RonFed RonFed commented Oct 24, 2025

Bump the collector dependencies to v1.44/v0.138.
The main motivation for this upgrade is to have the latest pdata performance improvements around memory allocation and re-using allocated objects.

Deprecations

Upstream bug fixes:

@RonFed RonFed force-pushed the collecotr_0_138_and_1_44 branch 2 times, most recently from 15dd7c7 to 806c385 Compare October 26, 2025 13:58
@RonFed RonFed force-pushed the collecotr_0_138_and_1_44 branch from 806c385 to 5a4fa94 Compare October 26, 2025 14:34
@RonFed RonFed changed the title [WIP] chore: bump otel collector to v1.44/v0.138.0 CORE 324 | chore: bump otel collector to v1.44/v0.138.0 Oct 26, 2025
@RonFed RonFed changed the title CORE 324 | chore: bump otel collector to v1.44/v0.138.0 CORE-324 | chore: bump otel collector to v1.44/v0.138.0 Oct 26, 2025
@RonFed RonFed marked this pull request as ready for review October 26, 2025 14:37
@RonFed RonFed merged commit 0d7fd4d into odigos-io:main Oct 26, 2025
68 of 71 checks passed
RonFed added a commit to RonFed/odigos_fork that referenced this pull request Oct 26, 2025
Bump the collector dependencies to v1.44/v0.138.
The main motivation for this upgrade is to have the latest `pdata`
performance improvements around memory allocation and re-using allocated
objects.

* loki exporter is deprecated:
open-telemetry/opentelemetry-collector-contrib#41413.
. https://grafana.com/docs/grafana-cloud/send-data/otlp/send-data-otlp/
The loki exporter is kept in the collector build to allow users to still
use these destinations - but we should follow up with changing the
destinations to use the modern OTLP variant.
* opencensus exporter is deprecated
open-telemetry/opentelemetry-collector-contrib#42239
* routingprocessor is deprecated
open-telemetry/opentelemetry-collector-contrib#42154

* Removed the untis from metrics name
open-telemetry/opentelemetry-go#7039 - this
impacts the self metrics we report to the UI - the names are ajusted.
RonFed added a commit to RonFed/odigos_fork that referenced this pull request Oct 27, 2025
Bump the collector dependencies to v1.44/v0.138.
The main motivation for this upgrade is to have the latest `pdata`
performance improvements around memory allocation and re-using allocated
objects.

* loki exporter is deprecated:
open-telemetry/opentelemetry-collector-contrib#41413.
. https://grafana.com/docs/grafana-cloud/send-data/otlp/send-data-otlp/
The loki exporter is kept in the collector build to allow users to still
use these destinations - but we should follow up with changing the
destinations to use the modern OTLP variant.
* opencensus exporter is deprecated
open-telemetry/opentelemetry-collector-contrib#42239
* routingprocessor is deprecated
open-telemetry/opentelemetry-collector-contrib#42154

* Removed the untis from metrics name
open-telemetry/opentelemetry-go#7039 - this
impacts the self metrics we report to the UI - the names are ajusted.
RonFed added a commit to RonFed/odigos_fork that referenced this pull request Oct 27, 2025
Bump the collector dependencies to v1.44/v0.138.
The main motivation for this upgrade is to have the latest `pdata`
performance improvements around memory allocation and re-using allocated
objects.

* loki exporter is deprecated:
open-telemetry/opentelemetry-collector-contrib#41413.
. https://grafana.com/docs/grafana-cloud/send-data/otlp/send-data-otlp/
The loki exporter is kept in the collector build to allow users to still
use these destinations - but we should follow up with changing the
destinations to use the modern OTLP variant.
* opencensus exporter is deprecated
open-telemetry/opentelemetry-collector-contrib#42239
* routingprocessor is deprecated
open-telemetry/opentelemetry-collector-contrib#42154

* Removed the untis from metrics name
open-telemetry/opentelemetry-go#7039 - this
impacts the self metrics we report to the UI - the names are ajusted.
alonkeyval pushed a commit that referenced this pull request Oct 28, 2025
Following up after:
#3669

- Implements Loki V3 migration:

https://github.com/open-telemetry/opentelemetry-collector-contrib/blob/release/v0.130.x/exporter/lokiexporter/README.md#migration-instructions
- Adds optional basic auth config for self hosted Loki

<img width="528" height="302" alt="Screenshot 2025-10-27 at 17 47 50"
src="https://github.com/user-attachments/assets/cbaaa481-5d38-46d8-95e4-9c980bbba069"
/>

---

Proof of data for Loki Cloud:

<img width="1716" height="990" alt="Screenshot 2025-10-27 at 16 30 03"
src="https://github.com/user-attachments/assets/36662b46-970d-42e4-9624-380f7c55f1f2"
/>

---

Proof of data for Loki Self Hosted:

<img width="1716" height="990" alt="Screenshot 2025-10-27 at 17 30 53"
src="https://github.com/user-attachments/assets/d3868e21-66b0-4ca9-83e2-79d1cc4cf3d5"
/>
damemi added a commit that referenced this pull request Feb 4, 2026
…or/otel to 141 + Remove deprecated components + Bump k8s min version to 1.21 (#4111)

The clickhouse exporter supports TLS settings similar to otlp:
https://github.com/open-telemetry/opentelemetry-collector-contrib/blob/945e5a71ef31793ff3280b28c1425086ea5332b6/exporter/clickhouseexporter/README.md#tls

Some users need this to connect to clickhouse, adding them as options in
the destination here

This adds:

* `insecure_skip_verify`
* `ca_file` (using the k8sconfig interface to mount the secret as a
file, similar to how the GCP exporter supports application default
credentials)

The direct string fields (such as CAPem, CertPem, KeyPem) aren't yet
supported in the clickhouse exporter, so it has to be a mounted file.
See
open-telemetry/opentelemetry-collector-contrib#43911 (comment)

---

To do this, it required bumping the collector/otel deps to 136 when TLS
config support was added to clickhouse. This required the following
changes:

This actually needs collector v0.136.0 for these settings from
open-telemetry/opentelemetry-collector-contrib#42581
(open-telemetry/opentelemetry-collector-contrib@d9769f7)

Also needs to remove loki exporter (removed in 131) for 136 🙃
open-telemetry/opentelemetry-collector-contrib#41413,
see
https://github.com/open-telemetry/opentelemetry-collector-contrib/tree/v0.130.0/exporter/lokiexporter#deprecation-notice
it's replaced with just otlp. The only destination that actually looks
like it's using the loki exporter is OpsVerse

As well as the opencensus exporter, removed in 133 upstream by
open-telemetry/opentelemetry-collector-contrib#42239

Also routing processor
open-telemetry/opentelemetry-collector-contrib#36616

See previous attempt #3669
(reverted in #3734)

---

Then, it turns out that 136 was bugged and did not have full support for
TLS settings like `insecure_skip_verify`. This was fixed in 141, which
required the following extra changes:

Actually needs collector v141 due to this bug in clickhouse not handling
all tls settings:
open-telemetry/opentelemetry-collector-contrib#43911
fixed in
open-telemetry/opentelemetry-collector-contrib#44093

Remove deprecated carbon exporter support (unmaintained upstream)
open-telemetry/opentelemetry-collector-contrib#44532

another upstream breaking change giving go mod trouble
open-telemetry/opentelemetry-collector#13948

configgrpc update:
open-telemetry/opentelemetry-collector#13996

and now metadata.yaml metrics require stablity levels
open-telemetry/opentelemetry-collector#13756

```
Error: failed loading /app/collector/receivers/odigosebpfreceiver/metadata.yaml: decoding failed due to the following error(s):

'telemetry.metrics[ebpf_memory_pressure_wait_time_total]' missing required field: `stability`
'telemetry.metrics[ebpf_total_bytes_read]' missing required field: `stability`
'telemetry.metrics[ebpf_lost_samples]' missing required field: `stability`
Error: failed loading /app/collector/receivers/odigosebpfreceiver/metadata.yaml: decoding failed due to the following error(s):

'telemetry.metrics[ebpf_memory_pressure_wait_time_total]' missing required field: `stability`
'telemetry.metrics[ebpf_total_bytes_read]' missing required field: `stability`
'telemetry.metrics[ebpf_lost_samples]' missing required field: `stability`
Error: metadata.yaml ordering check failed: [telemetry metrics] keys are not sorted: [odigos_log_data_size odigos_metric_data_size odigos_trace_data_size odigos_accepted_spans odigos_accepted_metric_points odigos_accepted_log_records]
Error: metadata.yaml ordering check failed: [telemetry metrics] keys are not sorted: [odigos_log_data_size odigos_metric_data_size odigos_trace_data_size odigos_accepted_spans odigos_accepted_metric_points odigos_accepted_log_records]
```

This bump also required adding the `endpointslices` permission to the
odiglet service account for the data-collection collector

---

Finally, endpointslices was not GA in k8s 1.20. This PR bumps our
minimum supported k8s version to 1.21. Enterprise update in
odigos-io/odigos-enterprise#2117
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants