Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

update config dependency #11611

Merged

Conversation

codeboten
Copy link
Contributor

@codeboten codeboten commented Nov 5, 2024

This PR does a couple of things that I couldn't quite split up so I put together a PR w/ individual commits to help reviewers get through it. This PR does the following:

  1. update go.opentelemetry.io/contrib/config package to latest. this brings in breaking changes. in order to prevent those breaking changes from impacting end users, i've also added a layer of config unmarshaling
  2. updates the collector to instantiate the meter provider (and exporters) via the config package. this allows us to remove all the code in otelinit. the reason for including this change was that unmarshaling the config was causing circular dependencies i didn't want to address by moving code that could be deleted around.

Replacement for #11458.

Fixes #12021

@codeboten codeboten force-pushed the codeboten/update-config-dependency branch 3 times, most recently from 6f225b7 to 47f403a Compare November 6, 2024 21:45
@mjnowen
Copy link

mjnowen commented Nov 12, 2024

Looking forward to seeing this get merged!

Copy link
Contributor

This PR was marked stale due to lack of activity. It will be closed in 14 days.

@github-actions github-actions bot added the Stale label Nov 27, 2024
@mjnowen
Copy link

mjnowen commented Nov 27, 2024

Not stale? @codeboten

@codeboten codeboten removed the Stale label Nov 27, 2024
@codeboten codeboten force-pushed the codeboten/update-config-dependency branch from 47f403a to 2a0e6f8 Compare December 9, 2024 19:03
Copy link

codecov bot commented Dec 9, 2024

Codecov Report

Attention: Patch coverage is 82.05805% with 68 lines in your changes missing coverage. Please review.

Project coverage is 91.62%. Comparing base (2447a81) to head (f2cddc3).
Report is 1 commits behind head on main.

Files with missing lines Patch % Lines
service/service.go 72.38% 34 Missing and 3 partials ⚠️
service/telemetry/internal/migration/v0.3.0.go 54.54% 12 Missing and 3 partials ⚠️
service/telemetry/internal/migration/v0.2.0.go 93.67% 8 Missing and 2 partials ⚠️
component/componenttest/configtest.go 0.00% 2 Missing and 1 partial ⚠️
service/telemetry/metrics.go 60.00% 1 Missing and 1 partial ⚠️
service/telemetry/config.go 92.30% 1 Missing ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##             main   #11611      +/-   ##
==========================================
- Coverage   91.75%   91.62%   -0.14%     
==========================================
  Files         464      465       +1     
  Lines       24763    24775      +12     
==========================================
- Hits        22722    22699      -23     
- Misses       1656     1687      +31     
- Partials      385      389       +4     

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

@codeboten
Copy link
Contributor Author

Could use some feedback from @open-telemetry/collector-approvers on this latest commit to the PR to update the config support from v0.2.0 to v0.3.0.

tl;dr there's a backwards incompatible change in how headers are passed in and I wanted to support users transitioning

i wanted to support both the 0.3.0 and the 0.2.0 schema, which resulted in some migration code. I still have a few tests to add, and this change will need a PR in go-contrib to be merged before it can be completely done. Anyways, i could use your input: df83265

This is waiting on open-telemetry/opentelemetry-go-contrib#6412

@dmitryax
Copy link
Member

The approach to handle the backward incompatible change looks good to me 👍

// compatibility attempts
return err
}
// TODO: add a warning log to tell users to migrate their config
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I guess it might be complicated to pass a logger here, but it's pretty important. I think we should come up with a solution by the time we release this change

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The way we do this on the Datadog exporter is to have a private field that is written to during unmarshaling and then a method to log warnings.

I don't love it, but I haven't been able to come up with anything better

Copy link
Member

@jpkrohling jpkrohling left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I appreciate your efforts to not break our current users. And explicitly about this:

Anyways, i could use your input: df83265

I like your approach. We don't have a schema version anywhere, so trying to fallback to the previous version before failing is a good approach. It doesn't scale nicely, but I also don't see a good way to determine the version other than requiring users to provide a schema version (which is bad UX for this specific place of the config, IMO).

@@ -363,3 +385,43 @@ func pdataFromSdk(res *sdkresource.Resource) pcommon.Resource {
}
return pcommonRes
}

func disableHighCardinalityMetrics() []config.View {
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

looks OK for now, but if we expect this list to grow before we remove the gate, it might make sense to externalize this somehow?

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is the purpose of #11754

service/service.go Show resolved Hide resolved
Copy link
Member

@mx-psi mx-psi left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think this is a good opportunity to resolve #10808 by writing down what our plans are regarding this.

// compatibility attempts
return err
}
// TODO: add a warning log to tell users to migrate their config
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The way we do this on the Datadog exporter is to have a private field that is written to during unmarshaling and then a method to log warnings.

I don't love it, but I haven't been able to come up with anything better

@@ -363,3 +385,43 @@ func pdataFromSdk(res *sdkresource.Resource) pcommon.Resource {
}
return pcommonRes
}

func disableHighCardinalityMetrics() []config.View {
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is the purpose of #11754

@codeboten codeboten force-pushed the codeboten/update-config-dependency branch 4 times, most recently from 3bcf5f1 to b0021fb Compare December 16, 2024 23:07
@codeboten codeboten marked this pull request as ready for review December 16, 2024 23:07
@codeboten codeboten requested a review from a team as a code owner December 16, 2024 23:07
@codeboten codeboten requested a review from atoulme December 16, 2024 23:07
@codeboten codeboten force-pushed the codeboten/update-config-dependency branch from 41beb42 to e30a736 Compare December 17, 2024 19:44
Copy link
Contributor

github-actions bot commented Jan 1, 2025

This PR was marked stale due to lack of activity. It will be closed in 14 days.

Signed-off-by: Alex Boten <[email protected]>
Signed-off-by: Alex Boten <[email protected]>
@codeboten codeboten force-pushed the codeboten/update-config-dependency branch from 9791d42 to 52c4f85 Compare January 22, 2025 20:18
InstrumentName: ptr("otelcol_processor_batch_batch_send_size_bytes"),
}))
}
return views
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@mx-psi ended up moving the logic to append views in here, as the views are needed to instantiate the SDK, PTAL

@codeboten codeboten force-pushed the codeboten/update-config-dependency branch from 52c4f85 to d314b4d Compare January 22, 2025 20:26
Signed-off-by: Alex Boten <[email protected]>
@codeboten codeboten force-pushed the codeboten/update-config-dependency branch from d314b4d to d742afd Compare January 22, 2025 20:28
Signed-off-by: Alex Boten <[email protected]>
@codeboten
Copy link
Contributor Author

Attempting to fix the integration test that's failing in contrib: open-telemetry/opentelemetry-collector-contrib#37425

Unclear to me whether this change is making that test flakier or not as this issue exists: open-telemetry/opentelemetry-collector-contrib#34836

@codeboten
Copy link
Contributor Author

Another attempt at fixing the contrib test open-telemetry/opentelemetry-collector-contrib#37431

Copy link
Member

@mx-psi mx-psi left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The new place for views looks reasonable

service/service.go Show resolved Hide resolved
service/service.go Show resolved Hide resolved
service/service.go Show resolved Hide resolved
service/service.go Show resolved Hide resolved
Copy link
Member

@mx-psi mx-psi left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I am fine with copy-pasting the scope name, it seems unlikely to change

@codeboten
Copy link
Contributor Author

codeboten commented Jan 23, 2025

The contrib exporter-1 test is still consistently failing https://github.com/open-telemetry/opentelemetry-collector/actions/runs/12932562549/job/36070135348?pr=11611, @songy23 do you have any thoughts on what's causing it to fail? i can't repro locally

@codeboten
Copy link
Contributor Author

i can't repro locally
was finally able to repro locally... will continue debugging the issue

@songy23
Copy link
Member

songy23 commented Jan 23, 2025

do you have any thoughts on what's causing it to fail?

Datadog relies on the internal meter to report some component specific metrics. Those metrics are then exposed with the internal Prometheus server. If anything changes the metric names / attributes from internal meter and/or Prometheus, it will break the Datadog integrations.

@codeboten
Copy link
Contributor Author

Thanks for your help @songy23, bug has been fixed

@codeboten codeboten enabled auto-merge January 24, 2025 17:37
@codeboten codeboten added this pull request to the merge queue Jan 24, 2025
Merged via the queue into open-telemetry:main with commit 4edaacd Jan 24, 2025
52 of 53 checks passed
@codeboten codeboten deleted the codeboten/update-config-dependency branch January 24, 2025 18:06
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Mismatch in resource attributes between metrics and everything else
10 participants