Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: Add objstore support for Swift using thanos.io/objstore #11672

Merged
merged 35 commits into from
Nov 25, 2024

Conversation

btaani
Copy link
Contributor

@btaani btaani commented Jan 12, 2024

What this PR does / why we need it:

  • Update thanos-io/objstore to use the latest version
  • Implement a new swift chunk client that relies on Thanos
  • Add TLS support for swift bucket and chuck clients

Which issue(s) this PR fixes:
Fixes LOG-4819, LOG-5063

Special notes for your reviewer:

Checklist

  • Reviewed the CONTRIBUTING.md guide (required)
  • Documentation added
  • Tests updated
  • CHANGELOG.md updated
    • If the change is worth mentioning in the release notes, add add-to-release-notes label
  • Changes that require user attention or interaction to upgrade are documented in docs/sources/setup/upgrade/_index.md
  • For Helm chart changes bump the Helm chart version in production/helm/loki/Chart.yaml and update production/helm/loki/CHANGELOG.md and production/helm/loki/README.md. Example PR
  • If the change is deprecating or removing a configuration option, update the deprecated-config.yaml and deleted-config.yaml files respectively in the tools/deprecated-config-checker directory. Example PR

@github-actions github-actions bot added the type/docs Issues related to technical documentation; the Docs Squad uses this label across many repositories label Jan 12, 2024
@pull-request-size pull-request-size bot added size/L and removed size/M labels Jan 12, 2024
@btaani btaani changed the title Add support for custom TLS config in swift storage client Add support for TLS config in swift storage client Jan 16, 2024
@btaani btaani marked this pull request as ready for review January 16, 2024 09:49
@btaani btaani requested a review from a team as a code owner January 16, 2024 09:49
@pull-request-size pull-request-size bot added size/M and removed size/L labels Jan 18, 2024
Comment on lines 34 to 36
Timeout time.Duration `yaml:"timeout"`
IdleConnTimeout time.Duration `yaml:"idle_conn_timeout"`
ResponseHeaderTimeout time.Duration `yaml:"response_header_timeout"`
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

These values are define here but I don't see anywhere they are actually being used to configure the underlying transport?

Copy link
Contributor Author

@btaani btaani Feb 5, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks, you're right I had assumed that this was the struct being used inside the chunks package. I've moved it there already.

Following @periklis 's advice to reuse the bucket_http config inside the swift bucket config, I've noticed that the swift config used is coming from thanos-io/objstore, which doesn't have an HTTP field built into it (in contrary to s3's thanos/objstore config struct which does have the HTTP field).
This limits the addition of the HTTPConfig.Transport field to the swift config.

So I need your advice here since I lack experience in this codebase, do you suggest that we wait until it's added to thanos-io/objstore's swift config? Or should we proceed with this PR only with the object client, while leaving the bucket client until either a restructure of the config is done, or till the correct fields are added to the thanos/objstore config?
Edit: after discussing with @periklis, we agreed to 1) add the required HTTP config in thanos-io/objstore and 2) use the thanos-io/objstore config in both the bucket and object client, for the sake of unifying all storage clients.

@@ -54,6 +64,11 @@ func (cfg *Config) RegisterFlagsWithPrefix(prefix string, f *flag.FlagSet) {
f.IntVar(&cfg.MaxRetries, prefix+"swift.max-retries", 3, "Max retries on requests error.")
f.DurationVar(&cfg.ConnectTimeout, prefix+"swift.connect-timeout", 10*time.Second, "Time after which a connection attempt is aborted.")
f.DurationVar(&cfg.RequestTimeout, prefix+"swift.request-timeout", 5*time.Second, "Time after which an idle request is aborted. The timeout watchdog is reset each time some data is received, so the timeout triggers after X time no data is received on a request.")
f.DurationVar(&cfg.HTTPConfig.IdleConnTimeout, prefix+"swift.http.idle-conn-timeout", 90*time.Second, "The maximum amount of time an idle connection will be held open.")
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This would be a change from the currently used default value of 0 for unlimited, I think as we are trying to get this change into a patch fix release it would be better to keep the current default value.

Also as mentioned above I don't see where this value actually configures the underlying transport

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@slim-bean AFAICS, GCS, Azure and S3 all define as default 90sec. Maybe you refer to some different place?

@periklis
Copy link
Collaborator

@btaani @slim-bean After careful consideration the present state of this PR does not consider all the places to configure the HTTP transport capabilities as proposed. TL;DR; The PR considers adding parameters only to the chunk/object client but not to the bucket client.

@btaani FYI the Loki code base is handling for historical reasons (part of it from Cortex's legacy) the buckets and object stored into them via separate clients. The main rationale for this separation is that for each use case we have different transport and encryption requirements (i.e. buckets are created once and is more of a fire and forget operation, while objects are accessed regularly). In any case we need to consider adding the HTTP capabilities in two places as of now:

For the sake of simplicity, you can consider reusing the bucket/http/config.go. For example S3 is using this in bucket/s3/config.go for the bucket client. OTH for the Swift object client you can keep your custom struct as other providers are doing.

@pull-request-size pull-request-size bot added size/L and removed size/M labels Feb 9, 2024
@btaani btaani changed the title Add support for TLS config in swift storage client storage: Swift backend using thanos.io/objstore Feb 9, 2024
@@ -5174,6 +5174,16 @@ The `swift_storage_config` block configures the connection to OpenStack Object S
# is received on a request.
# CLI flag: -<prefix>.swift.request-timeout
[request_timeout: <duration> | default = 5s]

# Set to false to skip verifying the certificate chain and hostname.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Did you edit this file directly? Or is this change automatically generated? The configuration reference should be updated in docs/sources/configure/index.template.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I did so directly.. thanks for pointing this out!

@github-actions github-actions bot added the type/docs Issues related to technical documentation; the Docs Squad uses this label across many repositories label Oct 30, 2024
@JoaoBraveCoding JoaoBraveCoding force-pushed the swift-tls branch 3 times, most recently from 747401d to c2f588f Compare November 15, 2024 11:00
Copy link
Contributor

@ashwanthgoli ashwanthgoli left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm! thank you @JoaoBraveCoding

@ashwanthgoli ashwanthgoli merged commit 44523e0 into grafana:main Nov 25, 2024
59 checks passed
JoaoBraveCoding added a commit to JoaoBraveCoding/loki that referenced this pull request Dec 4, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
size/L type/docs Issues related to technical documentation; the Docs Squad uses this label across many repositories
Projects
None yet
Development

Successfully merging this pull request may close these issues.

6 participants