Conversation

@simonswine
Contributor

What this PR does:

Imports the Cortex mixin from upstream, including its history, and places it under jsonnet/mimir-mixin.

Which issue(s) this PR fixes:

This allows the alerts and runbooks to diverge from the Cortex project.

Continued from #366

Checklist

  • Tests CI updated
  • Documentation added
  • [ ] CHANGELOG.md updated - the order of entries should be [CHANGE], [FEATURE], [ENHANCEMENT], [BUGFIX]

pracucci and others added 30 commits January 27, 2021 13:05
…ster-memory-alert-threshold

Increased CortexAllocatingTooMuchMemory alert threshold
Signed-off-by: Goutham Veeramachaneni <[email protected]>
…-memory-alert

Add alert for etcd memory limits close
Signed-off-by: Marco Pracucci <[email protected]>
…tooltip-decrescent-sorting

Sort legend descending in the CPU/memory panels.
…h-alert

Fixed CortexQuerierHighRefetchRate alert
Signed-off-by: Marco Pracucci <[email protected]>
…ueries-dashboard

Add slow queries dashboard
- Update dashboard so it only shows under-provisioned services and why
- Add sizing rules based on limits.
- Add some docs to the dashboard.

Signed-off-by: Tom Wilkie <[email protected]>
Add recording rules to calculate Cortex scaling
…isk-panels

Fixed "Disk Writes" and "Disk Reads" panels
Signed-off-by: Marco Pracucci <[email protected]>
…ecording-rules

Pre-compute aggregations to optimize scaling recording rules
…on-to-create-compactor-statefulset

Add function to customize compactor statefulset
…tor-alert

Fixed CortexCompactorRunFailed threshold
…t-progress-dashboard

Added Cortex Rollout progress dashboard
stevesg and others added 26 commits September 22, 2021 09:20
…ations-rules

Add recording rules for Alertmanager dashboard,
This is a workaround for large clusters where this group can become slow to evaluate.
…rules

Split `cortex_api` recording rule group into three groups.
…til-playbook

Update gsutil installation playbook
This fixes panels where `cortex-gw` was hardcoded.
…ntainer-names

Use `$._config.job_names.gateway` in resources dashboards.
Signed-off-by: Marco Pracucci <[email protected]>
Signed-off-by: Marco Pracucci <[email protected]>
…rtexIngesterReachingSeriesLimit

Fine tune CortexIngesterReachingSeriesLimit alert
…tuck-rollout

Add CortexRolloutStuck alert
Signed-off-by: Marco Pracucci <[email protected]>
…onsul-failures

Added CortexFailingToTalkToConsul alert
@simonswine
Contributor Author

This is a duplicate of #373

Collaborator

@pracucci pracucci left a comment

I promise this time I will merge it the right way.

@pracucci pracucci merged commit 487fb5b into main Oct 19, 2021
@pracucci pracucci deleted the 20211018_import-cortex-mixin branch October 19, 2021 16:50