Skip to content
This repository was archived by the owner on Apr 28, 2025. It is now read-only.

Commit e7cbfe4

Browse files
authored
Merge pull request #321 from grafana/lower-ingester-restarts-severity
Lower CortexIngesterRestarts severity
2 parents bf9729e + 2624c08 commit e7cbfe4

File tree

2 files changed

+6
-2
lines changed

2 files changed

+6
-2
lines changed

CHANGELOG.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -5,6 +5,7 @@
55
* [CHANGE] `namespace` template variable in dashboards now only selects namespaces for selected clusters. #311
66
* [CHANGE] Alertmanager: mounted overrides configmap to alertmanager too. #315
77
* [CHANGE] Memcached: upgraded memcached from `1.5.17` to `1.6.9`. #316
8+
* [CHANGE] `CortexIngesterRestarts` alert severity changed from `critical` to `warning`. #321
89
* [CHANGE] Store-gateway: increased memory request and limit respectively from 6GB / 6GB to 12GB / 18GB. #322
910
* [CHANGE] Store-gateway: increased `-blocks-storage.bucket-store.max-chunk-pool-bytes` from 2GB (default) to 12GB. #322
1011
* [ENHANCEMENT] cortex-mixin: Make `cluster_namespace_deployment:kube_pod_container_resource_requests_{cpu_cores,memory_bytes}:sum` backwards compatible with `kube-state-metrics` v2.0.0. #317

cortex-mixin/alerts/alerts.libsonnet

Lines changed: 5 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -198,10 +198,13 @@
198198
{
199199
alert: 'CortexIngesterRestarts',
200200
expr: |||
201-
changes(process_start_time_seconds{job=~".+(cortex|ingester.*)"}[30m]) > 1
201+
changes(process_start_time_seconds{job=~".+(cortex|ingester.*)"}[30m]) >= 2
202202
|||,
203203
labels: {
204-
severity: 'critical',
204+
// This alert is on a cause not symptom. A couple of ingesters restarts may be suspicious but
205+
// not necessarily an issue (eg. may happen because of the K8S node autoscaler), so we're
206+
// keeping the alert as warning as a signal in case of an outage.
207+
severity: 'warning',
205208
},
206209
annotations: {
207210
message: '{{ $labels.job }}/{{ $labels.instance }} has restarted {{ printf "%.2f" $value }} times in the last 30 mins.',

0 commit comments

Comments
 (0)