This repository was archived by the owner on Apr 28, 2025. It is now read-only.


@pracucci
Collaborator

What this PR does:
In the last few days we experienced TSDB WAL corruption twice and found the existing alerts inadequate. This PR proposes adding more critical alerts for possible TSDB issues, including alerts based on the new metrics introduced in cortexproject/cortex#3373.

Which issue(s) this PR fixes:
N/A

Checklist

  • CHANGELOG.md updated - the order of entries should be [CHANGE], [FEATURE], [ENHANCEMENT], [BUGFIX]

@pracucci pracucci requested a review from a team as a code owner October 28, 2020 13:42
@codesome codesome self-requested a review October 28, 2020 13:47
Contributor

@codesome codesome left a comment

(Realising after 5hrs that I had not clicked on submit :P)

@pracucci pracucci force-pushed the add-tsdb-critical-alerts branch from 5eb23ea to 1959732 Compare November 10, 2020 13:53
Signed-off-by: Marco Pracucci <[email protected]>
@codesome codesome merged commit e2333a6 into master Nov 10, 2020
@codesome codesome deleted the add-tsdb-critical-alerts branch November 10, 2020 14:23
simonswine pushed a commit to grafana/mimir that referenced this pull request Oct 18, 2021
…onnet#208)

* Added more critical alerts on Cortex ingester TSDB

Signed-off-by: Marco Pracucci <[email protected]>

* Added CHANGELOG entry

Signed-off-by: Marco Pracucci <[email protected]>

* Addressed review comments

Signed-off-by: Marco Pracucci <[email protected]>

2 participants