Skip to content
This repository was archived by the owner on Apr 28, 2025. It is now read-only.

Conversation

@pracucci
Copy link
Collaborator

What this PR does:
In this PR I'm proposing to cleanup a bit compactor alerts:

  • Renamed CortexCompactorHasNotUploadedBlocksSinceStart into CortexCompactorHasNotUploadedBlocks. Rationale: no need to distinguish if it's "since start" or not, we just want to alert if a compactor is not uploading blocks.
  • Renamed CortexCompactorRunFailed into CortexCompactorHasNotSuccessfullyRunCompaction. Rationale: CortexCompactorHasNotSuccessfullyRunCompaction is alerting on the case the compactor run is consistently failing, same as CortexCompactorRunFailed, so I think merging makes sense and simplifies.

Which issue(s) this PR fixes:
N/A

Checklist

  • CHANGELOG.md updated - the order of entries should be [CHANGE], [FEATURE], [ENHANCEMENT], [BUGFIX]

@pracucci pracucci requested a review from pstibrany June 21, 2021 10:47
@pracucci pracucci requested a review from a team as a code owner June 21, 2021 10:47

Same as [`CortexCompactorHasNotUploadedBlocks`](#CortexCompactorHasNotUploadedBlocks).

### CortexCompactorHasNotSuccessfullyRunCompaction
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Moved above, to have "Compactor is failing because of not healthy index found" as a sub-section of it.

gsutil mv gs://BUCKET/TENANT/BLOCK gs://BUCKET/TENANT/corrupted-BLOCK
```

### CortexCompactorHasNotUploadedBlocksSinceStart
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

After the renaming it's covered by CortexCompactorHasNotUploadedBlocks.


_TODO: this playbook has not been written yet._

### CortexCompactorRunFailed
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

After the renaming it's covered by CortexCompactorHasNotSuccessfullyRunCompaction


### CortexCompactorHasNotSuccessfullyRunCompaction

This alert fires if the compactor is not able to successfully run a full compaction.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Explain what "full" means? Is it all tenants? Is it all cycles for a single tenant?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Right. Is it more clear now?

pracucci added 2 commits June 21, 2021 15:07
Signed-off-by: Marco Pracucci <[email protected]>
@pracucci pracucci force-pushed the playbooks-for-compactor-alerts branch from 2538f75 to 11f5495 Compare June 21, 2021 13:10
Signed-off-by: Marco Pracucci <[email protected]>

Co-authored-by: Peter Štibraný <[email protected]>
@pracucci pracucci merged commit 8817fc8 into main Jun 21, 2021
@pracucci pracucci deleted the playbooks-for-compactor-alerts branch June 21, 2021 13:55
simonswine pushed a commit to grafana/mimir that referenced this pull request Oct 18, 2021
…for-compactor-alerts

Improve compactor alerts and playbooks
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants