Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improve compactor alerts and playbooks #334

Merged
merged 3 commits into from
Jun 21, 2021
Merged

Conversation

pracucci
Copy link
Collaborator

What this PR does:
In this PR I'm proposing to cleanup a bit compactor alerts:

  • Renamed CortexCompactorHasNotUploadedBlocksSinceStart into CortexCompactorHasNotUploadedBlocks. Rationale: no need to distinguish if it's "since start" or not, we just want to alert if a compactor is not uploading blocks.
  • Renamed CortexCompactorRunFailed into CortexCompactorHasNotSuccessfullyRunCompaction. Rationale: CortexCompactorHasNotSuccessfullyRunCompaction is alerting on the case the compactor run is consistently failing, same as CortexCompactorRunFailed, so I think merging makes sense and simplifies.

Which issue(s) this PR fixes:
N/A

Checklist

  • CHANGELOG.md updated - the order of entries should be [CHANGE], [FEATURE], [ENHANCEMENT], [BUGFIX]

@pracucci pracucci requested a review from pstibrany June 21, 2021 10:47
@pracucci pracucci requested a review from a team as a code owner June 21, 2021 10:47

Same as [`CortexCompactorHasNotUploadedBlocks`](#CortexCompactorHasNotUploadedBlocks).

### CortexCompactorHasNotSuccessfullyRunCompaction
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Moved above, to have "Compactor is failing because of not healthy index found" as a sub-section of it.

@@ -251,18 +261,6 @@ To rename a block stored on GCS you can use the `gsutil` CLI:
gsutil mv gs://BUCKET/TENANT/BLOCK gs://BUCKET/TENANT/corrupted-BLOCK
```

### CortexCompactorHasNotUploadedBlocksSinceStart
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

After the renaming it's covered by CortexCompactorHasNotUploadedBlocks.


_TODO: this playbook has not been written yet._

### CortexCompactorRunFailed
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

After the renaming it's covered by CortexCompactorHasNotSuccessfullyRunCompaction

CHANGELOG.md Outdated Show resolved Hide resolved
cortex-mixin/alerts/compactor.libsonnet Show resolved Hide resolved
cortex-mixin/docs/playbooks.md Outdated Show resolved Hide resolved
- If the alert `CortexIngesterHasNotShippedBlocks` or `CortexIngesterHasNotShippedBlocksSinceStart` have fired as well, then investigate that issue first
- Ensure ingesters are successfully shipping blocks to the storage
- Look for any error in the compactor logs

### CortexCompactorHasNotSuccessfullyRunCompaction

This alert fires if the compactor is not able to successfully run a full compaction.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Explain what "full" means? Is it all tenants? Is it all cycles for a single tenant?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Right. Is it more clear now?

pracucci added 2 commits June 21, 2021 15:07
Signed-off-by: Marco Pracucci <marco@pracucci.com>
Signed-off-by: Marco Pracucci <marco@pracucci.com>
@pracucci pracucci force-pushed the playbooks-for-compactor-alerts branch from 2538f75 to 11f5495 Compare June 21, 2021 13:10
cortex-mixin/docs/playbooks.md Outdated Show resolved Hide resolved
Signed-off-by: Marco Pracucci <marco@pracucci.com>

Co-authored-by: Peter Štibraný <peter.stibrany@grafana.com>
@pracucci pracucci merged commit 8817fc8 into main Jun 21, 2021
@pracucci pracucci deleted the playbooks-for-compactor-alerts branch June 21, 2021 13:55
simonswine pushed a commit to grafana/mimir that referenced this pull request Oct 18, 2021
…for-compactor-alerts

Improve compactor alerts and playbooks
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants