Improve compactor alerts and playbooks #334
Same as [`CortexCompactorHasNotUploadedBlocks`](#CortexCompactorHasNotUploadedBlocks).

### CortexCompactorHasNotSuccessfullyRunCompaction
Moved above, to have "Compactor is failing because of `not healthy index found`" as a sub-section of it.
@@ -251,18 +261,6 @@ To rename a block stored on GCS you can use the `gsutil` CLI:
gsutil mv gs://BUCKET/TENANT/BLOCK gs://BUCKET/TENANT/corrupted-BLOCK
```

### CortexCompactorHasNotUploadedBlocksSinceStart
After the renaming it's covered by `CortexCompactorHasNotUploadedBlocks`.
_TODO: this playbook has not been written yet._

### CortexCompactorRunFailed
After the renaming it's covered by `CortexCompactorHasNotSuccessfullyRunCompaction`.
cortex-mixin/docs/playbooks.md
- If the alert `CortexIngesterHasNotShippedBlocks` or `CortexIngesterHasNotShippedBlocksSinceStart` have fired as well, then investigate that issue first
- Ensure ingesters are successfully shipping blocks to the storage
- Look for any error in the compactor logs

### CortexCompactorHasNotSuccessfullyRunCompaction

This alert fires if the compactor is not able to successfully run a full compaction.
Explain what "full" means? Is it all tenants? Is it all cycles for a single tenant?
Right. Is it more clear now?
Signed-off-by: Marco Pracucci <marco@pracucci.com>
Signed-off-by: Marco Pracucci <marco@pracucci.com>
2538f75 to 11f5495
Signed-off-by: Marco Pracucci <marco@pracucci.com> Co-authored-by: Peter Štibraný <peter.stibrany@grafana.com>
…for-compactor-alerts Improve compactor alerts and playbooks
What this PR does:
In this PR I'm proposing to clean up the compactor alerts a bit:
- Merge `CortexCompactorHasNotUploadedBlocksSinceStart` into `CortexCompactorHasNotUploadedBlocks`. Rationale: there is no need to distinguish whether it's "since start" or not; we just want to alert if a compactor is not uploading blocks.
- Merge `CortexCompactorRunFailed` into `CortexCompactorHasNotSuccessfullyRunCompaction`. Rationale: `CortexCompactorHasNotSuccessfullyRunCompaction` alerts when the compactor run is consistently failing, the same case as `CortexCompactorRunFailed`, so I think merging them makes sense and simplifies things.

Which issue(s) this PR fixes:
N/A
Checklist
- `CHANGELOG.md` updated - the order of entries should be `[CHANGE]`, `[FEATURE]`, `[ENHANCEMENT]`, `[BUGFIX]`