Import cortex mixin from upstream #373

Merged
merged 535 commits into from
Oct 19, 2021

@simonswine (Contributor) commented Oct 18, 2021

What this PR does:

Imports the Cortex mixin from upstream, including its history, and places it under jsonnet/mimir-mixin.

Which issue(s) this PR fixes:

This allows the alerts and runbooks to diverge from those of the Cortex project.

Continued from #366

Checklist

  • Tests CI updated
  • Documentation added
  • [ ] CHANGELOG.md updated - the order of entries should be [CHANGE], [FEATURE], [ENHANCEMENT], [BUGFIX]
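
The entry order named in the last checklist item can be checked mechanically. The following is a minimal sketch in Python, assuming each changelog entry is a bullet line of the form `* [TAG] description`. That line format and the helper name `entries_in_order` are assumptions made for illustration; they are not taken from the repository.

```python
import re

# Order required by the checklist item above.
TAG_ORDER = ["CHANGE", "FEATURE", "ENHANCEMENT", "BUGFIX"]

# Assumed entry format: "* [TAG] description" (or "- [TAG] description").
ENTRY_RE = re.compile(r"^\s*[*-]\s*\[([A-Z]+)\]")


def entries_in_order(lines):
    """Return True if tagged entries appear in the checklist order."""
    last_rank = 0
    for line in lines:
        match = ENTRY_RE.match(line)
        if not match or match.group(1) not in TAG_ORDER:
            continue  # skip headings, blank lines, and unknown tags
        rank = TAG_ORDER.index(match.group(1))
        if rank < last_rank:
            return False  # an earlier-ranked tag follows a later-ranked one
        last_rank = rank
    return True
```

A check like this would be run over the lines of a single release section, since the order presumably restarts under each release heading.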

pracucci and others added 30 commits January 27, 2021 13:05
Signed-off-by: Marco Pracucci <marco@pracucci.com>
…ster-memory-alert-threshold

Increased CortexAllocatingTooMuchMemory alert threshold
Signed-off-by: Goutham Veeramachaneni <gouthamve@gmail.com>
…-memory-alert

Add alert for etcd memory limits close
Signed-off-by: Mauro Stettler <mauro.stettler@gmail.com>
Signed-off-by: Marco Pracucci <marco@pracucci.com>
Signed-off-by: Marco Pracucci <marco@pracucci.com>
Signed-off-by: Marco Pracucci <marco@pracucci.com>
…tooltip-decrescent-sorting

Sort legend descending in the CPU/memory panels.
…h-alert

Fixed CortexQuerierHighRefetchRate alert
Signed-off-by: Marco Pracucci <marco@pracucci.com>
Signed-off-by: Marco Pracucci <marco@pracucci.com>
…ueries-dashboard

Add slow queries dashboard
- Update dashboard so it only shows under provisioned services and why
- Add sizing rules based on limits.
- Add some docs to the dashboard.

Signed-off-by: Tom Wilkie <tom@grafana.com>
Add recording rules to calculate Cortex scaling
Signed-off-by: Marco Pracucci <marco@pracucci.com>
Signed-off-by: Marco Pracucci <marco@pracucci.com>
…isk-panels

Fixed "Disk Writes" and "Disk Reads" panels
Signed-off-by: Marco Pracucci <marco@pracucci.com>
Signed-off-by: Marco Pracucci <marco@pracucci.com>
…ecording-rules

Pre-compute aggregations to optimize scaling recording rules
Signed-off-by: Marco Pracucci <marco@pracucci.com>
Signed-off-by: Marco Pracucci <marco@pracucci.com>
…on-to-create-compactor-statefulset

Add function to customize compactor statefulset
Signed-off-by: Marco Pracucci <marco@pracucci.com>
…tor-alert

Fixed CortexCompactorRunFailed threshold
Signed-off-by: Marco Pracucci <marco@pracucci.com>
…t-progress-dashboard

Added Cortex Rollout progress dashboard
Signed-off-by: Marco Pracucci <marco@pracucci.com>
pracucci and others added 7 commits October 14, 2021 09:45
…rtexIngesterReachingSeriesLimit

Fine tune CortexIngesterReachingSeriesLimit alert
…tuck-rollout

Add CortexRolloutStuck alert
Signed-off-by: Marco Pracucci <marco@pracucci.com>
Signed-off-by: Marco Pracucci <marco@pracucci.com>
Signed-off-by: Marco Pracucci <marco@pracucci.com>
…onsul-failures

Added CortexFailingToTalkToConsul alert
@simonswine simonswine changed the title 20211018 import cortex mixin Import cortex mixin from upstream Oct 18, 2021
@simonswine simonswine mentioned this pull request Oct 18, 2021
3 tasks
@simonswine
Contributor Author

@56quarters: Another thing to note is that it would be nice to standardize where mixins are in a repository between Mimir, Loki, and Tempo.

This is a good point. Which of the two do we prefer? (I have a slight preference for operations; how do others feel?)

@simonswine
Contributor Author

@56quarters: I notice this doesn't import the jsonnet for running Mimir with Tanka, should it?

I think eventually yes, but for now I would like to start by adding some alerts and runbook content for query-sharding only. I think the operational jsonnet manifests can be added separately.

@56quarters
Contributor

@56quarters: Another thing to note is that it would be nice to standardize where mixins are in a repository between Mimir, Loki, and Tempo.

This is a good point. Which of the two do we prefer? (I have a slight preference for operations; how do others feel?)

I prefer "operations" as well

Collaborator

@pracucci pracucci left a comment

Good job!

I know you didn't do it, but let me say it anyway: I wouldn't do any "Cortex -> Mimir" renaming in alerts / dashboards at this stage. We can do it later on, in separate PRs (mixing it with this import PR would make the review more difficult).

@simonswine simonswine force-pushed the 20211018_import-cortex-mixin branch from cb26f73 to 8b4b942 Compare October 19, 2021 08:20
@simonswine simonswine force-pushed the 20211018_import-cortex-mixin branch from a598f4e to e7b4eab Compare October 19, 2021 11:01
@simonswine simonswine requested a review from pracucci October 19, 2021 12:34
Collaborator

@pracucci pracucci left a comment

Good job! I didn't check every single command in the Makefile but, as long as CI passes, I'm good :) I think we can iterate on it if we find any issues in the future.

@pracucci pracucci merged commit 025285e into main Oct 19, 2021
@pracucci pracucci deleted the 20211018_import-cortex-mixin branch October 19, 2021 16:06
pracucci added a commit that referenced this pull request Oct 19, 2021
@pracucci pracucci restored the 20211018_import-cortex-mixin branch October 19, 2021 16:08
@simonswine simonswine mentioned this pull request Oct 19, 2021
2 tasks
@pracucci pracucci deleted the 20211018_import-cortex-mixin branch October 19, 2021 16:50