Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

tweak OTEL alerts to use SR #1721

Merged
merged 3 commits into from
Sep 20, 2024
Merged

tweak OTEL alerts to use SR #1721

merged 3 commits into from
Sep 20, 2024

Conversation

thampiotr
Copy link
Contributor

@thampiotr thampiotr commented Sep 20, 2024

PR Description

Changes OTEL Collector alerts to use success rate of 95% over 10min period to alert. Otherwise, we may have 99.9999% success rate and still alert. Some level of failures is acceptable for most users, especially for tracing which is frequently sampled.

Which issue(s) this PR fixes

Notes to the Reviewer

Tested in https://github.com/grafana/deployment_tools/pull/174974

PR Checklist

  • CHANGELOG.md updated
  • Documentation added
  • Tests updated
  • Config converters updated

@thampiotr thampiotr marked this pull request as ready for review September 20, 2024 12:39
@thampiotr thampiotr requested a review from a team as a code owner September 20, 2024 12:39
Copy link
Contributor

@wildum wildum left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm

@thampiotr thampiotr merged commit aecdeba into main Sep 20, 2024
18 checks passed
@thampiotr thampiotr deleted the thampiotr/update-otel-alerts branch September 20, 2024 13:52
@thampiotr thampiotr mentioned this pull request Oct 3, 2024
4 tasks
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants