-
Notifications
You must be signed in to change notification settings - Fork 2.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Thanos Downsampling Checksum Mismatch #5944
Comments
This sounds a good idea! |
@yeya24 Thanks! Do you have any recommendations as to how we can avoid this issue currently? The only thing I can really think of is to edit the |
@RohitKochhar Editing metadata directly sounds dangerous. I guess it should not be that hard to add the new marker you mentioned. Checking the metadata before downsampling is good enough. |
@yeya24 Thanks so much for your feedback. I will be opening a PR in the coming days to add this flag. |
This changed was implemented and is a part of |
Thanos, Prometheus and Golang version used: v0.25.1 / NA
Object Storage Provider: AWS S3
What happened:
When Thanos Compact encounters a block with a checksum mismatch during compaction, we have an automated program that marks the block
no-compact
and restarts Compact, allowing the backlog of blocks to be compacted without issue.However, we are now encountering an error where Compact encounters blocks with checksum mismatches during downsampling, causing an error to be thrown with the following message:
What you expected to happen:
Ideally, there would be a mark that could be added using
thanos tools mark
that specifiesno-downsample
, or if the block is markedno-compaction
it would be inferred that it should beno-downsample
to prevent compact from failing.How to reproduce it (as minimally and precisely as possible):
Reproducing steps can be a bit difficult, but it seems like a block with a checksum mismatch needs to be marked
no-compact
and then Thanos Compact must attempt to downsample it without compaction.The text was updated successfully, but these errors were encountered: