This repository has been archived by the owner on Nov 21, 2024. It is now read-only.

Fix the “out of order samples” and “duplicate sample for timestamp” issues #10

Closed
alolita opened this issue Feb 5, 2021 · 2 comments · Fixed by open-telemetry/opentelemetry-collector#2941
Assignees
Labels
P0 (An issue that needs to be addressed immediately. Breaking change.) · phase1 (phase1 tasks) · prom-exporter (Prometheus exporter tasks)

Comments

@alolita
Member

alolita commented Feb 5, 2021

Fix the “out of order samples” and “duplicate sample for timestamp” issues in the Prometheus RW exporter.

OTel Components:
https://github.com/open-telemetry/opentelemetry-collector/tree/main/exporter/prometheusremotewriteexporter
https://github.com/open-telemetry/opentelemetry-collector-contrib/tree/main/exporter/awsprometheusremotewriteexporter

Also see open-telemetry/opentelemetry-collector#2315

@alolita alolita added the phase1 (phase1 tasks) and prom-exporter (Prometheus exporter tasks) labels on Feb 5, 2021
@alolita alolita changed the title from "Fix the “out of order samples” and “duplicate sample for timestamp” issues." to "Fix the “out of order samples” and “duplicate sample for timestamp” issues" on Feb 5, 2021
@tomwilkie

I think this is due to the exporter sending samples in parallel without any sharding. The Prometheus remote write code goes to quite some lengths to ensure that samples for a specific series are delivered in order, but between series there is parallelism - see the queue manager code: https://github.com/prometheus/prometheus/blob/6bc67805332dad9345f9d11069f321203bf89f8d/storage/remote/queue_manager.go#L883

The easiest way to fix this would be to reuse all of that code - that also guarantees it will keep working as we improve it upstream. In particular, we're in the process of redesigning and rewriting it so that we can guarantee metadata, exemplars and histograms are all delivered atomically.
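
To illustrate the ordering property described above, here is a minimal Go sketch assuming a hypothetical hash-the-series-to-a-shard scheme (this is not the actual Prometheus queue manager code): samples for the same series always land on the same shard and are therefore sent in order, while different series are flushed in parallel.

```go
// Minimal, illustrative sketch (not the actual Prometheus queue manager):
// samples for the same series are routed to the same shard, so they are
// sent in order, while different series are sent in parallel.
package main

import (
	"fmt"
	"hash/fnv"
	"sync"
)

type sample struct {
	series    string // canonical label set identifying the series
	value     float64
	timestamp int64
}

// shardFor picks a shard deterministically from the series identity, so a
// given series always goes through the same queue.
func shardFor(series string, numShards int) int {
	h := fnv.New32a()
	h.Write([]byte(series))
	return int(h.Sum32() % uint32(numShards))
}

func main() {
	const numShards = 4
	shards := make([]chan sample, numShards)
	var wg sync.WaitGroup

	for i := range shards {
		shards[i] = make(chan sample, 16)
		wg.Add(1)
		go func(id int, in <-chan sample) {
			defer wg.Done()
			// Each shard drains its queue sequentially, so samples for a
			// given series are "sent" in the order they were enqueued.
			for s := range in {
				fmt.Printf("shard %d sends %s ts=%d val=%g\n", id, s.series, s.timestamp, s.value)
			}
		}(i, shards[i])
	}

	samples := []sample{
		{series: `up{instance="a"}`, value: 1, timestamp: 100},
		{series: `up{instance="b"}`, value: 1, timestamp: 100},
		{series: `up{instance="a"}`, value: 0, timestamp: 200}, // same series, same shard, stays ordered
	}
	for _, s := range samples {
		shards[shardFor(s.series, numShards)] <- s
	}

	for _, ch := range shards {
		close(ch)
	}
	wg.Wait()
}
```

Without this kind of sharding (or without sorting each series' samples), concurrent senders can deliver samples for one series out of timestamp order, which is exactly what the remote write endpoint rejects.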

@rakyll rakyll added the P0 (An issue that needs to be addressed immediately. Breaking change.) label on Apr 7, 2021
@rakyll
Contributor

rakyll commented Apr 13, 2021

We are producing samples in non-chronological order due to the bug explained at open-telemetry/opentelemetry-collector#2315 (comment).

odeke-em added a commit to orijtech/opentelemetry-collector that referenced this issue Apr 14, 2021
…t of order errors

Ensures that, before a prompb.WriteRequest is created, the TimeSeries it contains have their Sample values sorted chronologically, so that Prometheus does not reject them with "out of order" errors.

Thanks to @rakyll for a reproducer and for diagnosing the problem, which helped distill the issue from a very complex setup that required expensive Kubernetes clusters with many replicas; essentially, the problem became more apparent as the number of TimeSeries grew.

It is also worth noting that the presence of this bug means that when a large number of replicas are being scraped, scraping stalls and takes a long time, so targets scraped in round-robin fashion experience staleness when there are many of them. This might be one more reason for setups to adopt a push model rather than scraping endpoints.

Fixes open-telemetry#2315
Fixes open-telemetry/prometheus-interoperability-spec#10
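
For illustration, here is a minimal Go sketch of the sorting step the commit message describes, using simplified stand-in types rather than the real prompb generated structs (the field names mirror prompb, but this is not the collector's actual code):

```go
// Minimal, illustrative sketch of the fix: sort every TimeSeries' samples by
// timestamp before the write request is assembled. The types below are
// simplified stand-ins for the prompb structs, not the generated code.
package main

import (
	"fmt"
	"sort"
)

type Sample struct {
	Value     float64
	Timestamp int64 // milliseconds since the Unix epoch
}

type TimeSeries struct {
	Labels  map[string]string
	Samples []Sample
}

type WriteRequest struct {
	Timeseries []TimeSeries
}

// sortSamples orders each series' samples chronologically so the remote
// write endpoint never sees an older timestamp after a newer one.
func sortSamples(series []TimeSeries) {
	for i := range series {
		s := series[i].Samples
		sort.Slice(s, func(a, b int) bool { return s[a].Timestamp < s[b].Timestamp })
	}
}

func main() {
	series := []TimeSeries{{
		Labels: map[string]string{"__name__": "http_requests_total"},
		Samples: []Sample{
			{Value: 3, Timestamp: 300},
			{Value: 1, Timestamp: 100},
			{Value: 2, Timestamp: 200},
		},
	}}
	sortSamples(series)
	req := WriteRequest{Timeseries: series}
	fmt.Printf("%+v\n", req.Timeseries[0].Samples) // [{1 100} {2 200} {3 300}]
}
```

Sorting each series' Samples slice by Timestamp before the request is marshalled is enough to avoid the "out of order samples" rejection for that request.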
bogdandrutu pushed a commit to open-telemetry/opentelemetry-collector that referenced this issue Apr 15, 2021
…t of order errors (#2941)

Fixes #2315
Fixes open-telemetry/prometheus-interoperability-spec#10
@alolita alolita added this to the Phase 1 Implementation milestone Jun 18, 2021
Projects
None yet
4 participants