
Enable sharding for active_series requests #6784

Merged: 27 commits merged into main from felix/shard-active-series on Jan 4, 2024

Conversation

@flxbk (Contributor) commented Nov 30, 2023

What this PR does

To build a response for an /active_series request, queriers need to hold the full series set in memory for deduplication. For selectors that return a large set of series this can consume a lot of memory, as allocations are proportional to the size of the set.

This PR allows sharding these requests in the frontend so that each querier only needs to bring part of the series set into memory for deduplication. The frontend then interleaves the partial responses into a single set that is returned to the client. It also introduces a response size limit for active series responses in queriers to prevent unbounded allocations and OOMs.
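
For illustration, a much-simplified sketch of the interleaving step. This is not Mimir's actual code: the helper name is hypothetical, and it assumes newline-delimited JSON items where the real response is a JSON array. The point is that, because shards partition the series set disjointly, items can be copied through one at a time and the frontend never has to hold the full set in memory:

```go
import (
	"encoding/json"
	"io"
)

// mergeShardResponses streams each shard's partial response into a single
// output. Sketch only: shards partition the series set, so no cross-shard
// deduplication is needed and items can be forwarded as they are decoded.
func mergeShardResponses(w io.Writer, shards []io.Reader) error {
	enc := json.NewEncoder(w)
	for _, shard := range shards {
		dec := json.NewDecoder(shard)
		for {
			var item map[string]string // one series' label set, simplified
			if err := dec.Decode(&item); err == io.EOF {
				break
			} else if err != nil {
				return err
			}
			if err := enc.Encode(item); err != nil {
				return err
			}
		}
	}
	return nil
}
```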

This PR also introduces a dedicated roundtripper for active series requests that bypasses the generic cache. Since this endpoint is supposed to return "fresh" data, setups with a large cache TTL could serve outdated results if the cache were enabled and not explicitly bypassed.

Which issue(s) this PR fixes or relates to

n/a

Checklist

  • Tests updated.
  • Documentation added.
  • CHANGELOG.md updated - the order of entries should be [CHANGE], [FEATURE], [ENHANCEMENT], [BUGFIX].
  • about-versioning.md updated with experimental features.

@flxbk flxbk force-pushed the felix/shard-active-series branch 17 times, most recently from 4e94fba to 0c08d3c Compare December 4, 2023 11:53
@flxbk flxbk marked this pull request as ready for review December 4, 2023 15:59
@flxbk flxbk requested review from a team as code owners December 4, 2023 15:59
@Logiraptor (Contributor)

> In an earlier version of this PR I used a tenant's query_sharding_total_shards to set the shard count. This proved very inefficient, as queriers would be flooded with requests that spend most of their life waiting in the queue.

Interesting, I guess that makes sense in hindsight, since this API is used for a huge range of response sizes. Even in huge tenants, many requests will return fewer than 10 series, for example, and sharding those 12 times is mostly overhead.


This looks great! Thanks for working on it. My main feedback is about the potential for over-sharding. Say, for example, I send a request with Sharding-Control: 1000000; I believe the code as written will open 1M parallel requests, which could be an issue. I can think of two solutions to consider:

  1. We could bound the number of shards to query-frontend.query-sharding-max-sharded-queries (which defaults to 128). Arguably this limit is meant for promql, so I'm not sure in practice if that default also makes sense for active series queries. 128 * 5MB response limit is 640MB, which sounds like a lot, but I'm not sure how that translates to number of series. 🤔
  2. We could use concurrency.ForEachJob from dskit to process the shard requests with some bounded level of parallelism.

WDYT?
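
For reference, a minimal sketch of option 2 using dskit's concurrency helper (the fetchShardFn parameter is a placeholder for whatever issues the per-shard request; this is not the code in this PR):

```go
import (
	"context"

	"github.com/grafana/dskit/concurrency"
)

// fetchAllShards runs one job per shard with at most maxParallelism in
// flight at a time; the first error cancels the remaining jobs.
func fetchAllShards(ctx context.Context, numShards, maxParallelism int,
	fetchShardFn func(ctx context.Context, shard int) error) error {
	return concurrency.ForEachJob(ctx, numShards, maxParallelism, fetchShardFn)
}
```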

Comment on lines -414 to 424
```diff
-	strings.HasSuffix(path, cardinalityLabelValuesPathSuffix) ||
-	strings.HasSuffix(path, cardinalityActiveSeriesPathSuffix)
+	strings.HasSuffix(path, cardinalityLabelValuesPathSuffix)
 }
```
Contributor:

One of the side effects here is that we no longer apply the newCardinalityQueryCacheRoundTripper middleware to the active series endpoint; is that intended?

Contributor (Author):

Yes, that's intended; I've tried to explain the reasoning in the PR description:

> This PR also introduces a dedicated roundtripper for active series requests that bypasses the generic cache. Since this endpoint is supposed to return "fresh" data, setups with a large cache TTL could serve outdated results if the cache were enabled and not explicitly bypassed.
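
To illustrate the bypass idea (names are hypothetical and the actual wiring in this PR differs), requests to the active series path can be routed to a chain that simply has no cache middleware in it:

```go
import (
	"net/http"
	"strings"
)

type roundTripperFunc func(*http.Request) (*http.Response, error)

func (f roundTripperFunc) RoundTrip(r *http.Request) (*http.Response, error) { return f(r) }

// newActiveSeriesRouter sends /active_series requests down a dedicated
// chain without the results-cache middleware, so responses always reflect
// fresh data; everything else keeps going through the cached chain.
func newActiveSeriesRouter(cached, activeSeries http.RoundTripper) http.RoundTripper {
	return roundTripperFunc(func(req *http.Request) (*http.Response, error) {
		if strings.HasSuffix(req.URL.Path, "/active_series") {
			return activeSeries.RoundTrip(req)
		}
		return cached.RoundTrip(req)
	})
}
```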

@colega (Contributor) left a comment

LGTM, great work.

As discussed on Slack, my main concern is that nothing actually checks that we're not loading all shards into memory before responding. Can we add a test for that? You could orchestrate the next middlewares so that we check we're getting a response before the second shard's next starts writing its JSON.
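
A sketch of what such a test could look like, reusing the roundTripperFunc adapter from the earlier sketch. The sharding middleware and the shardIndexOf, shardResponseFor, newShardingMiddleware, and newActiveSeriesRequest helpers are placeholders, not the ones added in this PR. The second shard's downstream blocks until the caller has read the start of the merged body, so the test can only succeed if the middleware streams instead of buffering all shards first:

```go
func TestActiveSeriesResponseIsStreamed(t *testing.T) {
	releaseSecondShard := make(chan struct{})
	// Placeholder downstream: shard 0 responds immediately, shard 1 blocks
	// until the test has started reading the merged response.
	next := roundTripperFunc(func(req *http.Request) (*http.Response, error) {
		if shardIndexOf(req) > 0 {
			<-releaseSecondShard
		}
		return shardResponseFor(req), nil
	})

	// If the middleware buffered all shards before responding, RoundTrip
	// would block forever here, because shard 1 is held back by the test.
	resp, err := newShardingMiddleware(next).RoundTrip(newActiveSeriesRequest(t))
	require.NoError(t, err)

	buf := make([]byte, 1)
	_, err = resp.Body.Read(buf) // first byte must arrive before shard 1 runs
	require.NoError(t, err)

	close(releaseSecondShard) // let the remaining shard finish
	_, err = io.ReadAll(resp.Body)
	require.NoError(t, err)
}
```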

@colega (Contributor) commented Jan 4, 2024

Thanks for adding the test.

@flxbk flxbk merged commit 7de79e5 into main Jan 4, 2024
28 checks passed
@flxbk flxbk deleted the felix/shard-active-series branch January 4, 2024 11:52