Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Overview
This pull request adds parallel execution of aggregate
SELECT
statements. Iterators within each shard are split using theNewParallelMergeIterator()
function which processes iterator batches based on the number of logical cores on the system./cc @jwilder @toddboom
Results
On an AWS
c4.8xlarge
machine, performance of a simpleCOUNT()
query over 1B points and 100K series drops from1m57s
to16.8s
. The execution of the count itself is significantly improved, however, the bottleneck on the query is now in the single-threaded planning phase for the 100K series. Parallelizing the planning phase should improve performance significantly for large numbers of series.TODO