Check ahead if we can get the count #13899

LuXugang · 2024-10-12T06:44:54Z

Check whether the IndexSortSortedNumericDocValuesRangeQuery's count can be obtained before traversing the BKD tree or performing binary search using DocValues.

see #13890

jpountz

The logic makes sense to me but it's a bit hard to read, could we avoid touching getDocIdSetIteratorOrNull and only have new logic in the Weight#count impl?

LuXugang · 2024-10-21T09:41:43Z

The logic makes sense to me but it's a bit hard to read, could we avoid touching getDocIdSetIteratorOrNull and only have new logic in the Weight#count impl?

Thank you for your feedback! @jpountz I really appreciate your suggestion. I’ve made the changes as you recommended

jpountz

Thanks, the code makes sense and I see that we already have test coverage for counting for our various optimizations.

jpountz · 2024-10-24T15:34:41Z

lucene/core/src/java/org/apache/lucene/search/IndexSortSortedNumericDocValuesRangeQuery.java

-          IteratorAndCount itAndCount = getDocIdSetIteratorOrNull(context);
+          if (lowerValue > upperValue) {
+            return 0;
+          }


This could be moved before the check of whether the segment has deletes?

if lowerValue > upperValue and has deletes, PointRangeQuery's count will reture -1.
I’ll temporarily maintain the consistency of the two queries for now. Later, we can consider whether to add this optimization to the PointRangeQuery. @jpountz

update

Currently, we traverse the BKD tree or perform a binary search using DocValues first, and then check whether the count can be obtained in the count() method of IndexSortSortedNumericDocValuesRangeQuery. we should consider providing a mechanism to perform this check beforehand, avoid unnecessary processing when dealing with a sparseRange

LuXugang requested a review from jpountz October 12, 2024 06:45

LuXugang linked an issue Oct 12, 2024 that may be closed by this pull request

Check ahead of time if the count can be obtained #13890

Closed

LuXugang force-pushed the earlyterminationCout branch from efd0115 to 980f7ea Compare October 13, 2024 03:24

jpountz reviewed Oct 15, 2024

View reviewed changes

LuXugang force-pushed the earlyterminationCout branch from 980f7ea to 105e3c2 Compare October 21, 2024 09:38

LuXugang requested a review from jpountz October 24, 2024 08:04

jpountz approved these changes Oct 24, 2024

View reviewed changes

LuXugang added 4 commits October 25, 2024 13:54

change request

2b999d8

update

5776b52

check bound ahead

880097c

add changes

fa7274a

LuXugang force-pushed the earlyterminationCout branch from c0b5a35 to fa7274a Compare October 25, 2024 05:59

update

7cf54b2

update

LuXugang merged commit 0bbef32 into apache:main Oct 25, 2024
3 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Check ahead if we can get the count #13899

Check ahead if we can get the count #13899

LuXugang commented Oct 12, 2024

jpountz left a comment

LuXugang commented Oct 21, 2024

jpountz left a comment

jpountz Oct 24, 2024

LuXugang Oct 25, 2024

Check ahead if we can get the count #13899

Check ahead if we can get the count #13899

Conversation

LuXugang commented Oct 12, 2024

jpountz left a comment

Choose a reason for hiding this comment

LuXugang commented Oct 21, 2024

jpountz left a comment

Choose a reason for hiding this comment

jpountz Oct 24, 2024

Choose a reason for hiding this comment

LuXugang Oct 25, 2024

Choose a reason for hiding this comment