gf2121 commented on PR #14176:
URL: https://github.com/apache/lucene/pull/14176#issuecomment-2636202013
**Some new progress**
> Luceneutil now can load 3 implementors of IntersectVisitor: RangeQuery
Visitor, RangeQuery InverseVisitor and DynamicPruning Visitor. Here is the
result on wikimediumall and taskCountPerCat=5
TaskQPS baseline StdDevQPS
my_modified_version StdDev Pct diff p-value
TermDayOfYearSort 259.87 (3.9%) 269.26
(4.2%) 3.6% ( -4% - 12%) 0.005
CountFilteredIntNRQ 61.70 (7.1%) 85.00
(2.0%) 37.8% ( 26% - 50%) 0.000
TermDTSort 149.65 (6.2%) 232.85
(9.6%) 55.6% ( 37% - 76%) 0.000
FilteredIntNRQ 82.76 (10.0%) 135.48
(3.7%) 63.7% ( 45% - 85%) 0.000
IntNRQ 84.62 (10.5%) 139.05
(2.6%) 64.3% ( 46% - 86%) 0.000
The previous result was got by `taskRepeatCount=20` . I find that when i
increase taskRepeatCount to 50, the speedup disappeared:
```
TaskQPS baseline StdDevQPS
my_modified_version StdDev Pct diff p-value
TermDayOfYearSort 196.21 (8.7%) 194.85
(11.2%) -0.7% ( -18% - 21%) 0.871
CountFilteredIntNRQ 84.92 (13.1%) 84.84
(12.1%) -0.1% ( -22% - 28%) 0.987
IntNRQ 137.14 (20.2%) 137.30
(18.4%) 0.1% ( -31% - 48%) 0.989
FilteredIntNRQ 134.41 (20.0%) 135.05
(18.1%) 0.5% ( -31% - 48%) 0.954
TermDTSort 196.18 (9.0%) 201.19
(9.0%) 2.6% ( -14% - 22%) 0.506
```
When i introduce a new task running `PointInSetQuery` , the speedup appears
stably when `taskRepeatCount=50` or `taskRepeatCount=100`:
```
TaskQPS baseline StdDevQPS
my_modified_version StdDev Pct diff p-value
TermDayOfYearSort 200.79 (8.3%) 200.43
(8.5%) -0.2% ( -15% - 18%) 0.947
IntSet 1358.83 (3.5%) 1364.40
(3.6%) 0.4% ( -6% - 7%) 0.714
TermDTSort 200.55 (9.4%) 203.26
(8.2%) 1.3% ( -14% - 20%) 0.630
CountFilteredIntNRQ 61.05 (8.2%) 87.15
(7.9%) 42.7% ( 24% - 64%) 0.000
FilteredIntNRQ 82.13 (10.2%) 139.49
(9.7%) 69.9% ( 45% - 99%) 0.000
IntNRQ 83.22 (10.2%) 141.61
(9.5%) 70.2% ( 45% - 100%) 0.000
```
The PR to introduce `PointInSetQuery` Task:
https://github.com/mikemccand/luceneutil/pull/335. Maybe we should look into
merge it before this PR.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]