[jira] [Commented] (LUCENE-6756) Give MatchAllDocsQuery a dedicated BulkScorer

Adrien Grand (JIRA) Fri, 21 Aug 2015 08:29:32 -0700

    [ 
https://issues.apache.org/jira/browse/LUCENE-6756?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14706850#comment-14706850
 ]


Adrien Grand commented on LUCENE-6756:
--------------------------------------

I added a MatchAll task to wikimedium1m and hotspot looks happy:

{noformat}
                    TaskQPS baseline      StdDev   QPS patch      StdDev        
        Pct diff
                  Fuzzy2      103.55     (32.6%)       95.61     (35.5%)   
-7.7% ( -57% -   89%)
                  Fuzzy1      139.81     (13.1%)      132.03     (17.0%)   
-5.6% ( -31% -   28%)
                 Prefix3      374.46      (8.7%)      368.62      (7.4%)   
-1.6% ( -16% -   15%)
               OrHighLow      322.32      (7.0%)      320.66      (5.9%)   
-0.5% ( -12% -   13%)
               OrHighMed      257.31      (8.7%)      256.59      (4.7%)   
-0.3% ( -12% -   14%)
              OrHighHigh      202.24      (8.1%)      201.80      (6.2%)   
-0.2% ( -13% -   15%)
              HighPhrase      155.66      (4.3%)      155.48      (5.2%)   
-0.1% (  -9% -    9%)
             LowSpanNear      200.83      (5.5%)      200.68      (4.5%)   
-0.1% (  -9% -   10%)
              AndHighLow     1806.85      (5.2%)     1806.05      (8.9%)   
-0.0% ( -13% -   14%)
                HighTerm      573.21      (7.8%)      573.11      (6.6%)   
-0.0% ( -13% -   15%)
         LowSloppyPhrase      132.99      (4.6%)      132.98      (5.7%)   
-0.0% (  -9% -   10%)
             AndHighHigh      401.82      (4.2%)      402.76      (4.3%)    
0.2% (  -7% -    9%)
        HighSloppyPhrase      271.61      (5.7%)      273.46      (7.3%)    
0.7% ( -11% -   14%)
            HighSpanNear      107.11      (6.2%)      107.85      (5.2%)    
0.7% ( -10% -   12%)
               MedPhrase      186.57      (4.5%)      187.88      (4.9%)    
0.7% (  -8% -   10%)
               LowPhrase      402.46      (4.4%)      406.53      (3.5%)    
1.0% (  -6% -    9%)
         MedSloppyPhrase      233.49      (5.0%)      236.66      (3.4%)    
1.4% (  -6% -   10%)
                 MedTerm     1278.37      (8.9%)     1302.62      (6.4%)    
1.9% ( -12% -   18%)
                Wildcard      339.31      (8.8%)      346.33      (6.5%)    
2.1% ( -12% -   19%)
                 Respell      152.28      (9.2%)      155.51      (8.8%)    
2.1% ( -14% -   22%)
              AndHighMed      396.54      (8.1%)      407.13      (3.7%)    
2.7% (  -8% -   15%)
             MedSpanNear      565.97      (6.9%)      581.61      (5.3%)    
2.8% (  -8% -   16%)
                 LowTerm     3143.46     (14.2%)     3244.12      (8.8%)    
3.2% ( -17% -   30%)
                  IntNRQ       90.11     (11.4%)       93.16      (8.0%)    
3.4% ( -14% -   25%)
                MatchAll      117.18      (3.7%)      211.95     (30.9%)   
80.9% (  44% -  119%)
{noformat}

The fuzzy queries are a bit off but I see a lot of variance with these queries 
anyway, even without the change.

> Give MatchAllDocsQuery a dedicated BulkScorer
> ---------------------------------------------
>
>                 Key: LUCENE-6756
>                 URL: https://issues.apache.org/jira/browse/LUCENE-6756
>             Project: Lucene - Core
>          Issue Type: Improvement
>            Reporter: Adrien Grand
>            Assignee: Adrien Grand
>            Priority: Minor
>         Attachments: LUCENE-6756.patch, MABench.java
>
>
> MatchAllDocsQuery currently uses the default BulkScorer, which creates a 
> Scorer and iterates over matching doc IDs up to NO_MORE_DOCS. I tried to 
> build a dedicated BulkScorer, which seemed to help remove abstractions as it 
> helped improve throughput by a ~2x factor with simple collectors.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[jira] [Commented] (LUCENE-6756) Give MatchAllDocsQuery a dedicated BulkScorer

Reply via email to