[ 
https://issues.apache.org/jira/browse/SOLR-10524?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Cao Manh Dat updated SOLR-10524:
--------------------------------
    Attachment: SOLR-10524.patch

Updated patch for this ticket after a discussion with [~noble.paul] and 
[~shalinmangar]. Here are result of OverseerTest.testPerformance()

{panel:title=Without the patch}
Overseer loop finished processing: 
         avgRequestsPerSecond: 0.00809284358238982
         5minRateRequestsPerSecond: 0.0
         15minRateRequestsPerSecond: 0.0
         avgTimePerRequest: 123564881129000000
         medianRequestTime: 123564881129000000
         75thPcRequestTime: 123564881129000000
         95thPcRequestTime: 123564881129000000
         99thPcRequestTime: 123564881129000000
         999thPcRequestTime: 123564881129000000
op: am_i_leader, success: 3, failure: 0
         avgRequestsPerSecond: 0.024318192042511424
         5minRateRequestsPerSecond: 0.2726342664775392
         15minRateRequestsPerSecond: 0.35201956953766844
         avgTimePerRequest: 353111000000
         medianRequestTime: 116973000000
         75thPcRequestTime: 116973000000
         95thPcRequestTime: 1733875000000
         99thPcRequestTime: 1733875000000
         999thPcRequestTime: 1733875000000
op: update_state, success: 20011, failure: 0
         avgRequestsPerSecond: 162.28792277377633
         5minRateRequestsPerSecond: 106.44733871784089
         15minRateRequestsPerSecond: 89.86620980167666
         avgTimePerRequest: 213680000000
         medianRequestTime: 205539000000
         75thPcRequestTime: 221076000000
         95thPcRequestTime: 253206000000
         99thPcRequestTime: 282888000000
         999thPcRequestTime: 548583000000
op: state, success: 20001, failure: 0
         avgRequestsPerSecond: 162.44457624784178
         5minRateRequestsPerSecond: 107.66013079551965
         15minRateRequestsPerSecond: 91.18766381210062
         avgTimePerRequest: 13250000000
         medianRequestTime: 11459000000
         75thPcRequestTime: 16188000000
         95thPcRequestTime: 21414000000
         99thPcRequestTime: 39280000000
         999thPcRequestTime: 67098000000
{panel}

{panel:title=With the patch}
Overseer loop finished processing: 
         avgRequestsPerSecond: 0.00802836931576006
         5minRateRequestsPerSecond: 0.0
         15minRateRequestsPerSecond: 0.0
         avgTimePerRequest: 124556932520000000
         medianRequestTime: 124556932520000000
         75thPcRequestTime: 124556932520000000
         95thPcRequestTime: 124556932520000000
         99thPcRequestTime: 124556932520000000
         999thPcRequestTime: 124556932520000000
op: am_i_leader, success: 3, failure: 0
         avgRequestsPerSecond: 0.024113954682119472
         5minRateRequestsPerSecond: 0.2726342664775392
         15minRateRequestsPerSecond: 0.35201956953766844
         avgTimePerRequest: 306734000000
         medianRequestTime: 116296000000
         75thPcRequestTime: 116296000000
         95thPcRequestTime: 1417483000000
         99thPcRequestTime: 1417483000000
         999thPcRequestTime: 1417483000000
op: update_state, success: 52, failure: 0
         avgRequestsPerSecond: 0.4181288003958347
         5minRateRequestsPerSecond: 0.4
         15minRateRequestsPerSecond: 0.4
         avgTimePerRequest: 2117982000000
         medianRequestTime: 2054633000000
         75thPcRequestTime: 2212862000000
         95thPcRequestTime: 2648609000000
         99thPcRequestTime: 4582074000000
         999thPcRequestTime: 6145919000000
op: state, success: 20001, failure: 0
         avgRequestsPerSecond: 161.02141495173862
         5minRateRequestsPerSecond: 107.06882627730678
         15minRateRequestsPerSecond: 91.09679521134835
         avgTimePerRequest: 17483000000
         medianRequestTime: 16009000000
         75thPcRequestTime: 22093000000
         95thPcRequestTime: 32283000000
         99thPcRequestTime: 46404000000
         999thPcRequestTime: 117668000000
{panel}

As we can see, the number of update_state is much reduced from 20011 to 52.

> Explore in-memory partitioning for processing Overseer queue messages
> ---------------------------------------------------------------------
>
>                 Key: SOLR-10524
>                 URL: https://issues.apache.org/jira/browse/SOLR-10524
>             Project: Solr
>          Issue Type: Improvement
>      Security Level: Public(Default Security Level. Issues are Public) 
>            Reporter: Erick Erickson
>         Attachments: SOLR-10524.patch, SOLR-10524.patch, SOLR-10524.patch, 
> SOLR-10524.patch
>
>
> There are several JIRAs (I'll link in a second) about trying to be more 
> efficient about processing overseer messages as the overseer can become a 
> bottleneck, especially with very large numbers of replicas in a cluster. One 
> of the approaches mentioned near the end of SOLR-5872 (15-Mar) was to "read 
> large no:of items say 10000. put them into in memory buckets and feed them 
> into overseer....".
> This JIRA is to break out that part of the discussion as it might be an easy 
> win whereas "eliminating the Overseer queue" would be quite an undertaking.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to