[ 
https://issues.apache.org/jira/browse/SOLR-17058?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

wei wang updated SOLR-17058:
----------------------------
    Description: 
When distributed IDF is enabled in SolrCloud by adding one of the statsCache 
implementations to solrconfig.xml 
[https://solr.apache.org/guide/solr/latest/deployment-guide/solrcloud-distributed-requests.html#distributedidf],
each Solr query incurs a distributed shard request to fetch term statistics:

"debug": {
    "track": {
        "rid": "-54",
        "PARSE_QUERY": {
            "http://192.168.0.34:8987/solr/shard2_replica_n1/": {
                "QTime": "2",
                "ElapsedTime": "13",
                "RequestPurpose": "GET_TERM_STATS",
                …
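
To check whether a query triggered a stats round trip, the debug output can be scanned for shard requests whose purpose is GET_TERM_STATS. The following is a minimal sketch, assuming the response has already been parsed into a Python dict shaped like the fragment above. The helper name find_term_stats_requests and the sample dict are illustrative only; the sample carries just the fields visible in the fragment, and splitting RequestPurpose on commas is an assumption in case several purposes are joined.

```python
def find_term_stats_requests(node, path=()):
    """Recursively collect (path, entry) pairs whose RequestPurpose
    includes GET_TERM_STATS. `node` is parsed JSON (dicts/lists/scalars)."""
    hits = []
    if isinstance(node, dict):
        purpose = node.get("RequestPurpose")
        # Assumption: multiple purposes, if present, are comma-separated.
        if isinstance(purpose, str) and "GET_TERM_STATS" in purpose.split(","):
            hits.append((path, node))
        for key, value in node.items():
            hits.extend(find_term_stats_requests(value, path + (key,)))
    elif isinstance(node, list):
        for i, value in enumerate(node):
            hits.extend(find_term_stats_requests(value, path + (i,)))
    return hits


# Illustrative sample limited to the fields shown in the debug fragment.
sample_debug = {
    "track": {
        "rid": "-54",
        "PARSE_QUERY": {
            "http://192.168.0.34:8987/solr/shard2_replica_n1/": {
                "QTime": "2",
                "ElapsedTime": "13",
                "RequestPurpose": "GET_TERM_STATS",
            }
        },
    }
}

for path, entry in find_term_stats_requests(sample_debug):
    # path[-1] is the shard URL key under which the entry was found.
    print(path[-1], "ElapsedTime:", entry["ElapsedTime"])
```

Each hit corresponds to one shard request spent on term statistics for that query.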

 

For queries that do not use distributed IDF information for scoring, the 
stats request is unnecessary. For example, when retrieving docs by a terms 
filter:

 

http://localhost:8987/solr/collection1/select?q=*%3A*&wt=json&fq={!terms f=id}id1,id2
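
A request like this can also be assembled programmatically so that the local-params syntax in fq is percent-encoded correctly. This is a small sketch using only the Python standard library; the helper name terms_filter_url is hypothetical, and the host, port, and collection are taken from the example URL.

```python
from urllib.parse import urlencode, urlsplit, parse_qs


def terms_filter_url(select_url, field, ids):
    """Build a match-all query restricted by a {!terms} filter on `field`."""
    params = {
        "q": "*:*",
        "wt": "json",
        # Local-params syntax from the example: {!terms f=<field>}<id>,<id>,...
        "fq": "{!terms f=%s}%s" % (field, ",".join(ids)),
    }
    # urlencode percent-encodes braces, '!', '=', ',' and encodes the space as '+'.
    return select_url + "?" + urlencode(params)


url = terms_filter_url(
    "http://localhost:8987/solr/collection1/select", "id", ["id1", "id2"]
)
print(url)

# Decoding the query string gives back the original parameter values.
decoded = parse_qs(urlsplit(url).query)
print(decoded["fq"][0])
```

Such a query matches documents by id only and does no relevance scoring, so term statistics have no effect on the result.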

  Hence I propose adding a disableDistribStats request param so that the 
distributed stats request can be disabled at query time. 
 # disableDistribStats defaults to false. When the param is not present, the 
current distributed IDF behavior is unchanged. 
 # When disableDistribStats=true is set explicitly, the distributed stats call 
is skipped for the current query.  
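
The two rules above can be modeled as a small decision function. This is only a sketch of the proposed semantics, not Solr code: the function name should_fetch_distrib_stats and the stats_cache_configured flag are hypothetical, and request params are assumed to arrive as strings.

```python
def should_fetch_distrib_stats(params, stats_cache_configured=True):
    """Return True if the distributed stats request should be issued.

    params: dict mapping request param names to string values.
    stats_cache_configured: hypothetical flag standing in for
        "distributed IDF is enabled in solrconfig.xml".
    """
    if not stats_cache_configured:
        # Without distributed IDF there is no stats request to disable.
        return False
    # Rule 1: an absent param means false, so existing behavior is unchanged.
    raw = params.get("disableDistribStats", "false")
    # Rule 2: only an explicit true disables the stats call for this query.
    return raw.strip().lower() != "true"


print(should_fetch_distrib_stats({"q": "*:*"}))
print(should_fetch_distrib_stats({"q": "*:*", "disableDistribStats": "true"}))
```

Because the default is false, clients that never send the param see no change; only queries that opt in skip the stats request.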



> Request param to disable distributed stats request at query time
> ----------------------------------------------------------------
>
>                 Key: SOLR-17058
>                 URL: https://issues.apache.org/jira/browse/SOLR-17058
>             Project: Solr
>          Issue Type: New Feature
>      Security Level: Public(Default Security Level. Issues are Public) 
>          Components: query
>            Reporter: wei wang
>            Priority: Minor



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
