[ 
https://issues.apache.org/jira/browse/SOLR-17058?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mikhail Khludnev updated SOLR-17058:
------------------------------------
    Description: 
When distributed IDF is enabled in solr cloud by adding one of the cache 
implementations in solrconfig.xml 
[https://solr.apache.org/guide/solr/latest/deployment-guide/solrcloud-distributed-requests.html#distributedidf],
  each solr query will incur a distributed shard request to get term statistics

"debug": {

        "track": {

            "rid": "-54",

            "PARSE_QUERY": {

                "http://192.168.0.34:8987/solr/shard2_replica_n1/":

               { "QTime": “2”,                                                  
                                                      

                 "ElapsedTime": "13",                                           
                                                             

                 "RequestPurpose": "GET_TERM_STATS",     

                 …                             

 

     For queries that does not use distributed IDF information for scoring such 
as terms filter by id, the stats request is not necessary.  Hence I propose to 
add a {{distrib.statsCache}} request param so that the distributed stats 
request can be disabled at query time. 
 # {{distrib.statsCache}} defaults to {{false}}. When the param is not present, 
there is no change to current distributed IDF behavior. 
 # When explicitly set {{disableDistribStats=true}}, distributed stats call is 
disabled for the current query.  

  was:
When distributed IDF is enabled in solr cloud by adding one of the cache 
implementations in solrconfig.xml 
[https://solr.apache.org/guide/solr/latest/deployment-guide/solrcloud-distributed-requests.html#distributedidf],
  each solr query will incur a distributed shard request to get term statistics

"debug": {

        "track": {

            "rid": "-54",

            "PARSE_QUERY": {

                "http://192.168.0.34:8987/solr/shard2_replica_n1/":

               { "QTime": “2”,                                                  
                                                      

                 "ElapsedTime": "13",                                           
                                                             

                 "RequestPurpose": "GET_TERM_STATS",     

                 …                             

 

     For queries that does not use distributed IDF information for scoring such 
as terms filter by id, the stats request is not necessary.  Hence I propose to 
add a disableDistribStats request param so that the distributed stats request 
can be disabled at query time. 
 # disableDistribStats defaults to false. When the param is not present, there 
is no change to current distributed IDF behavior. 
 # When explicitly set disableDistribStats=true, distributed stats call is 
disabled for the current query.  


> Request param to disable distributed stats request at query time
> ----------------------------------------------------------------
>
>                 Key: SOLR-17058
>                 URL: https://issues.apache.org/jira/browse/SOLR-17058
>             Project: Solr
>          Issue Type: New Feature
>          Components: query
>            Reporter: wei wang
>            Assignee: Mikhail Khludnev
>            Priority: Minor
>             Fix For: 9.6.0
>
>          Time Spent: 3h 20m
>  Remaining Estimate: 0h
>
> When distributed IDF is enabled in solr cloud by adding one of the cache 
> implementations in solrconfig.xml 
> [https://solr.apache.org/guide/solr/latest/deployment-guide/solrcloud-distributed-requests.html#distributedidf],
>   each solr query will incur a distributed shard request to get term 
> statistics
> "debug": {
>         "track": {
>             "rid": "-54",
>             "PARSE_QUERY": {
>                 "http://192.168.0.34:8987/solr/shard2_replica_n1/":
>                { "QTime": “2”,                                                
>                                                         
>                  "ElapsedTime": "13",                                         
>                                                                
>                  "RequestPurpose": "GET_TERM_STATS",     
>                  …                             
>  
>      For queries that does not use distributed IDF information for scoring 
> such as terms filter by id, the stats request is not necessary.  Hence I 
> propose to add a {{distrib.statsCache}} request param so that the distributed 
> stats request can be disabled at query time. 
>  # {{distrib.statsCache}} defaults to {{false}}. When the param is not 
> present, there is no change to current distributed IDF behavior. 
>  # When explicitly set {{disableDistribStats=true}}, distributed stats call 
> is disabled for the current query.  



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to