[
https://issues.apache.org/jira/browse/SOLR-17058?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Mikhail Khludnev updated SOLR-17058:
------------------------------------
Description:
When distributed IDF is enabled in solr cloud by adding one of the cache
implementations in solrconfig.xml
[https://solr.apache.org/guide/solr/latest/deployment-guide/solrcloud-distributed-requests.html#distributedidf],
each solr query will incur a distributed shard request to get term statistics
"debug": {
"track": {
"rid": "-54",
"PARSE_QUERY": {
"http://192.168.0.34:8987/solr/shard2_replica_n1/":
{ "QTime": “2”,
"ElapsedTime": "13",
"RequestPurpose": "GET_TERM_STATS",
…
For queries that does not use distributed IDF information for scoring such
as terms filter by id, the stats request is not necessary. Hence I propose to
add a {{distrib.statsCache}} request param so that the distributed stats
request can be disabled at query time.
# {{distrib.statsCache}} defaults to {{false}}. When the param is not present,
there is no change to current distributed IDF behavior.
# When explicitly set {{disableDistribStats=true}}, distributed stats call is
disabled for the current query.
was:
When distributed IDF is enabled in solr cloud by adding one of the cache
implementations in solrconfig.xml
[https://solr.apache.org/guide/solr/latest/deployment-guide/solrcloud-distributed-requests.html#distributedidf],
each solr query will incur a distributed shard request to get term statistics
"debug": {
"track": {
"rid": "-54",
"PARSE_QUERY": {
"http://192.168.0.34:8987/solr/shard2_replica_n1/":
{ "QTime": “2”,
"ElapsedTime": "13",
"RequestPurpose": "GET_TERM_STATS",
…
For queries that does not use distributed IDF information for scoring such
as terms filter by id, the stats request is not necessary. Hence I propose to
add a disableDistribStats request param so that the distributed stats request
can be disabled at query time.
# disableDistribStats defaults to false. When the param is not present, there
is no change to current distributed IDF behavior.
# When explicitly set disableDistribStats=true, distributed stats call is
disabled for the current query.
> Request param to disable distributed stats request at query time
> ----------------------------------------------------------------
>
> Key: SOLR-17058
> URL: https://issues.apache.org/jira/browse/SOLR-17058
> Project: Solr
> Issue Type: New Feature
> Components: query
> Reporter: wei wang
> Assignee: Mikhail Khludnev
> Priority: Minor
> Fix For: 9.6.0
>
> Time Spent: 3h 20m
> Remaining Estimate: 0h
>
> When distributed IDF is enabled in solr cloud by adding one of the cache
> implementations in solrconfig.xml
> [https://solr.apache.org/guide/solr/latest/deployment-guide/solrcloud-distributed-requests.html#distributedidf],
> each solr query will incur a distributed shard request to get term
> statistics
> "debug": {
> "track": {
> "rid": "-54",
> "PARSE_QUERY": {
> "http://192.168.0.34:8987/solr/shard2_replica_n1/":
> { "QTime": “2”,
>
> "ElapsedTime": "13",
>
> "RequestPurpose": "GET_TERM_STATS",
> …
>
> For queries that does not use distributed IDF information for scoring
> such as terms filter by id, the stats request is not necessary. Hence I
> propose to add a {{distrib.statsCache}} request param so that the distributed
> stats request can be disabled at query time.
> # {{distrib.statsCache}} defaults to {{false}}. When the param is not
> present, there is no change to current distributed IDF behavior.
> # When explicitly set {{disableDistribStats=true}}, distributed stats call
> is disabled for the current query.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]