[ 
https://issues.apache.org/jira/browse/SOLR-6319?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14085406#comment-14085406
 ] 

Yonik Seeley commented on SOLR-6319:
------------------------------------

It's not clear to me if you've hit something new, or something that has already 
been considered/documented.

Here's the current comments for reference:
{code}
            // we're sorting by index order.
            // if minCount==0, we should always be able to get accurate results 
w/o over-requesting or refining
            // if minCount==1, we should be able to get accurate results w/o 
over-requesting, but we'll need to refine
            // if minCount==n (>1), we can set the initialMincount to 
minCount/nShards, rounded up.
            // For example, we know that if minCount=10 and we have 3 shards, 
then at least one shard must have a count of 4 for the term
            // For the minCount>1 case, we can generate too short of a list 
(miss terms at the end of the list) unless limit==-1
            // For example: each shard could produce a list of top 10, but some 
of those could fail to make it into the combined list (i.e.
            //   we needed to go beyond the top 10 to generate the top 10 
combined).  Overrequesting can help a little here, but not as
            //   much as when sorting by count.
{code}

It's been years since I wrote that, but IIRC the thinking was that over 
requesting when sorting by index was probably not worth it.  It's a judgement 
call, and shouldn't be categorized as a bug (if I'm understanding this issue 
correctly). 

> if mincount > 1, facet.field needs to overrequest even if facet.sort=index
> --------------------------------------------------------------------------
>
>                 Key: SOLR-6319
>                 URL: https://issues.apache.org/jira/browse/SOLR-6319
>             Project: Solr
>          Issue Type: Bug
>            Reporter: Hoss Man
>            Assignee: Hoss Man
>
> Discovered this while working on SOLR-2894.  the logic for distributed 
> faceting ignores over requesting (beyond the user specified facet.limit) if 
> the facet.sort is index order -- but the rationale for doing this falls apart 
> if the user has specified a facet.mincount > 1



--
This message was sent by Atlassian JIRA
(v6.2#6252)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to