[ 
https://issues.apache.org/jira/browse/SOLR-10552?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15982004#comment-15982004
 ] 

ASF subversion and git services commented on SOLR-10552:
--------------------------------------------------------

Commit 71ce0d31a6a907bf1566fc51324d5f26e4205c21 in lucene-solr's branch 
refs/heads/master from [[email protected]]
[ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=71ce0d3 ]

SOLR-10548: SOLR-10552: numBuckets should use hll and ignore mincount>1 
filtering


> numBuckets is not consistent between distrib and non-distrib requests
> ---------------------------------------------------------------------
>
>                 Key: SOLR-10552
>                 URL: https://issues.apache.org/jira/browse/SOLR-10552
>             Project: Solr
>          Issue Type: Bug
>      Security Level: Public(Default Security Level. Issues are Public) 
>    Affects Versions: 5.1
>            Reporter: Yonik Seeley
>         Attachments: SOLR-10552.patch
>
>
> The main problem is mincount: in a non-distrib query, numBuckets reflects 
> the number of buckets that remain after mincount is applied.  In 
> distributed mode, we can't do this (or rather, the only way to do it would be 
> to transmit all bucket counts to an aggregator node).
> We should perhaps just make numBuckets always pre-mincount to be consistent, 
> and use HyperLogLog (HLL) by default?
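
To illustrate the reasoning in the description, here is a minimal sketch (not Solr's code) of why a post-mincount numBuckets cannot be assembled from per-shard answers, while a pre-mincount numBuckets can. The shard contents, the `num_buckets` helper, and all variable names are hypothetical, and an exact set union stands in for the HyperLogLog sketch that the issue proposes.

```python
from collections import Counter


def num_buckets(counts, mincount=1):
    """Number of buckets whose count is at least mincount."""
    return sum(1 for c in counts.values() if c >= mincount)


# Hypothetical per-shard bucket counts for one facet field.
shard_a = Counter({"x": 1, "y": 1})
shard_b = Counter({"x": 1, "z": 1})

mincount = 2

# Each shard applies mincount locally and reports only its own numBuckets.
# Neither shard sees a bucket with count >= 2, so both report 0.
per_shard = [num_buckets(s, mincount) for s in (shard_a, shard_b)]

# The exact post-mincount answer needs the merged count of every bucket,
# i.e. all bucket counts must be sent to the aggregator.
merged = shard_a + shard_b
exact_post_mincount = num_buckets(merged, mincount)

# Pre-mincount numBuckets is the number of distinct bucket values.
# Distinct-value summaries merge cleanly across shards: an exact set union
# is used here; a HyperLogLog sketch would approximate the same quantity.
pre_mincount = len(set(shard_a) | set(shard_b))

print(per_shard, exact_post_mincount, pre_mincount)
```

In this toy case no combination of the per-shard values recovers the merged post-mincount result, because bucket "x" only crosses the threshold once its counts are summed. The pre-mincount count depends only on which bucket values exist, so it can be merged without shipping every count.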



