[ https://issues.apache.org/jira/browse/SOLR-2403?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13007513#comment-13007513 ]
Toke Eskildsen commented on SOLR-2403:
--------------------------------------

My first example was hills, while the second was spikes, where I agree that the divide-mincount-by-shard# or something similar works well. As it comes down to the distribution of counts vs. mincount, we seem to be left with the unsatisfying "it depends, but avoid using mincounts around the average count"-answer.

I forgot about the refinement phase. That would ensure that my suggestion of a collector with two separate mincounts would return the correct result for counts as well as terms, as long as it did not exceed the given limits. Alas, it still only helps somewhat and might not be worth the hassle.

> Problem with facet.sort=lex, shards, and facet.mincount
> -------------------------------------------------------
>
> Key: SOLR-2403
> URL: https://issues.apache.org/jira/browse/SOLR-2403
> Project: Solr
> Issue Type: Bug
> Components: search
> Affects Versions: 4.0
> Environment: RHEL5, Ubuntu 10.04
> Reporter: Peter Cline
>
> I tested this on a recent trunk snapshot (2/25), haven't verified with 3.1 or
> 1.4.1. I can if necessary and update.
> Solr is not returning the proper number of facet values when sorting
> alphabetically, using distributed search, and using a facet.mincount that
> excludes some of the values in the first facet.limit values.
> Easiest explained by example. Sorting alphabetically, the first 20 values
> for my "subject_facet" field have few documents. 19 facet values have only 1
> document associated, and 1 has 2 documents. There are plenty after that have
> more than 2.
> {code}
> http://localhost:8082/solr/select?q=*:*&facet=true&facet.field=subject_facet&facet.limit=20&facet.sort=lex&facet.mincount=2
> {code}
> comes back with the expected 20 facet values with >= 2 documents associated.
> If I add a shards parameter that points back to itself, the result is
> different.
> {code}
> http://localhost:8082/solr/select?q=*:*&facet=true&facet.field=subject_facet&facet.limit=20&facet.sort=lex&facet.mincount=2&shards=localhost:8082/solr
> {code}
> comes back with only 1 facet value: the single value in the first 20 that had
> more than 1 document.
> It appears to me that mincount is ignored when doing the original query to
> the shards, then applied afterwards.
> Let me know if you need any more info.
> Thanks,
> Peter

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org
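
For illustration only, here is a toy model of the behaviour Peter describes, together with the divide-mincount-by-shard# idea from the comment. It is a sketch in plain Python and not Solr code: the functions shard_facets and distributed_facets, the shard_mincount parameter, and the sample data are all hypothetical and only mimic the shape of the report (19 terms with 1 document and 1 term with 2 documents among the first 20 in lexical order, with larger counts after that).

```python
from collections import Counter
from math import ceil


def shard_facets(counts, limit, mincount):
    """Hypothetical shard: first `limit` terms in lexical order with count >= mincount."""
    terms = sorted(t for t, c in counts.items() if c >= mincount)
    return {t: counts[t] for t in terms[:limit]}


def distributed_facets(shards, limit, mincount, shard_mincount):
    """Hypothetical coordinator: sum the shard responses, then apply mincount and limit."""
    merged = Counter()
    for counts in shards:
        merged.update(shard_facets(counts, limit, shard_mincount))
    terms = sorted(t for t, c in merged.items() if c >= mincount)
    return [(t, merged[t]) for t in terms[:limit]]


# Made-up data shaped like the report: the first 20 terms have 1 document each,
# except one term with 2 documents; the terms that follow have 3 documents each.
index = {f"a{i:02d}": 1 for i in range(20)}
index["a07"] = 2
index.update({f"b{i:02d}": 3 for i in range(30)})

shards = [index]  # a single shard that points back at itself

# mincount ignored on the shard, applied only after merging (the reported symptom)
buggy = distributed_facets(shards, limit=20, mincount=2, shard_mincount=0)

# shard-level mincount of ceil(mincount / number of shards)
relaxed = distributed_facets(
    shards, limit=20, mincount=2, shard_mincount=ceil(2 / len(shards))
)

print(len(buggy), len(relaxed))
```

In this model the first call yields a single facet value and the second yields the full 20. With more than one shard, a term whose total reaches mincount must have at least ceil(mincount / shards) documents on some shard, so the relaxed threshold does not lose terms, but a shard can still omit a term it holds below that threshold. The merged count can then be too low unless a refinement phase asks the shards for the missing counts, which is the "hills" case where the approach only helps somewhat.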