[ https://issues.apache.org/jira/browse/SOLR-2242?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13071147#comment-13071147 ]
Ryan McKinley commented on SOLR-2242: ------------------------------------- bq. The simplest option seems to be to return the max constraint count taken from all the shards That seems reasonable -- though I think we would also want to be able to have the sum when you know that all shards have unique values. I don't think bill is referring to the accuracy/meaning of distinct count in distributed search. His problem is that if we change the output format, we also need to update the code that collects the various values and passes them along. This patch just add a magic value (numFacetTerms) to the count list so that the value is handled with existing distributed response parsing. This is a fine one-off solution, but I am -1 for adding any more magic field names to solr. To add this feature, i think we need to bite the bullet and update the facet response format. > Get distinct count of names for a facet field > --------------------------------------------- > > Key: SOLR-2242 > URL: https://issues.apache.org/jira/browse/SOLR-2242 > Project: Solr > Issue Type: New Feature > Components: Response Writers > Affects Versions: 4.0 > Reporter: Bill Bell > Assignee: Simon Willnauer > Priority: Minor > Fix For: 4.0 > > Attachments: NumFacetTermsFacetsTest.java, > SOLR-2242-notworkingtest.patch, SOLR-2242.patch, SOLR-2242.patch, > SOLR-2242.shard.patch, SOLR-2242.shard.patch, > SOLR-2242.shard.withtests.patch, SOLR-2242.solr3.1.patch, > SOLR.2242.solr3.1.patch, SOLR.2242.v2.patch > > > When returning facet.field=<name of field> you will get a list of matches for > distinct values. This is normal behavior. This patch tells you how many > distinct values you have (# of rows). Use with limit=-1 and mincount=1. > The feature is called "namedistinct". Here is an example: > http://localhost:8983/solr/select?shards=localhost:8983/solr,localhost:7574/solr&indent=true&q=*:*&facet=true&facet.mincount=1&facet.numFacetTerms=2&facet.limit=-1&facet.field=price > http://localhost:8983/solr/select?shards=localhost:8983/solr,localhost:7574/solr&indent=true&q=*:*&facet=true&facet.mincount=1&facet.numFacetTerms=0&facet.limit=-1&facet.field=price > http://localhost:8983/solr/select?shards=localhost:8983/solr,localhost:7574/solr&indent=true&q=*:*&facet=true&facet.mincount=1&facet.numFacetTerms=1&facet.limit=-1&facet.field=price > This currently only works on facet.field. > {code} > <lst name="facet_fields"> > <lst name="price"> > <int name="numFacetTerms">14</int> > <int name="0.0">3</int><int name="11.5">1</int><int > name="19.95">1</int><int name="74.99">1</int><int name="92.0">1</int><int > name="179.99">1</int><int name="185.0">1</int><int name="279.95">1</int><int > name="329.95">1</int><int name="350.0">1</int><int name="399.0">1</int><int > name="479.95">1</int><int name="649.99">1</int><int name="2199.0">1</int> > </lst> > </lst> > {code} > Several people use this to get the group.field count (the # of groups). -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org