[ 
https://issues.apache.org/jira/browse/SOLR-2242?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13071147#comment-13071147
 ] 

Ryan McKinley commented on SOLR-2242:
-------------------------------------

bq. The simplest option seems to be to return the max constraint count taken 
from all the shards

That seems reasonable -- though I think we would also want to be able to have 
the sum when you know that all shards have unique values.

I don't think bill is referring to the accuracy/meaning of distinct count in 
distributed search.  His problem is that if we change the output format, we 
also need to update the code that collects the various values and passes them 
along.  This patch just add a magic value (numFacetTerms) to the count list so 
that the value is handled with existing distributed response parsing.  This is 
a fine one-off solution, but I am -1 for adding any more magic field names to 
solr.  To add this feature, i think we need to bite the bullet and update the 
facet response format.



> Get distinct count of names for a facet field
> ---------------------------------------------
>
>                 Key: SOLR-2242
>                 URL: https://issues.apache.org/jira/browse/SOLR-2242
>             Project: Solr
>          Issue Type: New Feature
>          Components: Response Writers
>    Affects Versions: 4.0
>            Reporter: Bill Bell
>            Assignee: Simon Willnauer
>            Priority: Minor
>             Fix For: 4.0
>
>         Attachments: NumFacetTermsFacetsTest.java, 
> SOLR-2242-notworkingtest.patch, SOLR-2242.patch, SOLR-2242.patch, 
> SOLR-2242.shard.patch, SOLR-2242.shard.patch, 
> SOLR-2242.shard.withtests.patch, SOLR-2242.solr3.1.patch, 
> SOLR.2242.solr3.1.patch, SOLR.2242.v2.patch
>
>
> When returning facet.field=<name of field> you will get a list of matches for 
> distinct values. This is normal behavior. This patch tells you how many 
> distinct values you have (# of rows). Use with limit=-1 and mincount=1.
> The feature is called "namedistinct". Here is an example:
> http://localhost:8983/solr/select?shards=localhost:8983/solr,localhost:7574/solr&indent=true&q=*:*&facet=true&facet.mincount=1&facet.numFacetTerms=2&facet.limit=-1&facet.field=price
> http://localhost:8983/solr/select?shards=localhost:8983/solr,localhost:7574/solr&indent=true&q=*:*&facet=true&facet.mincount=1&facet.numFacetTerms=0&facet.limit=-1&facet.field=price
> http://localhost:8983/solr/select?shards=localhost:8983/solr,localhost:7574/solr&indent=true&q=*:*&facet=true&facet.mincount=1&facet.numFacetTerms=1&facet.limit=-1&facet.field=price
> This currently only works on facet.field.
> {code}
> <lst name="facet_fields">
>   <lst name="price">
>     <int name="numFacetTerms">14</int>
>     <int name="0.0">3</int><int name="11.5">1</int><int 
> name="19.95">1</int><int name="74.99">1</int><int name="92.0">1</int><int 
> name="179.99">1</int><int name="185.0">1</int><int name="279.95">1</int><int 
> name="329.95">1</int><int name="350.0">1</int><int name="399.0">1</int><int 
> name="479.95">1</int><int name="649.99">1</int><int name="2199.0">1</int>
>   </lst>
> </lst>
> {code} 
> Several people use this to get the group.field count (the # of groups).

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to