[ https://issues.apache.org/jira/browse/SOLR-3642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13418890#comment-13418890 ]
Yandong Yao commented on SOLR-3642: ----------------------------------- Hi Hoss, Thanks for the quick commit, one further question: if i would like to implement stats with facet field which is multi-valued field, would you please provide some guidance on this? Currently StatsComponent don't support multivalued facet field because it is using FieldCache which don't support multivalued field. Any alternatives? If it is possible, I would like to create a JIRA issue for it and try to work on it. Thanks! Regards, Yandong > Count is inconsistent between facet and stats > --------------------------------------------- > > Key: SOLR-3642 > URL: https://issues.apache.org/jira/browse/SOLR-3642 > Project: Solr > Issue Type: Bug > Components: SearchComponents - other > Affects Versions: 4.0-ALPHA > Environment: 4.0 alpha on macos 10.6 > Reporter: Yandong Yao > Assignee: Hoss Man > Fix For: 4.0, 5.0 > > Attachments: SOLR-3642.patch > > > Steps to reproduce: > 1) Download apache-solr-4.0.0-ALPHA > 2) cd example; java -jar start.jar > 3) cd exampledocs; ./post.sh *.xml > 4) Use statsComponent to get the stats info for field 'popularity' based on > facet 'cat'. And the 'count' for 'electronics' is 3 > http://localhost:8983/solr/collection1/select?q=cat:electronics&wt=json&rows=0&stats=true&stats.field=popularity&stats.facet=cat > { > stats_fields: > { > popularity: > { > min: 0, > max: 10, > count: 14, > missing: 0, > sum: 75, > sumOfSquares: 503, > mean: 5.357142857142857, > stddev: 2.7902892835178013, > facets: > { > cat: > { > music: > { > min: 10, > max: 10, > count: 1, > missing: 0, > sum: 10, > sumOfSquares: 100, > mean: 10, > stddev: 0 > }, > monitor: > { > min: 6, > max: 6, > count: 2, > missing: 0, > sum: 12, > sumOfSquares: 72, > mean: 6, > stddev: 0 > }, > hard drive: > { > min: 6, > max: 6, > count: 2, > missing: 0, > sum: 12, > sumOfSquares: 72, > mean: 6, > stddev: 0 > }, > scanner: > { > min: 6, > max: 6, > count: 1, > missing: 0, > sum: 6, > sumOfSquares: 36, > mean: 6, > stddev: 0 > }, > memory: > { > min: 0, > max: 7, > count: 3, > missing: 0, > sum: 12, > sumOfSquares: 74, > mean: 4, > stddev: 3.605551275463989 > }, > graphics card: > { > min: 7, > max: 7, > count: 2, > missing: 0, > sum: 14, > sumOfSquares: 98, > mean: 7, > stddev: 0 > }, > electronics: > { > min: 1, > max: 7, > count: 3, > missing: 0, > sum: 9, > sumOfSquares: 51, > mean: 3, > stddev: 3.4641016151377544 > } > } > } > } > } > } > 5) Facet on 'cat' and the count is 14. > http://localhost:8983/solr/collection1/select?q=cat:electronics&wt=json&rows=0&facet=true&facet.field=cat > { > cat: > [ > "electronics", > 14, > "memory", > 3, > "connector", > 2, > "graphics card", > 2, > "hard drive", > 2, > "monitor", > 2, > "camera", > 1, > "copier", > 1, > "multifunction printer", > 1, > "music", > 1, > "printer", > 1, > "scanner", > 1, > "currency", > 0, > "search", > 0, > "software", > 0 > ] > }, > So from StatsComponent the count for 'electronics' cat is 3, while > FacetComponent report 14 'electronics'. Is this a bug? > Following is the field definition for 'cat'. > <field name="cat" type="string" indexed="true" stored="true" > multiValued="true"/> -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org