[
https://issues.apache.org/jira/browse/SOLR-8768?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15174008#comment-15174008
]
Pablo Anzorena commented on SOLR-8768:
--------------------------------------
UPDATE:
It does not depend on the range facets. The problem is on the faceting with
stats across multiple shards. Following with the example,
'sellers_2014'
Tyrion, 1
Jon, 50
PoorNed, 4
'sellers_2015'
Tyrion, 100
Jon, 50
PoorNed, 4
'sellers_2016'
Tyrion, 1
Jon, 50
PoorNed, 4
If the request is top 2 sellers order by seller_measure desc, the response
would be:
{code}
1)Jon 150
2)Tyrion 100
{code}
But the correct response would be Tyrion with 102, but it is not counting the
+2 because Tyrion is not in the top 2 of the other two shards.
> Wrong behaviour in json facets
> ------------------------------
>
> Key: SOLR-8768
> URL: https://issues.apache.org/jira/browse/SOLR-8768
> Project: Solr
> Issue Type: Bug
> Components: Facet Module
> Reporter: Pablo Anzorena
>
> This bug is quite difficult to explain it, so I will first show it with an
> example and then explain it.
> I have a core splitted into three shards, let's call them 'sellers_2014',
> 'sellers_2015', 'sellers_2016'.
> The schema has the following fields:
> seller_name, string
> seller_measure, double
> seller_date, date
> With the following data.
> 'sellers_2014'
> Tyrion, 1, 2014-01-01T00:00:00Z
> Jon, 50, 2014-01-01T00:00:00Z
> PoorNed, 4, 2014-01-01T00:00:00Z
> 'sellers_2015'
> Tyrion, 100, 2015-01-01T00:00:00Z
> Jon, 50, 2015-01-01T00:00:00Z
> PoorNed, 4, 2015-01-01T00:00:00Z
> 'sellers_2016'
> Tyrion, 1, 2015-01-01T00:00:00Z
> Jon, 50, 2015-01-01T00:00:00Z
> PoorNed, 4, 2015-01-01T00:00:00Z
> Request:
> http://localhost:8983/solr/sellers_2016/select?q=*:*&shards=localhost:8983/solr/sellers_2014,localhost:8983/solr/sellers_2015,localhost:8983/solr/sellers_2016&json.facet=
> {code}
> {
> top_sellers: {
> type: terms,
> field: seller_name,
> limit: 2,
> offset: 0,
> sort: "seller_measure desc",
> facet: {
> seller_measure: "sum(seller_measure)",
> seller_dates: {
> type: range,
> field: seller_date,
> start: "2014-01-01T00:00:00Z",
> end: "2016-12-31T00:00:00Z",
> gap: "+1YEARS",
> facet: {
> seller_measure: "sum(seller_measure)"
> }
> }
> }
> }
> }
> {code}
> So... With the request I want to know the top 2 sellers across the three
> shards and for each seller, their seller_measure for each year.
> The response I'm getting is:
> {code}
> "val": "Jon",
> "count": 3,
> "seller_measure": 150,
> "seller_dates": {
> "buckets": [
> {
> "val": "2014-01-01T00:00:00Z",
> "count": 1,
> "seller_measure": 50
> },
> {
> "val": "2015-01-01T00:00:00Z",
> "count": 1,
> "seller_measure": 50
> },
> {
> "val": "2016-01-01T00:00:00Z",
> "count": 1,
> "seller_measure": 50
> }
> ]
> },
> "val": "Tyrion",
> "count": 3,
> "seller_measure": 102,
> "seller_dates": {
> "buckets": [
> {
> "val": "2015-01-01T00:00:00Z",
> "count": 1,
> "seller_measure": 100
> }
> ]
> }
> {code}
> which is incorrect, because the two buckets of 2014 and 2016 in Tyrion are
> missing.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]