It seems to be a common problem that the top N results returned from an aggregation query is inaccurate due to uneven distribution of matching documents on different shards, because ES will collect top N buckets from each shard no matter actually how many hits are on each shard. It is very often we collect buckets that should have not been collected on some shards, but we missed buckets that should have collected on some others.
Is there a way we can collect buckets based on a dynamic "weight", for example "total hits", on that shard? Thanks in advance. -- You received this message because you are subscribed to the Google Groups "elasticsearch" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/e78571f9-d3e3-4d7c-a60e-d1a2052db397%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.
