A common problem seems to be that the top N results returned from an 
aggregation query are inaccurate when matching documents are unevenly 
distributed across shards, because ES collects the top N buckets from 
each shard regardless of how many hits that shard actually has. Very 
often we collect buckets on some shards that should not have been 
collected, and miss buckets on other shards that should have been. 
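
To make the problem concrete, here is a minimal Python sketch of the 
behaviour as I understand it. It is not Elasticsearch code: the function 
names, the two shards and their counts are all made up for illustration, 
and it assumes each shard simply returns its local top N buckets, which 
are then summed and cut to N again.

```python
from collections import Counter

# Hypothetical per-shard term counts (illustrative numbers only).
shard_a = {"x": 10, "y": 9, "z": 8}
shard_b = {"z": 7, "w": 6, "x": 1}


def top_n_per_shard_merge(shards, n):
    """Take each shard's local top n buckets, sum them, return the top n."""
    merged = Counter()
    for shard in shards:
        # Only the shard's n largest buckets are ever seen by the merge step.
        for term, count in Counter(shard).most_common(n):
            merged[term] += count
    return merged.most_common(n)


def exact_top_n(shards, n):
    """Sum every bucket from every shard, then return the true top n."""
    total = Counter()
    for shard in shards:
        total.update(shard)
    return total.most_common(n)


print(top_n_per_shard_merge([shard_a, shard_b], 2))  # [('x', 10), ('y', 9)]
print(exact_top_n([shard_a, shard_b], 2))            # [('z', 15), ('x', 11)]
```

With N = 2, "y" is collected although it does not belong in the true top 
2, "z" is missed although it is the largest bucket overall, and the count 
for "x" is too low because its bucket on the second shard was cut off.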

Is there a way to collect buckets on each shard based on a dynamic 
"weight", for example that shard's total hits?

Thanks in advance.
