hi ,

we are trying to run some aggregation over around 5 million documents with 
 cardinality of the fields of the order of 1000 , the aggregation is a 
filter aggregation which wraps underlying term aggregation .  Right now 
it's taking around 1.2 secs on an average to compute it , the time 
increases when no. of documents are increased or I try to do multiple 
aggregations. we have aws extra large machines, shards 3 and replication 2 
. 

1.) can we improve this time (will like it to get it within 1 sec) , I can 
see very little if any of field cache being used
2.) how does this scale , it increases with number of documents , how can I 
offset that (increasing nodes , replication , sharding  ??)
3.) are there any better options (plugins or a different platform for 
aggregating data )


regards

Ankur Goel


-- 
You received this message because you are subscribed to the Google Groups 
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/elasticsearch/fb73f5bd-24a4-4065-9253-39aa8dd9dfe0%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to