There is no right (or wrong) answer. The best way is to test. Start with the default 5 shards and load real data into it at the rate that you expect in production. And then query it at the rate that you expect in production - check throughput and response times. Then run your facets, sorts, aggregations - check throughput, response times, and RAM usage. After a little but of testing, you should get a good sense of the limits of your hardware (per node) and then go from there.
-- You received this message because you are subscribed to the Google Groups "elasticsearch" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/3f908453-78e7-4af3-98fe-900a41ba0e91%40googlegroups.com. For more options, visit https://groups.google.com/groups/opt_out.
