I have the following requirements : 1. Adds : 20 docs/sec 2. Searches : 100 searches/sec 3. Deletes : (20*3600*24*7 ~ 12 mill ) docs/week ( basically a cron job which deletes all documents more than 7 days old )
I am thinking of having 6 shards ( with each having 2 million docs ) with 1 master and 2 slaves with SolrReplication. Have following questions : 1. With 50 searches/sec per shard with 2 million doc, what would be the tentative response-time ? I am thinking of keeping it under <100 ms 2. What would be a reasonable latency ( pollInterval ) on slave for SolrReplication ( all slaves connected with a single backplane ). Is 1 minute pollInterval reasonable ? 3. Is NRT a better/viable option compared to SolrReplication ? -Thanks, Prasenjit