I'm looking for a technology able to index around 10000 documents (hashmap of hashmaps with around 1000 key=value pairs each) per minute. I was making some tests with elasticsearch, pure Lucene, modeshape. While ES is my top candidate I need to be sure about it and I need a lot of proves and results for management.
I'm not sure if I have my test cluster set up OK: the speed of indexing on one node is the same or even better that with 4 different nodes connected with gigabit network. How can set it up to make ES cluster work faster with each node added? From documentation of ES it seems pretty simple to set up cluster and it should be spreading the load across the cluster, but the speed of indexing seems to be the same with my setting. I've created an index with 4 shards and 4 replica shards across 4 servers. When indexing starts all nodes have high IO and CPU loads, but total indexing time is not better than with one node. Am I missing something? -- You received this message because you are subscribed to the Google Groups "elasticsearch" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/7d9d07fd-96c7-4354-ac7f-d59c72345715%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.
