See my answers inline.

On Fri, Jun 27, 2014 at 8:42 PM, IronMan2014 <[email protected]> wrote:
> I am trying to understand my index (attached in screenshot) and how can I
> improve size and performance.
> The goal is to index 5 million docs. So, I started small by indexing 421,000
> docs as shown in the image.
> - I am using two nodes (1 & 2) ; node 1 master
>
> Q1) My index size is 3.99 GB with 421,627 docs so far, so I am guessing it
> will be over 40 GB with 5 million docs? Does the size sound too big? I do
> have store = YES for PDF docs.

Total size depends on the kind of docs you'll index, so it depends! A
linear extrapolation from your first 421,627 docs is only a rough guess.

> Q2) What is the (8 GB) from the image, is this the size on the 2 nodes?
> Also, what is (526,428) ?

The total size of the primary shards is 3.99 GB in your case, so 3.99 GB
would be the total size if you had zero replicas. As you have 1 replica
configured, the actual disk space used across both nodes is about 8 GB.

Regarding the number of documents, 421,627 is the total number of docs
currently present in your index. 526,428 is max_docs, which also counts
documents that are marked as deleted but that a segment merge has not yet
removed.

> Q3) Should I do more nodes, more/less shards?

That really depends. I would suggest running some tests to find out what
works best for you. Even 2 shards will work in your case, since you have
2 nodes and each primary shard will go to a different node. But that will
limit your options for adding nodes later if you need more. (Of course,
there are workarounds.)

>
> --
> You received this message because you are subscribed to the Google Groups
> "elasticsearch" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to [email protected].
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/elasticsearch/073c71bf-7499-4abc-8da6-5a381834c1bc%40googlegroups.com.
> For more options, visit https://groups.google.com/d/optout.

--
Cheers,
Abhijeet Rastogi (shadyabhi)

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to [email protected].
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/CACXxYfzn7dFFU7CGG1%2BE_b4U8NSTNnx3me2Jmrs3U9FexpAsSw%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.
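
For anyone who wants to redo the arithmetic from this thread, here is a minimal Python sketch. It only restates the numbers quoted above (3.99 GB of primaries, 1 replica, 421,627 docs, max_docs of 526,428). The function names are hypothetical helpers, not part of any Elasticsearch API, and the 5-million-doc figure assumes size grows linearly with document count, which may well not hold for mixed PDF content.

```python
def total_store_gb(primary_gb, replicas):
    """Disk used by all copies: the primaries plus one full copy per replica."""
    return primary_gb * (1 + replicas)


def linear_estimate_gb(current_gb, current_docs, target_docs):
    """Rough projection, ASSUMING size scales linearly with doc count."""
    return current_gb * target_docs / current_docs


def deleted_docs(max_docs, num_docs):
    """Docs still counted in max_docs but no longer live in the index."""
    return max_docs - num_docs


# Figures quoted in the thread.
primary_gb = 3.99
num_docs = 421_627
max_docs = 526_428

print(f"size with 1 replica: {total_store_gb(primary_gb, 1):.2f} GB")

projected = linear_estimate_gb(primary_gb, num_docs, 5_000_000)
print(f"projected primaries at 5M docs: {projected:.1f} GB")
print(f"projected with 1 replica: {total_store_gb(projected, 1):.1f} GB")

print(f"deleted docs not yet merged away: {deleted_docs(max_docs, num_docs)}")
```

Treat the projection as a starting point for the tests suggested above, not as a sizing guarantee.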
