See my answers inline.

On Fri, Jun 27, 2014 at 8:42 PM, IronMan2014 <[email protected]> wrote:
> I am trying to understand my index (attached in screenshot) and how can I
> improve size and performance.
> The goal is to index 5 million docs. So, I started small by indexing 421,000
> docs as shown in the image.
> - I am using two nodes (1 & 2) ; node 1 master
>
> Q1) My index size is 3.99 GB with 421,627 docs so far, so I am guessing it
> will be over 40 GB with 5 million docs? Does the size sound too big? I do
> have store = YES for PDF docs.

Total size depends on the kind of docs you'll index. So, it depends!

> Q2) What is the (8 GB) from the image, is this the size on the 2 nodes?
> Also, what is (526,428) ?

Total size of primary shards equals 3.99GB in your case. So, 3.99GB
will be the total size in case you had zero replica. As you've 1
replica set, actual disk space used is 8GB.

Regarding number of documents, 421627 is the total number of docs
present in your index. 526428 is the max_docs your index has seen
before the merge removed the deleted docs.

> Q3) Should I do more nodes, more/less shards?

That really depends. I would suggest doing some tests to find out what
works best for you. Even having 2 shards will work in your case as you
have 2 nodes and each primary shard will go to different nodes. But
then, it'll limit your option of adding a node in case you need more
nodes. ( Of course, there are workarounds).

>
> --
> You received this message because you are subscribed to the Google Groups
> "elasticsearch" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to [email protected].
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/elasticsearch/073c71bf-7499-4abc-8da6-5a381834c1bc%40googlegroups.com.
> For more options, visit https://groups.google.com/d/optout.



-- 
Cheers,
Abhijeet Rastogi (shadyabhi)

-- 
You received this message because you are subscribed to the Google Groups 
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/elasticsearch/CACXxYfzn7dFFU7CGG1%2BE_b4U8NSTNnx3me2Jmrs3U9FexpAsSw%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to