Hi,

I've started working with Elasticsearch and have some questions about shards 
and replicas and how they handle data. I don't have any prior knowledge of 
Lucene.
As I understand it, Lucene splits data into segments and stores them on disk, 
and each shard is itself a Lucene index. My questions are:

1) There are two ways to configure shard allocation: at the cluster level 
with config settings, and at the index level with index settings. Suppose I 
set the maximum number of shards to 3 at the cluster level and specify 5 
shards at the index level. How will the shards be allocated? I have one 
cluster with one node.

2) Suppose an index has 5 shards and 2 replicas and I'm pushing data through 
the bulk API. How will the data be stored? Will the same data be stored in 
all 5 shards, or will the data be split and stored in chunks across the 5 
shards? How will the replicas hold a backup of the data in all 5 shards?
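
To be concrete, this is roughly the setup I mean in question 2, written as a 
small Python sketch. `number_of_shards` and `number_of_replicas` are the 
index settings I am using; the helper `total_shard_copies` is only my own 
illustration, and it assumes that `number_of_replicas` means copies per 
primary shard, which is part of what I would like confirmed.

```python
import json

# Index settings from question 2: 5 primary shards, 2 replicas.
index_settings = {
    "settings": {
        "index": {
            "number_of_shards": 5,
            "number_of_replicas": 2,
        }
    }
}


def total_shard_copies(settings):
    """Count primaries plus replica copies.

    Assumption: number_of_replicas is the number of extra copies
    kept for EACH primary shard, not a total for the index.
    """
    idx = settings["settings"]["index"]
    primaries = idx["number_of_shards"]
    replicas = idx["number_of_replicas"]
    return primaries * (1 + replicas)


if __name__ == "__main__":
    # Body I would send when creating the index.
    print(json.dumps(index_settings, indent=2))
    print("shard copies in total:", total_shard_copies(index_settings))
```

If that assumption is right, the index would consist of 5 primaries plus 10 
replica shards, which is more than my single node can usefully hold.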

3) Suppose I have 5 nodes and 10 shards distributed over the nodes, 2 
shards each. When I index new documents, how will the data be stored across 
the nodes? 
Suppose the 5th node, which holds the 9th and 10th shards, suddenly goes 
down. Do I lose all the data stored in the 9th and 10th shards, or has that 
data already been copied to the rest of the nodes?
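
Here is how I picture the scenario in question 3, as a toy model in Python. 
This is not Elasticsearch code: the functions `place_shards` and 
`shards_lost` and the simple "replica goes to the next node" placement rule 
are my own assumptions, used only to show what I am asking about.

```python
def place_shards(num_nodes, num_primaries, num_replicas):
    """Toy placement: return {node: set of shard numbers it holds}.

    Assumptions (mine, not Elasticsearch's allocator):
    - num_primaries divides evenly by num_nodes
    - replica r of a shard goes r nodes after the primary's node,
      so with num_replicas < num_nodes a copy never shares a node
      with its primary
    """
    per_node = num_primaries // num_nodes
    nodes = {n: set() for n in range(1, num_nodes + 1)}
    for shard in range(1, num_primaries + 1):
        primary_node = (shard - 1) // per_node + 1
        nodes[primary_node].add(shard)
        for r in range(1, num_replicas + 1):
            replica_node = (primary_node - 1 + r) % num_nodes + 1
            nodes[replica_node].add(shard)
    return nodes


def shards_lost(nodes, failed_node):
    """Shards whose only copies were on the failed node."""
    surviving = set()
    for node, shards in nodes.items():
        if node != failed_node:
            surviving |= shards
    return sorted(nodes[failed_node] - surviving)


if __name__ == "__main__":
    for replicas in (0, 1):
        layout = place_shards(num_nodes=5, num_primaries=10,
                              num_replicas=replicas)
        print(replicas, "replica(s): lost if node 5 fails ->",
              shards_lost(layout, failed_node=5))
```

In this toy model, shards 9 and 10 are lost when node 5 fails with no 
replicas, and nothing is lost with one replica. Is that also how the real 
cluster behaves?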

Please explain.

Thanks,
Subhadip
