I'm trying to have our deployment layout..I read one of the articles/FAQ 
(probably JG's)...that it's better to
have zookeeper on separate cluster/separate sets of machine..I'm assuming that 
is the right approach..

 
All our transactions are HBase (inserts, mapreduce-table as input, another 
table as output, other queries,..)
Based on other thread on locality..RegionServer & Datanode i'll put on same 
hosts..

If these boxes have enough capacity, do we need to put zookeeper on separate 
cluster?
If it is on a separate cluster, my understanding is zookeper has much smaller 
memory footprint compared
to HRegionServer/Datanodes..& it shld need that much CPU as well..correct?

Is there any suggested guidance on number of zookeeper vs number of 
regionservers?..looking for some ratio..say 10 node cluster..
how many zookeeper..?

Please ignore responding to this ..if this is outside the etiquette
thanks
venkatesh 


Reply via email to