I'm trying to have our deployment layout..I read one of the articles/FAQ (probably JG's)...that it's better to have zookeeper on separate cluster/separate sets of machine..I'm assuming that is the right approach..
All our transactions are HBase (inserts, mapreduce-table as input, another table as output, other queries,..) Based on other thread on locality..RegionServer & Datanode i'll put on same hosts.. If these boxes have enough capacity, do we need to put zookeeper on separate cluster? If it is on a separate cluster, my understanding is zookeper has much smaller memory footprint compared to HRegionServer/Datanodes..& it shld need that much CPU as well..correct? Is there any suggested guidance on number of zookeeper vs number of regionservers?..looking for some ratio..say 10 node cluster.. how many zookeeper..? Please ignore responding to this ..if this is outside the etiquette thanks venkatesh
