Hey, We're just looking into ways to run multiple instances/versions of HBase for testing/development and were wondering how other people have gone about doing this.
If we used just one hadoop cluster then we can have a different paths / user for each hbase instance, and then have a set of zookeeper nodes for each instance (or run multiple zk's on each server binding to different hosts for each instance..). If we used multiple hadoop clusters then the only difference would be just using different hdfs for storing the data. Does anyone have experiences with problems or benefits to either of the above? I'm tempted to go towards the single cluster for more efficient use of hardware but I'm not sure if that's a good idea or not. Thanks, -- Dan Harvey | Datamining Engineer www.mendeley.com/profiles/dan-harvey Mendeley Limited | London, UK | www.mendeley.com Registered in England and Wales | Company Number 6419015
