I think that Andrew talked about this some years ago and he created some scripts for that. You can find them here: https://github.com/apurtell/hbase-ec2
Then, you can review some links about this topic: http://blog.cloudera.com/blog/2012/10/set-up-a-hadoophbase-cluster-on-ec2-in-about-an-hour/ http://my.safaribooksonline.com/book/databases/storage-systems/9781849517140/1dot-setting-up-hbase-cluster/id286696951 http://whynosql.com/why-we-run-our-hbase-on-ec2/ You can read the HBase on EC2 demo from Andrew in the HBaseCon 2012: https://github.com/apurtell/ec2-demo 2013/5/7 Pal Konyves <[email protected]> > Hi, > > Has anyone got some recommendations about running HBase on EC2? I am > testing it, and so far I am very disappointed with it. I did not change > anything about the default 'Amazon distribution' installation. It has one > MasterNode and two slave nodes, and write performance is around 2500 small > rows per sec at most, but I expected it to be way better. Oh, and this is > with batch put operations with autocommit turned off, where each batch > containes about 500-1000 rows... When I do it with autocommit, it does not > even reach the 1000 rows per sec. > > Every nodes were m1.Large ones. > > Any experiences, suggestions? Is it worth to try the RMap distribution > instead of the amazon one? > > Thanks, > Pal > -- Marcos Ortiz Valmaseda Product Manager at PDVSA http://about.me/marcosortiz
