On 12/30/2013 08:32 AM, Steven Núñez wrote:
The CDH, BigTop and HDP (I assume) base distributions require a lot of manual configuration, so the best way to spin up a cluster with a reasonable set of applications (say HDFS, YARN, Hive, HCatalog, HBase, ZooKeeper, Oozie, Pig, Sqoop) is to use CDH + CM or Ambari + HDP.
Some people have also automated this through tools such as Puppet, Chef or Ansible.
Thanks, Bruno