As you probably already know Apache Whirr 0.7.0 is no longer able to deploy Hadoop / CDH clusters because the Oracle JDK was removed from the Ubuntu repositories.
Here is a quick way of getting CDH3u2 running on Amazon (using a development build). Download the binary artefact from: http://people.apache.org/~asavu/whirr-0.7.0-dev.tar.gz And use a recipe similar to this: whirr.cluster-name=cdh-test-asavu whirr.location-id=eu-west-1 whirr.instance-templates=1 hadoop-namenode+hadoop-jobtracker,1 hadoop-datanode+hadoop-tasktracker whirr.hadoop.install-function=install_cdh_hadoop whirr.hadoop.configure-function=configure_cdh_hadoop whirr.java.install-function=install_oab_java whirr.provider=aws-ec2 whirr.identity=${env:AWS_ACCESS_KEY_ID} whirr.credential=${env:AWS_SECRET_ACCESS_KEY} It takes a while to deploy the cluster but it works fine. WARNING: this is not an official release - we are working on that. Keep an eye on the email list! Cheers, -- Andrei Savu / andreisavu.ro
