Thanks for the suggestion. Now I am getting the following error with cloudera's distro. I have set AWS_SECRET_KEY appropriately though. Any pointers :
pmukher...@ubuntu:~/apps/cloudera-for-hadoop-on-ec2-py-0.2.0-beta$ echo $AWS_SECRET_ACCESS_KEY <.........snipped......................................> pmukher...@ubuntu:~/apps/cloudera-for-hadoop-on-ec2-py-0.2.0-beta$ ./hadoop-ec2 list Traceback (most recent call last): File "./hadoop-ec2", line 124, in <module> list_all() File "/home/pmukherjee/apps/cloudera-for-hadoop-on-ec2-py-0.2.0-beta/hadoop/ec2/commands.py", line 43, in list_all clusters = get_clusters_with_role(MASTER) File "/home/pmukherjee/apps/cloudera-for-hadoop-on-ec2-py-0.2.0-beta/hadoop/ec2/cluster.py", line 29, in get_clusters_with_role all = EC2Connection().get_all_instances() File "/usr/lib/python2.5/site-packages/boto/ec2/connection.py", line 69, in __init__ self.region.endpoint, debug, https_connection_factory, path) File "/usr/lib/python2.5/site-packages/boto/connection.py", line 446, in __init__ debug, https_connection_factory, path) File "/usr/lib/python2.5/site-packages/boto/connection.py", line 169, in __init__ self.hmac = hmac.new(self.aws_secret_access_key, digestmod=sha) AttributeError: EC2Connection instance has no attribute 'aws_secret_access_key' -Prasen On Mon, Jan 18, 2010 at 10:47 AM, Zak Stone <zst...@gmail.com> wrote: > In my experience, the Cloudera distributions are excellent, actively > developed, and well-supported. > > Zak > > > On Mon, Jan 18, 2010 at 12:01 AM, Mark Kerzner <markkerz...@gmail.com> wrote: >> My personal experience led me to prefer cloudera. Can't talk for every >> situation, but for me the hadoop distro had many bugs and was unreliable. >> >> Mark >> >> On Sun, Jan 17, 2010 at 10:58 PM, prasenjit <prasen....@gmail.com> wrote: >> >>> >>> It seems there are 2 hadoop-ec2 scripts: >>> >>> 1) One which comes along with the hadoop distro : >>> <hadoop>/src/contrib/ec2/bin/hadoop-ec2 >>> >>> 2) Another which is downloadable from >>> >>> http://cloudera-packages.s3.amazonaws.com/cloudera-for-hadoop-on-ec2-py-0.3.0-beta.tar.gz >>> and is from cloudera folks. >>> >>> I prefer using the base hadoop, as I want to avoid dependencies on >>> boto/simplejson which is required for (2). My question is are they planned >>> to kept in sync. Which one is under active development and hence suggested >>> ( given my preference for hadoop's contrib package ) for stable use ? >>> >>> -Thanks, >>> Prasen >>> >>> -- >>> View this message in context: >>> http://old.nabble.com/Which-instance-type-on-Amazon-EC2--tp25667297p27206207.html >>> Sent from the Hadoop core-user mailing list archive at Nabble.com. >>> >>> >> >