I've never used the Cloudera distributions, but you can't not hear about them. Is it really much easier to manage the whole platform using Cloudera's manager? 50 nodes free is generous enough that I'd feel comfortable committing to them as a platform (and thus the future potential cost), I think.
My only real experience comes from AWS's environment, which, other than having a dedicated DFS and launching jobs via their steps process, seems like a pretty straightforward Hadoop configuration.

Dave

From: Håvard Wahl Kongsgård [mailto:[email protected]]
Sent: Friday, March 29, 2013 3:21 PM
To: user
Subject: Re: Which hadoop installation should I use on ubuntu server?

I recommend Cloudera's CDH4 on Ubuntu 12.04 LTS

On Thu, Mar 28, 2013 at 7:07 AM, David Parks <[email protected]> wrote:

I'm moving off AWS MapReduce to our own cluster, and I'm installing Hadoop on Ubuntu Server 12.10.

I see a .deb installer and installed that, but it seems like files are all over the place: `/usr/share/Hadoop`, `/etc/hadoop`, `/usr/bin/hadoop`. And the documentation is a bit harder to follow:

http://hadoop.apache.org/docs/r1.1.2/cluster_setup.html

So I just wonder if this installer is the best approach, or if it'll be easier/better to just install the basic build in /opt/hadoop, and perhaps the docs become easier to follow.

Thoughts?

Thanks,
Dave

--
Håvard Wahl Kongsgård
Data Scientist
Faculty of Medicine & Department of Mathematical Sciences
NTNU
http://havard.dbkeeping.com/
