Cloudera's distribution is nice in that they bundle and test Pig
together with Hadoop and other related tools, so you get the whole
suite and you know it works together. The downside is that there is a
lag between when Pig does a release and when Cloudera picks it up, so
you have to wait a few months to get the latest release.
Alan.
On Mar 28, 2011, at 3:20 AM, Alex McLintock wrote:
I installed Hadoop and Pig myself using tarballs on my Ubuntu boxes, but I
see that most people use Cloudera's Distribution for Hadoop (aka CDH).
Is there any reason not to go straight to CDH? Do I need to carefully remove
my old installations before installing the CDH Debian packages?
Cheers
Alex