I installed Hadoop and Pig myself using tarballs on my ubuntu boxes, but I see that most people use Cloudera's Distribution for Hadoop (aka CDH). Is there any reason not to go straight to CDH? Do I need to carefully remove my old installations before installing the CDH debian packages?
Cheers Alex
