Hi list, I have the opportunity to reinstall a cluster form scratch. I use Hadoop and HBase (not yet any of the other tools, like Pig, Hive, Avro, Thrift, etc.). Now, I wonder what versions to use. CDH3 is nice, because it comes with RPMs out of the box, so the operations people now what to do with it (of course, we can build our own for versions that don't have RPMs). It does appear, however, that CDH is mostly focused on a very good Hadoop / HDFS / MR version and you're better of with the ASF HBase release right now. And there is also the version that SU provides on githug, which has the advantage of being heavily used in a production environment by people who know what they're doing.
Any advise on this anyone? Thanks, Friso
