On 13/10/10 04:13, michael j pan wrote:
So the
question may be, why is Hadoop (which implements MapReduce as
described in that paper) the most popular MapReduce framework in the
wild, even though it was not the first, nor the most efficient?


-good engineering effort at Y! and others means that it scales to double digits of petabytes, thousands of nodes, so for everyone else you know you don't hit the limits

-good community evolving it

-regular release schedule

-good documentation/books

-evolving set of tools near it: hive, pig, hbase, cassandra, mahout, etc.



Reply via email to