Here's some interesting benchmarks (by Leonidas Fegaras) showing off
the performance of Hama compared to the Spark. Pagerank and KMeans
were run via MRQL query, which is not as fast as the native BSP code.
Moreover, 0.5 is very slow. I've started to think that latest Hama may
be faster than Spark. :-)
----
On laptop with 8 cores:
Hama 0.5 Spark
Pagerank 500K/2M: 211 341
KMeans 1M: 31 22
KMeans 2M: 41 40
KMeans 4M: 165 77
On cluster with 64 cores:
Hama 0.5 Spark
Pagerank 1M/10M: 3590 428
KMeans 10M: 87 82
KMeans 20M: 129 134
On cluster with 32 cores:
Hama 0.5 Spark
Pagerank 1M/10M: 4419 434
KMeans 10M: 98 74
KMeans 20M: 273 74
--
Best Regards, Edward J. Yoon
@eddieyoon