Re: Benchmarks: Hama vs. Spark

2013-08-11 Thread Edward J. Yoon
Moreover, there is still improvement to be made in BSP-based KMeans
algorithm. I'll describe it on new JIRA ticket.

On Mon, Aug 12, 2013 at 7:57 AM, Yexi Jiang  wrote:
> That's cool!
>
>
> 2013/8/11 Edward J. Yoon 
>
>> Here's some interesting benchmarks (by Leonidas Fegaras) showing off
>> the performance of Hama compared to the Spark. Pagerank and KMeans
>> were run via MRQL query, which is not as fast as the native BSP code.
>> Moreover, 0.5 is very slow. I've started to think that latest Hama may
>> be faster than Spark. :-)
>>
>> 
>> On laptop with 8 cores:
>>   Hama 0.5 Spark
>> Pagerank 500K/2M: 211 341
>> KMeans 1M: 31 22
>> KMeans 2M: 41 40
>> KMeans 4M: 165 77
>>
>> On cluster with 64 cores:
>>Hama 0.5 Spark
>> Pagerank 1M/10M: 3590 428
>> KMeans 10M: 87 82
>> KMeans 20M: 129 134
>>
>> On cluster with 32 cores:
>>Hama 0.5 Spark
>> Pagerank 1M/10M: 4419 434
>> KMeans 10M: 98 74
>> KMeans 20M: 273 74
>>
>>
>> --
>> Best Regards, Edward J. Yoon
>> @eddieyoon
>>
>
>
>
> --
> --
> Yexi Jiang,
> ECS 251,  [email protected]
> School of Computer and Information Science,
> Florida International University
> Homepage: http://users.cis.fiu.edu/~yjian004/



-- 
Best Regards, Edward J. Yoon
@eddieyoon


Re: Benchmarks: Hama vs. Spark

2013-08-11 Thread Yexi Jiang
That's cool!


2013/8/11 Edward J. Yoon 

> Here's some interesting benchmarks (by Leonidas Fegaras) showing off
> the performance of Hama compared to the Spark. Pagerank and KMeans
> were run via MRQL query, which is not as fast as the native BSP code.
> Moreover, 0.5 is very slow. I've started to think that latest Hama may
> be faster than Spark. :-)
>
> 
> On laptop with 8 cores:
>   Hama 0.5 Spark
> Pagerank 500K/2M: 211 341
> KMeans 1M: 31 22
> KMeans 2M: 41 40
> KMeans 4M: 165 77
>
> On cluster with 64 cores:
>Hama 0.5 Spark
> Pagerank 1M/10M: 3590 428
> KMeans 10M: 87 82
> KMeans 20M: 129 134
>
> On cluster with 32 cores:
>Hama 0.5 Spark
> Pagerank 1M/10M: 4419 434
> KMeans 10M: 98 74
> KMeans 20M: 273 74
>
>
> --
> Best Regards, Edward J. Yoon
> @eddieyoon
>



-- 
--
Yexi Jiang,
ECS 251,  [email protected]
School of Computer and Information Science,
Florida International University
Homepage: http://users.cis.fiu.edu/~yjian004/


Benchmarks: Hama vs. Spark

2013-08-11 Thread Edward J. Yoon
Here's some interesting benchmarks (by Leonidas Fegaras) showing off
the performance of Hama compared to the Spark. Pagerank and KMeans
were run via MRQL query, which is not as fast as the native BSP code.
Moreover, 0.5 is very slow. I've started to think that latest Hama may
be faster than Spark. :-)


On laptop with 8 cores:
  Hama 0.5 Spark
Pagerank 500K/2M: 211 341
KMeans 1M: 31 22
KMeans 2M: 41 40
KMeans 4M: 165 77

On cluster with 64 cores:
   Hama 0.5 Spark
Pagerank 1M/10M: 3590 428
KMeans 10M: 87 82
KMeans 20M: 129 134

On cluster with 32 cores:
   Hama 0.5 Spark
Pagerank 1M/10M: 4419 434
KMeans 10M: 98 74
KMeans 20M: 273 74


-- 
Best Regards, Edward J. Yoon
@eddieyoon