Hi,

This does look interesting. Please give me some time to read up on stuff
and evaluate if
its feasible to implement psvm using BSP.
Btw on the benchmarks page, its not clear if the tests are for hadoop or
hama.
Am I missing something ?

Thanks
Aditya Sarawgi

On Thu, Mar 1, 2012 at 1:51 AM, Thomas Jungblut <[email protected]>wrote:

> Hi Aditya,
>
> I'm from the Apache Hama team, we are working on a BSP (Bulk Synchronous
> Parallel) Engine.
> BSP is quite like MPI, just with 2 primitives (barrier sync and message
> send), I don't know if it is enough for your algorithm, but I would be very
> interested in implementing it with BSP and Apache Hama.
>
> I have already implemented a k-means clustering with BSP [1] which is much
> more faster than the MapReduce implementation [2].
> I plan to contribute it over the next few months to Mahout, since I think
> BSP is a missing part of large scale machine learning (currently I just see
> MapReduce implementations everywhere), you would help to give Mahout
> another good example of BSP and machine learning.
> And it would of course help me to convince the Mahout team of the usage of
> Apache Hama ;)
>
> If you are interested, I'd be glad to hear from you.
>
> Best regards,
> Thomas
>
> [1]
> https://github.com/thomasjungblut/thomasjungblut-common/blob/master/src/de/jungblut/clustering/KMeansBSP.java
>
> [2]  http://wiki.apache.org/hama/Benchmarks (scroll down a bit)
>
> Am 1. März 2012 06:31 schrieb Aditya Sarawgi <[email protected]>:
>
> Hello,
>>
>> I am looking to implement psvm for Mahout as a part of of my coursework.
>> The reference paper is
>> http://books.nips.cc/papers/files/nips20/NIPS2007_0435.pdf
>> and there is a implementation over http://code.google.com/p/psvm/ which
>> uses MPI.
>> Any ideas, pointers are much appreciated.
>>
>> Thanks
>> Aditya Sarawgi
>>
>
>

Reply via email to