Hi All,

We would like to contribute SimRank algorithm to mllib. SimRank algorithm used 
to calculate similarity rank between two objects based on graph structure, 
details can be seen in (http://ilpubs.stanford.edu:8090/508/1/2001-41.pdf), 
here we implemented a matrix multiplication method based on basic algorithm, 
the description of matrix multiplication method can be seen in 
(http://www.cse.unsw.edu.au/~zhangw/files/wwwj.pdf) chapter 4.1.

The implementation is abstracted and generalized from our customer's real case, 
we made some tradeoffs to improve the speed and reduce the shuffle size. we 
just wondered if this algorithm be suitable to put into mllib? What else should 
we take care about?

Any suggestion would be really appreciated.

Thanks
Jerry

Reply via email to