Hi All, I was just curious if the job flow for the distributed similarity calculation is documented anywhere. What is the difference between calculating a similarity sequentially versus using distributed matrix operations on Hadoop. I am just looking for a high level description of how to get from the User-Item matrix to a Item Item similarity score in map-reduce.
Thanks! Chris
