It's possible this is correct. 1.0 is the maximum similarity and occurs when two vectors are just a positive scalar multiple of each other (0 angle between them). It's possible there are several of these, and so their 1.0 similarities dominate the result.
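
To make that concrete, here is a small sketch in plain Python. It assumes a cosine-style similarity, which the thread does not actually confirm, and the item vectors and the cosine_similarity helper are made up for illustration; this is not Mahout code.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors (hypothetical helper)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

# Made-up preference vectors, for illustration only.
item_a = [1.0, 2.0, 3.0]
item_b = [2.0, 4.0, 6.0]   # exactly 2 * item_a, so the angle between them is 0
item_c = [3.0, 1.0, 0.5]   # points in a different direction

sim_ab = cosine_similarity(item_a, item_b)  # 1.0 up to floating-point rounding
sim_ac = cosine_similarity(item_a, item_c)  # strictly below 1.0

print(sim_ab, sim_ac)
```

If several items in the data are proportional to the query item like this, they would all tie at the top with a similarity of 1.0.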

On Mon, Oct 1, 2012 at 10:03 AM, yamo93 <[email protected]> wrote:
> I saw something strange : all recommended items, returned by
> mostSimilarItems(), have a value of 1.0.
> Is it normal ?
>
>
> On 10/01/2012 10:39 AM, Sean Owen wrote:
>>
>> This is probably because the Hadoop job does some sampling and pruning
>> whereas the non-Hadoop generally doesn't.
