Hi, Brian, this question is also relevant for me. Perhaps somebody will give more details because I am just learning myself. But, I guess you can try to change the parameters, and check the performance, and write here about it that everybody would get more knowledge!
In general, if these values are lower, the performance should be faster because mahout based on some algorithms of hadoop. I think it could help if you will try the algorithms with several pieces of data, and look if you are missing some important recommendations. Let's say if you choose " maxSimilaritiesPerItem" as 4, and you miss some recommendations, then you should increase the value. It is a balance between performance and better results, and you should find that balance. Hope, you to share more details about what you will find out because I noticed that here (in the mailing list of mahout) everybody is asking but only few replying, and sharing. Thanks, Darius 2013/9/12 Brian Arnold <[email protected]> > Hi, > > I am currently trying to run the distributed Item Based Collaborative > filtering algorithm on our Hadoop cluster, and I have a few questions > regarding tweaking the various properties of the algorithm. For the > maxPrefsPerUser,maxSimilaritiesPerItem, and maxPrefsPerUserItemSimilarity > properties I was wondering if I could get a more detailed explanation of > what these properties control. I saw the description in the code, but I am > just wondering how changing these values will affect the results of the > algorithm, and will increasing them result in a better recommendation. > > Thanks >
