[ https://issues.apache.org/jira/browse/MAHOUT-305?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12837517#action_12837517 ]
Sean Owen commented on MAHOUT-305:
----------------------------------

Agree, I do not see recall as useful, but hey, including it for completeness isn't a big deal. Yeah in this context there's no choice but to count unrated items as misses. My intuition based on limited experience is it is in fact an issue -- are the best items for a user typically found among their ratings in real-world data sets? I just can't imagine it's so for most users, who express few ratings. So I harp on picking the highest-rated items for exclusion, and based only on the criteria, in order to align best with the process being tested, because even then the test is fairly flawed, but at least minimally flawed.

> Combine both cooccurrence-based CF M/R jobs
> -------------------------------------------
>
>                 Key: MAHOUT-305
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-305
>             Project: Mahout
>          Issue Type: Improvement
>          Components: Collaborative Filtering
>    Affects Versions: 0.2
>            Reporter: Sean Owen
>            Assignee: Ankur
>            Priority: Minor
>
> We have two different but essentially identical MapReduce jobs to make
> recommendations based on item co-occurrence:
> org.apache.mahout.cf.taste.hadoop.{item,cooccurrence}. They ought to be
> merged. Not sure exactly how to approach that but noting this in JIRA, per
> Ankur.
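
To make the evaluation point in the comment above concrete, here is a minimal Python sketch of the kind of test being described: hold out a user's highest-rated items, treat only those as relevant, and count every other recommended item (including items the user never rated) as a miss. This is not Mahout code. The function names, the sample ratings, and the recommended list are all hypothetical and chosen only for illustration.

```python
def split_top_rated(ratings, n):
    """Hold out the user's n highest-rated items.

    Returns (training, relevant): the remaining ratings and the set of
    held-out item IDs. Ties are broken by item ID so the split is
    deterministic.
    """
    ranked = sorted(ratings.items(), key=lambda kv: (-kv[1], kv[0]))
    relevant = {item for item, _ in ranked[:n]}
    training = {item: r for item, r in ratings.items() if item not in relevant}
    return training, relevant


def precision_recall(recommended, relevant):
    """Precision and recall of a recommendation list against the held-out set.

    Any recommended item outside `relevant` is a miss, even if the user
    simply never rated it.
    """
    hits = sum(1 for item in recommended if item in relevant)
    precision = hits / len(recommended) if recommended else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall


# Hypothetical ratings for one user (item ID -> rating).
ratings = {"A": 5.0, "B": 4.5, "C": 2.0, "D": 3.0}
training, relevant = split_top_rated(ratings, 2)

# Hypothetical recommender output computed from `training` only.
# "X" and "Y" were never rated by this user, so they count as misses.
recommended = ["A", "X", "Y"]
precision, recall = precision_recall(recommended, relevant)
print(precision, recall)
```

In this made-up case only "A" is a hit, so one of three recommendations counts toward precision and one of two held-out items counts toward recall. "X" and "Y" might well be good recommendations, but the test has no way to credit them, which is the flaw the comment describes.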