[ https://issues.apache.org/jira/browse/MAHOUT-305?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12861381#action_12861381 ]
Sean Owen commented on MAHOUT-305:
----------------------------------

I see, fair enough. Even for this simplistic initial system, something better is called for. Perhaps the mappers can keep a count of how many times each item has been seen and favor co-occurrences among items that have *not* been seen. They wouldn't have a global count, but such a simple heuristic may be efficient and effective.

For now I might arbitrarily prune, say, user vectors with more than 50 preferences.

> Combine both cooccurrence-based CF M/R jobs
> -------------------------------------------
>
>                 Key: MAHOUT-305
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-305
>             Project: Mahout
>          Issue Type: Improvement
>          Components: Collaborative Filtering
>    Affects Versions: 0.2
>            Reporter: Sean Owen
>            Assignee: Ankur
>            Priority: Minor
>
> We have two different but essentially identical MapReduce jobs to make
> recommendations based on item co-occurrence:
> org.apache.mahout.cf.taste.hadoop.{item,cooccurrence}. They ought to be
> merged. Not sure exactly how to approach that but noting this in JIRA, per
> Ankur.
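
For illustration only, the pruning heuristic suggested in the comment could be sketched roughly as below. This is not code from Mahout: the class `LocalCountPruner`, the method `prune`, and the choice to trim an oversized user vector (rather than drop it entirely) are all assumptions made for the sketch. The cap of 50 comes from the comment's "say, 50". The counts are local to one mapper instance, as the comment notes, so they only approximate global item frequency.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical helper, not part of Mahout: one instance would live inside one mapper. */
final class LocalCountPruner {

    /** Cap suggested in the comment ("say, 50"); an arbitrary starting point. */
    static final int DEFAULT_MAX_PREFS = 50;

    /** Mapper-local (not global) count of how often each item ID has been seen. */
    private final Map<Long, Integer> seenCounts = new HashMap<>();

    /**
     * Returns at most maxPrefs item IDs from one user's vector, preferring
     * the items this mapper has seen least often so far.
     */
    List<Long> prune(List<Long> userItems, int maxPrefs) {
        List<Long> kept = new ArrayList<>(userItems);
        if (kept.size() > maxPrefs) {
            // List.sort is stable: least-seen items first, ties keep input order.
            kept.sort(Comparator.comparingInt((Long id) -> seenCounts.getOrDefault(id, 0)));
            kept = new ArrayList<>(kept.subList(0, maxPrefs));
        }
        // Assumption: every item in the incoming vector counts as "seen",
        // whether or not it survived the pruning.
        for (Long id : userItems) {
            seenCounts.merge(id, 1, Integer::sum);
        }
        return kept;
    }
}
```

A mapper would call `prune(items, LocalCountPruner.DEFAULT_MAX_PREFS)` on each user vector before emitting item pairs, so the quadratic blow-up in co-occurrence pairs is bounded per user. Whether to count all incoming items or only the surviving ones is a design choice the comment leaves open.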