Save one pass through the data in ItemSimilarityJob and RecommenderJob by
intelligently using counters
------------------------------------------------------------------------------------------------------
Key: MAHOUT-736
URL: https://issues.apache.org/jira/browse/MAHOUT-736
Project: Mahout
Issue Type: Improvement
Components: Collaborative Filtering
Affects Versions: 0.6
Reporter: Sebastian Schelter
Assignee: Sebastian Schelter
Attachments: MAHOUT-736.patch
Both ItemSimilarityJob and RecommenderJob have to find the overall number of
users. This was done with a separate M/R pass over the data. Instead, we can
increment a counter in a previous pass over the data (similar to the way the
total number of ngrams is collected in CollocDriver) and thereby save one pass
through the data.
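
To illustrate the idea, here is a minimal plain-Java sketch. It is not the
attached patch and does not use the Hadoop API: the shuffle and reduce phases
are simulated with in-memory maps, and the class name UserCountSketch, the
method toUserVectors and the counter name NUM_USERS are hypothetical. In a real
Hadoop job the increment would presumably be done in the reducer via
context.getCounter(...).increment(1), with the driver reading the value after
the job completes.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

public class UserCountSketch {

    // Hypothetical counter name, used only for this illustration.
    static final String NUM_USERS = "NUM_USERS";

    /**
     * Simulates an existing pass that groups (userID, itemID) preferences by
     * user. As a side effect it bumps a counter once per user, so no extra
     * pass is needed just to count users.
     */
    static Map<Long, List<Long>> toUserVectors(List<long[]> preferences,
                                               Map<String, Long> counters) {
        // Simulated shuffle: collect all item IDs under their user ID.
        Map<Long, List<Long>> grouped = new TreeMap<>();
        for (long[] pref : preferences) {
            grouped.computeIfAbsent(pref[0], k -> new ArrayList<>()).add(pref[1]);
        }

        // Simulated reduce: one iteration per distinct user.
        Map<Long, List<Long>> userVectors = new TreeMap<>();
        for (Map.Entry<Long, List<Long>> entry : grouped.entrySet()) {
            userVectors.put(entry.getKey(), entry.getValue());
            // Count the user here instead of in a separate counting job.
            counters.merge(NUM_USERS, 1L, Long::sum);
        }
        return userVectors;
    }

    public static void main(String[] args) {
        List<long[]> prefs = Arrays.asList(
            new long[] {1, 10}, new long[] {1, 11},
            new long[] {2, 10}, new long[] {3, 12});
        Map<String, Long> counters = new HashMap<>();
        toUserVectors(prefs, counters);
        // The driver reads the counter after the pass has finished.
        System.out.println("users: " + counters.get(NUM_USERS));
    }
}
```

The counter is incremented once per reduce group, which is why it yields the
number of distinct users rather than the number of preferences.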