Save one pass through the data in ItemSimilarityJob and RecommenderJob by 
intelligently using counters
------------------------------------------------------------------------------------------------------

                 Key: MAHOUT-736
                 URL: https://issues.apache.org/jira/browse/MAHOUT-736
             Project: Mahout
          Issue Type: Improvement
          Components: Collaborative Filtering
    Affects Versions: 0.6
            Reporter: Sebastian Schelter
            Assignee: Sebastian Schelter
         Attachments: MAHOUT-736.patch

Both ItemSimilarityJob and RecommenderJob have to find the overall number of 
users. This was done with an M/R pass over the data. Instead of that we can use 
a counter in a previous pass over the data (similar to the way the total number 
of ngrams if collected in CollocDriver) and thereby save one pass through the 
data.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to