Error in formula for preference estimation in GenericItemBasedRecommender
-------------------------------------------------------------------------

                 Key: MAHOUT-898
                 URL: https://issues.apache.org/jira/browse/MAHOUT-898
             Project: Mahout
          Issue Type: Bug
          Components: Collaborative Filtering
         Environment: mahout-core
            Reporter: Paulo Villegas
            Assignee: Sean Owen
            Priority: Minor
             Fix For: 0.6


The formula to estimate the preference for an item in the Taste item-based 
recommender normalizes by the sum of similarities for items used in estimation. 
But the terms in the sum taken to normalize should be in absolute value, since 
they can be negative (e.g. when using Pearson correlation, similarity is in 
[-1,1]). Now they are not, and as a result when there are negative and positive 
values they cancel out, giving a small denominator and incorrectly boosting the 
preference for the item (symptom: it is easy for a predicted preference to take 
the maximum value, since the quotient becomes large and it is capped afterwards)

The patch is rather trivial (a one-liner, actually) for 
src/main/java/org/apache/mahout/cf/taste/impl/recommender/GenericItemBasedRecommender.java
Note: the same error & suggested fix happens in GenericUserBasedRecommender

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to