[ 
https://issues.apache.org/jira/browse/MAHOUT-1089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14322615#comment-14322615
 ] 

clem clem commented on MAHOUT-1089:
-----------------------------------

Hi, 
I have been using the SGD matrix factorization for the Yelp Business dataset. I 
called the factorize() method and saved the output which is a matrix of 
doubles. Now I am trying to understand the actual meaning of the values I 
obtained (I suspect the category of the business is one of the latent factors, 
and the gender of the users could be another one). But when I call the 
getItemFeatures(), for each item the first two values are always equal to 1.0.
In the same way, when I call the getUserFeatures(), for each user the third 
value is always equal to 1.0.
If anybody has the time to explain this to me I would be really grateful.
 

> SGD matrix factorization for rating prediction with user and item biases
> ------------------------------------------------------------------------
>
>                 Key: MAHOUT-1089
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1089
>             Project: Mahout
>          Issue Type: New Feature
>          Components: Collaborative Filtering
>    Affects Versions: 0.7
>            Reporter: Zeno Gantner
>            Assignee: Sebastian Schelter
>             Fix For: 0.8
>
>         Attachments: MAHOUT-1089.patch, RatingSGDFactorizer.java, 
> RatingSGDFactorizer.java
>
>
> A matrix factorization that is trained with standard SGD on all features at 
> the same time, in contrast to ExpectationMaximizationFactorizer, which learns 
> feature by feature.
> Additionally to the free features it models a rating bias for each user and 
> item.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to