[ 
https://issues.apache.org/jira/browse/MAHOUT-1089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13486309#comment-13486309
 ] 

Zeno Gantner commented on MAHOUT-1089:
--------------------------------------

w/o learning rate decay, good parameter combinations for ml-1m  were
preventOverfitting=0.05 numFeatures=5 learningRate=0.005 iterations=75 (RMSE 
0.8745)
and
0.075 50 0.01 75 (RMSE: 0.8725)

I tried
numFeatures 5, 10, 20, 30, 50
preventOverfitting 0.05, 0.075, 0.1, 0.15, 0.2
learnRate 0.0025, 0.005, 0.01, 0.025, 0.05, 0.075
once on an 80/20 split -- so take those numbers with a grain of salt.

                
> SGD matrix factorization for rating prediction with user and item biases
> ------------------------------------------------------------------------
>
>                 Key: MAHOUT-1089
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1089
>             Project: Mahout
>          Issue Type: New Feature
>          Components: Collaborative Filtering
>            Reporter: Zeno Gantner
>            Assignee: Sebastian Schelter
>         Attachments: MAHOUT-1089.patch, RatingSGDFactorizer.java
>
>
> A matrix factorization that is trained with standard SGD on all features at 
> the same time, in contrast to ExpectationMaximizationFactorizer, which learns 
> feature by feature.
> Additionally to the free features it models a rating bias for each user and 
> item.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to