[ 
https://issues.apache.org/jira/browse/MAHOUT-910?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13163085#comment-13163085
 ] 

Sean Owen commented on MAHOUT-910:
----------------------------------

It still computes a maximum for each of three different things. Everything 
is allowed through as long as the number of things does not exceed that max, 
and sampling only happens once it does. I think it's the same idea as before 
in this regard. Or are you questioning the default 'factor'? I picked 5. Right 
now, if you have about 10,000 items, it will sample when a user exceeds 
5*ln(10000) ~= 46 items.
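
To make the arithmetic above concrete, here is a minimal sketch of the 
threshold rule as described in this comment. It is not the code from the 
attached patch: the class name SamplingThresholdSketch and the methods 
maxFrom and shouldSample are hypothetical, and rounding the product down to 
an int is an assumption.

```java
// Illustrative sketch only; names and rounding are assumptions, not the patch.
public final class SamplingThresholdSketch {

    private SamplingThresholdSketch() {
    }

    // Maximum allowed before sampling: factor * ln(numThings), rounded down.
    public static int maxFrom(int factor, int numThings) {
        if (factor < 1 || numThings < 1) {
            throw new IllegalArgumentException("factor and numThings must be >= 1");
        }
        return (int) Math.floor(factor * Math.log(numThings));
    }

    // Sample only when the count exceeds the max; otherwise allow everything.
    public static boolean shouldSample(int count, int max) {
        return count > max;
    }

    public static void main(String[] args) {
        int max = maxFrom(5, 10000);
        System.out.println(max);                    // prints 46
        System.out.println(shouldSample(46, max));  // prints false
        System.out.println(shouldSample(47, max));  // prints true
    }
}
```

Because the max grows with the logarithm of the item count, going from 10,000 
to 1,000,000 items only raises it from 46 to 69 in this sketch.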
                
> Improve sampling in SamplingCandidateItemStrategy, optimize intersection 
> computations
> -------------------------------------------------------------------------------------
>
>                 Key: MAHOUT-910
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-910
>             Project: Mahout
>          Issue Type: Improvement
>          Components: Collaborative Filtering
>    Affects Versions: 0.5
>            Reporter: Sean Owen
>            Assignee: Sean Owen
>             Fix For: 0.6
>
>         Attachments: MAHOUT-910.patch, MAHOUT-910.patch, MAHOUT-910.patch, 
> SamplingCandidateItemsStrategy.java
>
>
> Per the lengthy discussion on the mailing list about optimizing 
> SamplingCandidateItemStrategy and related code, I'm opening this placeholder 
> issue.


        
