[ 
https://issues.apache.org/jira/browse/MAHOUT-1289?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sebastian Schelter resolved MAHOUT-1289.
----------------------------------------

    Resolution: Fixed
    
> Move downsampling code into RowSimilarityJob
> --------------------------------------------
>
>                 Key: MAHOUT-1289
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1289
>             Project: Mahout
>          Issue Type: Improvement
>          Components: Math
>            Reporter: Sebastian Schelter
>            Assignee: Sebastian Schelter
>             Fix For: 0.9
>
>
> When computing similarities with RowSimilarityJob, downsampling highly 
> frequent things is crucial for performance. At the moment, this is done by 
> the data preparation code for collaborative filtering.
> We should move the downsampling directly into RowSimilarityJob as we've seen 
> a lot of cases where users want to directly use it.
> Furthermore, it should be possible to fix the random seed for the sampling to 
> be able to conduct repeatable experiments.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to