[
https://issues.apache.org/jira/browse/MAHOUT-1289?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Schelter resolved MAHOUT-1289.
----------------------------------------
Resolution: Fixed
> Move downsampling code into RowSimilarityJob
> --------------------------------------------
>
> Key: MAHOUT-1289
> URL: https://issues.apache.org/jira/browse/MAHOUT-1289
> Project: Mahout
> Issue Type: Improvement
> Components: Math
> Reporter: Sebastian Schelter
> Assignee: Sebastian Schelter
> Fix For: 0.9
>
>
> When computing similarities with RowSimilarityJob, downsampling highly
> frequent things is crucial for performance. At the moment, this is done by
> the data preparation code for collaborative filtering.
> We should move the downsampling directly into RowSimilarityJob as we've seen
> a lot of cases where users want to directly use it.
> Furthermore, it should be possible to fix the random seed for the sampling to
> be able to conduct repeatable experiments.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira