[
https://issues.apache.org/jira/browse/MAHOUT-463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12897675#action_12897675
]
Hui Wen Han commented on MAHOUT-463:
------------------------------------
In class AbstractDistributedVectorSimilarity:

  /**
   * computes the number of elements in the {@link Iterator}
   */
  protected int countElements(Iterator<?> iterator) {
    int count = 0;
    while (iterator.hasNext()) {
      count++;
      iterator.next();
    }
    return count;
  }

countElements may cause a performance issue, because it has to walk the whole
iterator just to count its elements.
Can we change it to use an array or a list, so that we only need to read the
length of the array or the size of the list?
Should we also change some of the mapper and reducer classes to use an array or
a list?
I am running a test, and the
RowSimilarityJob-CooccurrencesMapper-SimilarityReducer step took about 2 hours.
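
To make the suggestion concrete, here is a minimal sketch of the idea, not
the actual Mahout code. The class name ElementCounter and the materialize
helper are hypothetical names chosen for illustration. It shows the count
being read from a collection's size() instead of walking an iterator, with the
iterator version kept as a fallback.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collection;
import java.util.Iterator;
import java.util.List;

// Hypothetical helper class, for illustration only (not part of Mahout).
final class ElementCounter {

  private ElementCounter() {
  }

  /** Reads the size directly when the caller already holds a collection. */
  static int countElements(Collection<?> elements) {
    return elements.size();
  }

  /** Fallback for a bare iterator: a full pass that also consumes the iterator. */
  static int countElements(Iterator<?> iterator) {
    int count = 0;
    while (iterator.hasNext()) {
      iterator.next();
      count++;
    }
    return count;
  }

  /** Copies the iterator into a list once, so the elements can be reused and counted. */
  static <T> List<T> materialize(Iterator<T> iterator) {
    List<T> list = new ArrayList<T>();
    while (iterator.hasNext()) {
      list.add(iterator.next());
    }
    return list;
  }

  public static void main(String[] args) {
    List<Integer> values = Arrays.asList(3, 1, 4);

    // The list knows its own size, so no iteration is needed.
    System.out.println(countElements(values));

    // An iterator has to be walked, and it is exhausted afterwards.
    Iterator<Integer> it = values.iterator();
    System.out.println(countElements(it));
    System.out.println(it.hasNext());

    // Materializing once gives both the elements and their count.
    List<Integer> copy = materialize(values.iterator());
    System.out.println(copy.size());
  }
}
```

One caveat with this approach: Hadoop reducers may reuse the same value object
across iterations, so copying reducer values into a list would need a real copy
of each element, and the list has to fit in memory.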
> Boolean data cannot get any recommendations by running RecommenderJob
> ----------------------------------------------------------------------
>
> Key: MAHOUT-463
> URL: https://issues.apache.org/jira/browse/MAHOUT-463
> Project: Mahout
> Issue Type: Test
> Components: Collaborative Filtering
> Affects Versions: 0.4
> Reporter: Hui Wen Han
> Fix For: 0.4
>
> Attachments: MAHOUT-463-2.patch, MAHOUT-463.patch
>
>
> Boolean preference data cannot get any recommendations by running RecommenderJob,
> but it does get results with the build from about April.
> It seems that running RowSimilarityJob does not return any data.