[ 
https://issues.apache.org/jira/browse/MAHOUT-463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12897675#action_12897675
 ] 

Hui Wen Han commented on MAHOUT-463:
------------------------------------

In Class AbstractDistributedVectorSimilarity

 
 /**
   * computes the number of elements in the {...@link Iterator}
   */
  protected int countElements(Iterator<?> iterator) {
    int count = 0;
    while (iterator.hasNext()) {
      count++;
      iterator.next();
    }
    return count;
  }

countElements may cause performance issue,

Can we change it to array or list ,then we just get the length of array or size 
of list ?

also change some mapper and reducer class to use array or list ?

I am doing a test , the RowSimilarityJob-CooccurrencesMapper-SimilarityReducer 
used about 2 hours,



> Boolean Data can not get any recommendation by running RecommnenderJob
> ----------------------------------------------------------------------
>
>                 Key: MAHOUT-463
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-463
>             Project: Mahout
>          Issue Type: Test
>          Components: Collaborative Filtering
>    Affects Versions: 0.4
>            Reporter: Hui Wen Han
>             Fix For: 0.4
>
>         Attachments: MAHOUT-463-2.patch, MAHOUT-463.patch
>
>
> Boolean Preference Data can not get any recommendation by run RecommnenderJob,
> but It can get data using the build in about Aprl.
> It seems that it can not get any data by running RowSimilarityJob 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to