[
https://issues.apache.org/jira/browse/MAHOUT-483?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12899897#action_12899897
]
Han Hui Wen edited comment on MAHOUT-483 at 8/18/10 12:37 PM:
---------------------------------------------------------------
As I recall, the keys were already sorted in the previous phase.
So the work of the original reducer can be done in the Mapper phase of the new
job. The new job then needs no reducer phase, and the data does not have to be
copied from the mappers to the reducers.
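
To illustrate the idea, here is a minimal plain-Java sketch (no Hadoop
dependency) of doing the grouping in a single pass over already-sorted
entries. The class MapSideGrouping, the Entry type, and the method
entriesToVectors are hypothetical names for illustration only, not Mahout
code. It assumes that all entries of a row arrive consecutively, which in a
real job also requires that a row is never split across two input splits. In
Hadoop itself, a job without a reducer phase is normally configured with
job.setNumReduceTasks(0).

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

public class MapSideGrouping {

    /** One (row, column, value) entry; a hypothetical stand-in for the job's input. */
    public static final class Entry {
        public final int row;
        public final int column;
        public final double value;

        public Entry(int row, int column, double value) {
            this.row = row;
            this.column = column;
            this.value = value;
        }
    }

    /**
     * Builds one sparse vector (column -> value) per row in a single pass.
     * Assumes the entries are already sorted by row, so no shuffle or
     * reduce step is needed to bring a row's entries together.
     */
    public static Map<Integer, Map<Integer, Double>> entriesToVectors(List<Entry> sortedEntries) {
        Map<Integer, Map<Integer, Double>> vectors = new LinkedHashMap<>();
        Map<Integer, Double> current = null;
        int currentRow = 0;
        for (Entry e : sortedEntries) {
            if (current == null || e.row != currentRow) {
                // A row that shows up again after we moved on means the
                // sortedness assumption does not hold for this input.
                if (vectors.containsKey(e.row)) {
                    throw new IllegalStateException("input is not sorted by row: " + e.row);
                }
                current = new TreeMap<>();
                vectors.put(e.row, current);
                currentRow = e.row;
            }
            current.put(e.column, e.value);
        }
        return vectors;
    }

    public static void main(String[] args) {
        List<Entry> entries = Arrays.asList(
                new Entry(1, 0, 0.5),
                new Entry(1, 2, 1.0),
                new Entry(2, 1, 3.0));
        System.out.println(entriesToVectors(entries));
    }
}
```

If the input turns out not to be sorted, the sketch fails fast instead of
silently producing partial vectors; in that case the reducer phase would
still be needed.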
was (Author: huiwenhan):
I remembered that we already sorted the key in previous phase .
So we can do the task in original reducer task in the new Mapper phase,
The new job should no reducer phase,also need copy the data from mapper to
reducer.
> Job RowSimilarityJob-Mapper-EntriesToVectorsReducer improvement
> ----------------------------------------------------------------
>
> Key: MAHOUT-483
> URL: https://issues.apache.org/jira/browse/MAHOUT-483
> Project: Mahout
> Issue Type: Test
> Components: Collaborative Filtering
> Affects Versions: 0.4
> Reporter: Han Hui Wen
> Fix For: 0.4
>
> Attachments: screenshot-3.jpg
>
>
> The Mapper phase of the job RowSimilarityJob-Mapper-EntriesToVectorsReducer
> takes a long time but does no actual work.
> Can we move the EntriesToVectorsReducer work into the Mapper phase and
> configure the job with no Reducer phase?
> That should improve performance.