[
https://issues.apache.org/jira/browse/MAHOUT-483?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12899902#action_12899902
]
Han Hui Wen commented on MAHOUT-483:
-------------------------------------
Or we need decrease the output data size of the previous job, normalizing the
similarityValue from double to long/int is a way.
> Job RowSimilarityJob-Mapper-EntriesToVectorsReducer improvement
> ----------------------------------------------------------------
>
> Key: MAHOUT-483
> URL: https://issues.apache.org/jira/browse/MAHOUT-483
> Project: Mahout
> Issue Type: Test
> Components: Collaborative Filtering
> Affects Versions: 0.4
> Reporter: Han Hui Wen
> Fix For: 0.4
>
> Attachments: screenshot-3.jpg
>
>
> the Mapper phase of Job RowSimilarityJob-Mapper-EntriesToVectorsReducer uses
> too long time but do nothing actually.
> Can we move the EntriesToVectorsReducer task to Mapper phase ? set the
> Reducer phase is null,
> So can improve the performance.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.