[ https://issues.apache.org/jira/browse/MAHOUT-827?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13120284#comment-13120284 ]
Sean Owen commented on MAHOUT-827:
----------------------------------
The concept is fine -- does it deserve a whole other pipeline? It seems like
this could bolt on as an option to dump the item-item similarities. In fact I
think we had a JIRA for that. I'd imagine this is simpler, just exposing some
intermediate output.
> Another version of RecommenderJob that broadcasts the similarity matrix
> -----------------------------------------------------------------------
>
> Key: MAHOUT-827
> URL: https://issues.apache.org/jira/browse/MAHOUT-827
> Project: Mahout
> Issue Type: New Feature
> Components: Collaborative Filtering
> Affects Versions: 0.6
> Reporter: Sebastian Schelter
> Assignee: Sebastian Schelter
> Attachments: MAHOUT-827.patch
>
>
> Add another version of RecommenderJob that computes the item similarities via
> RowSimilarityJob but assumes that the resulting similarity matrix fits into
> the memory of the mappers in the cluster. After the item similarity
> computation is done, the similarities are broadcast via Hadoop's
> distributed cache and the recommendations are computed in a map-only pass
> over the data afterwards.
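
To make the proposal in the description concrete, here is a minimal, non-authoritative sketch of the map-only idea in plain Python. It is not the attached patch and uses no Hadoop or Mahout APIs. The function names (build_lookup, recommend, map_only_pass), the toy data, and the similarity-weighted-average scoring are assumptions made for illustration only. The one property it mirrors from the description is that the item-item similarities fit in memory, so each user's recommendations can be computed independently with no shuffle or reduce step.

```python
from collections import defaultdict

# Hypothetical toy similarities; in the proposed job these would come from
# RowSimilarityJob and reach each mapper via the distributed cache.
SIMILARITIES = {
    ("A", "B"): 0.9,
    ("A", "C"): 0.1,
    ("B", "C"): 0.5,
}


def build_lookup(similarities):
    """Expand symmetric (item, item) pairs into a per-item neighbour table."""
    lookup = defaultdict(dict)
    for (i, j), sim in similarities.items():
        lookup[i][j] = sim
        lookup[j][i] = sim
    return lookup


def recommend(user_prefs, lookup, top_n=2):
    """Score one user's unseen items (assumed scoring: similarity-weighted average)."""
    numer = defaultdict(float)
    denom = defaultdict(float)
    for item, pref in user_prefs.items():
        for other, sim in lookup.get(item, {}).items():
            if other in user_prefs:
                continue  # only recommend items the user has not rated
            numer[other] += sim * pref
            denom[other] += abs(sim)
    scores = {it: numer[it] / denom[it] for it in numer if denom[it] > 0}
    # Highest score first; item id breaks ties deterministically.
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))[:top_n]


def map_only_pass(all_user_prefs, similarities):
    """Each user is handled independently against the in-memory matrix."""
    lookup = build_lookup(similarities)  # built once, like a mapper's setup step
    return {user: recommend(prefs, lookup) for user, prefs in all_user_prefs.items()}
```

Calling map_only_pass({"u1": {"A": 4.0, "B": 2.0}}, SIMILARITIES) returns a dict that maps each user to a list of (item, score) pairs. Because no user's result depends on another's, the loop over users is the part that would be split across mappers.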