[
https://issues.apache.org/jira/browse/MAHOUT-884?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13157676#comment-13157676
]
Jake Mannix commented on MAHOUT-884:
------------------------------------
There's a better way to do this, however, if both matrices have different
column dimensionalities (which is always the case except for very special
concatenations), you can put the cardinalities of each matrix in the
Configuration, and then do the simple Identity Mapper and have the Reducer use
the cardinality of the pair of Vectors to decide which goes first.
No custom partitioner, comparator, and it's all simple scalable map-reduce.
> Matrix Concatenate utility
> --------------------------
>
> Key: MAHOUT-884
> URL: https://issues.apache.org/jira/browse/MAHOUT-884
> Project: Mahout
> Issue Type: New Feature
> Components: Integration
> Reporter: Lance Norskog
> Priority: Minor
> Attachments: MAHOUT-884.patch, MAHOUT-884.patch
>
>
> Utility to concatenate matrices stored as SequenceFiles of vectors.
> Each pair in the SequenceFile is the IntWritable row number and a
> VectorWritable.
> The input and output files may skip rows.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira