You want row N from matrix A and B?

Map A to (row # -> row vector) and likewise for B. Both are input paths.
Then the reducer has, for each row, both row vectors.

You can add a custom Writable with more info about, say, which vector
is which if you like.

On Tue, Aug 3, 2010 at 10:12 AM, Shannon Quinn <[email protected]> wrote:
> Right, that's the concept I'd had in mind, but to me it always seem to come
> down to having access to two distinct vectors at the same time, and I'm not
> sure how you would do that. In my case, both the dimensions and the data
> types of the two vectors are identical, so we're talking a merged vector of
> floats that's simply twice as long as the original, but how to gain access
> to the two original vectors at the same time is beyond me.
>
> But still, the data types I need that would do this for me are in a newer
> Hadoop commit, I'm just trying to figure out how to build the commit
> manually and integrate it to the core Hadoop .jar file.
>
> Any suggestions that would speed along either of these options are most
> welcome.
>
> Shannon

Reply via email to