Re: providing the same input to more than one Map task

Kai Voigt Mon, 25 Apr 2011 06:18:02 -0700

Hi,

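I'd use the distributed cache to store a local copy of the vector on every 
mapper machine. Each map task can then read the whole vector from local disk 
while it processes its own chunk of the matrix.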
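
To illustrate the idea, here is a minimal sketch in plain Java that simulates the map and reduce steps from the question without any Hadoop dependency. It assumes the matrix arrives as (row, column, value) entries and that the vector is small enough to hold in memory. The class name MatrixVectorSketch, the Entry type and the multiply driver are hypothetical names made up for this example. In a real job, the vector file would be registered with the distributed cache and loaded once per map task before any records are processed; here it is simply passed in as an array.

```java
import java.util.AbstractMap;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

class MatrixVectorSketch {

    /** One matrix entry m[i,j]; the (row, col, value) layout is an assumption. */
    static final class Entry {
        final int row;
        final int col;
        final double value;

        Entry(int row, int col, double value) {
            this.row = row;
            this.col = col;
            this.value = value;
        }
    }

    /** Map step: emit (i, m[i,j] * v[j]). v is the full vector held by every map task. */
    static Map.Entry<Integer, Double> map(Entry e, double[] v) {
        return new AbstractMap.SimpleEntry<>(e.row, e.value * v[e.col]);
    }

    /** Reduce step: sum all partial products that share the same row key i. */
    static double reduce(List<Double> partials) {
        double sum = 0.0;
        for (double p : partials) {
            sum += p;
        }
        return sum;
    }

    /** Stand-in for the framework: runs map on each chunk, groups by key, then reduces. */
    static double[] multiply(List<List<Entry>> chunks, double[] v, int rows) {
        Map<Integer, List<Double>> grouped = new TreeMap<>();
        for (List<Entry> chunk : chunks) { // one chunk per simulated map task
            for (Entry e : chunk) {
                Map.Entry<Integer, Double> kv = map(e, v);
                grouped.computeIfAbsent(kv.getKey(), k -> new ArrayList<>()).add(kv.getValue());
            }
        }
        double[] result = new double[rows]; // rows with no entries stay 0.0
        for (Map.Entry<Integer, List<Double>> g : grouped.entrySet()) {
            result[g.getKey()] = reduce(g.getValue());
        }
        return result;
    }

    public static void main(String[] args) {
        // M = [[1, 2], [3, 4]] split across two chunks, v = [5, 6]
        double[] v = {5.0, 6.0};
        List<List<Entry>> chunks = Arrays.asList(
            Arrays.asList(new Entry(0, 0, 1.0), new Entry(1, 1, 4.0)),
            Arrays.asList(new Entry(0, 1, 2.0), new Entry(1, 0, 3.0)));
        System.out.println(Arrays.toString(multiply(chunks, v, 2))); // prints [17.0, 39.0]
    }
}
```

The point of the sketch is that map only needs one matrix entry plus the shared vector, so how the matrix is split into chunks does not affect the result.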


Kai

On 22.04.2011 at 21:15, Alexandra Anghelescu wrote:

> Hi all,
> 
> I am trying to perform matrix-vector multiplication using Hadoop.
> So I have matrix M in a file, and vector v in another file. How can I make
> it so that each Map task will get the whole vector v and a chunk of matrix
> M?
> Basically I want my map function to output key-value pairs (i,m[i,j]*v[j]),
> where i is the row number, and j the column number. And the reduce function
> will sum up all the values with the same key i, and that will be the ith
> element of my result vector.
> Or can you suggest another way to do it?
> 
> 
> Thanks,
> Alexandra Anghelescu

-- 
Kai Voigt
[email protected]
