Hi, I'd use the distributed cache to store the vector on every mapper machine locally.
Kai Am 22.04.2011 um 21:15 schrieb Alexandra Anghelescu: > Hi all, > > I am trying to perform matrix-vector multiplication using Hadoop. > So I have matrix M in a file, and vector v in another file. How can I make > it so that each Map task will get the whole vector v and a chunk of matrix > M? > Basically I want my map function to output key-value pairs (i,m[i,j]*v[j]), > where i is the row number, and j the column number. And the reduce function > will sum up all the values with the same key i, and that will be the ith > element of my result vector. > Or can you suggest another way to do it? > > > Thanks, > Alexandra Anghelescu -- Kai Voigt [email protected]
