Hama Trunk doesn't work for large matrices multiplication with
mapred.task.timeout and scanner.timeout exception. I tried 1,000,000 *
1,000,000 matrix multiplication on 100 node. (Rests are good)

To reduce read operation of duplicated block, I thought as describe
below. But, each map processing seems too large.

----
// c[i][k] += a[i][j] * b[j][k];

map() {
  SubMatrix a = value.get();

  for (RowResult row : scan) {
     collect : c[i][k] = a * b[j][k];
  }
}

reduce() {
  c[i][k] += c[i][k];
}
----

Should we increase {mapred.task.timeout and scanner.timeout}?
or any good idea?

-- 
Best Regards, Edward J. Yoon @ NHN, corp.
[email protected]
http://blog.udanax.org

Reply via email to