[Hadoop Wiki] Trivial Update of "DataProcessingBenchmarks" by udanax

Apache Wiki Thu, 15 Jan 2009 16:59:28 -0800

Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Hadoop Wiki" for change 
notification.


The following page has been changed by udanax:
http://wiki.apache.org/hadoop/DataProcessingBenchmarks

------------------------------------------------------------------------------
  ||<bgcolor="#E5E5E5">Time  ||4.43 sec ||172.30 sec||108.01 sec||77.41 sec||66.30 sec||60.78 sec||
  
  ----
- == Hbase Matrix computations Benchmarks ==
-  * You can download the Hbase [http://incubator.apache.org/hama Matrix 
Package] for Map/Reduce-based Parallel Matrix Computations (still under 
development)
  
- ==== MapReduce Flow ====
+ I have also investigated many traditional parallel-processing methods and 
experimented with some high-level processing (e.g. matrix algebra, graph 
algorithms) using Hadoop/Hbase/MapReduce. The only way to increase speed 
linearly was locality (write all the data, even if that duplicates effort). 
As the number of nodes increases, the IO channels increase linearly.
  
-  * The multiplication requires (n + 1) full table scans irrespective of the 
number of mappers. 
-  * Each map processor requires O(n^2) for communication and O(n^3/mappers) 
for computation. 
- 
- ----
-
