Realtime Map Reduce = Supercomputing for the Masses?

Martin Jaggi Sat, 31 May 2008 19:52:49 -0700

Concerning real-time Map Reduce within (and not only between) machines (multi-core & GPU), e.g. the Phoenix and Mars frameworks:

I'm really interested in very fast Map Reduce tasks, i.e. without much disk access. With the rise of multi-core systems, this could get more and more interesting, and could maybe even lead to something like 'super-computing for everyone', or is that overstating it? Anyway, I was pleasantly surprised to see the recent Phoenix (http://csl.stanford.edu/~christos/sw/phoenix/) implementation of Map Reduce for multi-core CPUs (they won the best paper award at HPCA'07).

GPU computing has also been in the news again recently, pushed by Nvidia (see CUDA: http://www.nvidia.com/object/cuda_showcase.html ), and a Map Reduce implementation for GPUs called Mars has now become available as well:

http://www.cse.ust.hk/gpuqp/Mars_tr.pdf

The Mars people say at the end of their paper: "We are also interested in integrating Mars into the existing Map Reduce implementations such as Hadoop so that the Map Reduce framework can take the advantage of the parallelism among different machines as well as the parallelism within each machine."

What do you think of this, especially about the multi-core approach? Do you think these needs are already served by the current InMemoryFileSystem of Hadoop or not? Are there any plans for 'integrating' one of the two frameworks above? Or would the same be achieved by reducing the significant overhead of intermediate data pairs (https://issues.apache.org/jira/browse/HADOOP-3366 )?


Any comments?
