Re: More Hadoop Design Question

Owen O'Malley Thu, 06 Nov 2008 13:04:25 -0800


On Nov 6, 2008, at 2:30 PM, Ricky Ho wrote:

Hmmm, sounds like the combiner is invoked after the map() processcompleted for the file split.

No. The data path is complex, but the combiner is called when the mapoutputs are being spilled to disk. So roughly, the map will outputkey, value pairs until the io.sort.mb buffer is full, the contents aresorted and fed to the combiner. The output of the combiner is writtento disk. When there are enough spills on disk, it will merge themtogether, call the combiner, and write to disk. When the map finishes,the final multi-level merge is done.

Since the reduce is also doing multi-level sort, it will also call thecombiner when a merge is done (other than the final merge, which isfed into the reduce).

That means, before the combiner function starts, all theintermediate map() output result will be kept in memory ? Anycomment on the memory footprint consumption ?


The memory is bound by io.sort.mb.

I think a sufficient condition is just to make sure the reduce taskwill not COMPLETE before all the map tasks has completed. We don'tneed to make sure the reduce task will not START before all mapstasks has completed. This can be achieved easily by letting theiterator.next() call within the reduce() method blocked.

*Sigh* no. The reduce function is invoked once per a unique key. Thereduce function is called in ascending order of keys. Since the finalmap may return a's when previously you've only seen b's and c's. Youcan't call the reduce with the b, you can't later call it with the a.

There is another potential issue in the reduce() API, can youexplain why do we need to expose the OutputCollector to the reduce()method ? For example, is it possible that the "key" in theoutput.collect() be a different key from the reduce methodparameter ? What happen if two reduce method (start with differentkeys) writing their output on the same key ?

The reduce is allowed to have different input and output types. Thereare *four* type parameters.


Reducer<KeyIn, ValueIn, KeyOut, ValueOut>

The output of the reduce is not resorted. If the reduce doesn't usethe same key as the input, the output of the reduce won't be sorted.Duplicate keys on reduce output (either within the same reduce ordifferent ones, is not a problem for the framework.)

However, this requires some change of the current Reducerinterface. Currently the reduce() method is called once per key.We want that to be called once per map result (within the samekey). What I mean is the following interface ...

There is a library that lets you run a chain of maps, if that is thesemantics you are looking for. For map/reduce, the sort is a veryfundamental piece. If you don't need sort between map and reduce, youcan set reduces = 0 and run much faster.

Does it make sense ?


Not really. Most map/reduce applications need the other semantics.

-- Owen

Re: More Hadoop Design Question

Reply via email to