especially at scale!
And we are testing on >1000 node clusters with long jobs. We see
lots of failures per job.
On Aug 24, 2007, at 4:20 PM, Ted Dunning wrote:
On 8/24/07 12:11 PM, "Doug Cutting" <[EMAIL PROTECTED]> wrote:
>
>> Using the same logic, streaming reduce outputs to
>> the next map and reduce steps (before the first reduce is complete)
>> should also provide speedup.
>
> Perhaps, but the bookkeeping required in the jobtracker might be onerous.
> The failure modes are more complex, complicating recovery.
Frankly, I find Doug's arguments about reliability fairly compelling.
Map-reduce^n is not the same, nor is it entirely analogous to pipe-style
programming. It feels the same, but there are very important differences
that I wasn't thinking of when I made this suggestion. The most important
is the issue of reliability. In a large cluster, failure is a continuous
process, not an isolated event. As such, the problems of having to roll
back an entire program due to node failure are not something that can be
treated as unusual. That makes Doug's comments about risk more on-point
than the potential gains. It is very easy to imagine scenarios where the
possibility of program roll-back results in very large average run-times
while the chained reduce results in only incremental savings. This isn't
a good thing to bet on.
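The rollback argument can be made concrete with a back-of-the-envelope model
(not from the thread; the rates and stage counts below are invented for
illustration). If failures arrive at a Poisson rate and a failure forces a
restart from scratch, the expected completion time of a task of length T is
the standard (e^(lambda*T) - 1)/lambda. Pipelining reduce output into the
next map makes the whole chain one such task, while materializing each
stage's output makes only the failed stage restart:

```python
import math

def expected_runtime(T, lam):
    """Expected completion time of a task of length T (hours) that must
    restart from scratch on failure, with failures arriving at Poisson
    rate lam (per hour). Standard result: (e^(lam*T) - 1) / lam."""
    return (math.exp(lam * T) - 1.0) / lam

# Hypothetical numbers: 10 hours of total work, and one job-affecting
# failure every 2 hours on average (lam = 0.5/hour) -- plausible on a
# >1000-node cluster where "failure is a continuous process".
lam = 0.5

# Pipelined chain: any failure rolls back the entire 10-hour program.
pipelined = expected_runtime(10.0, lam)

# Checkpointed chain: 5 stages of 2 hours each; a failure only
# re-runs the stage it hit, because earlier outputs are materialized.
staged = 5 * expected_runtime(2.0, lam)

print(f"pipelined: {pipelined:.1f} h, staged: {staged:.1f} h")
```

Under these assumed numbers the pipelined chain's expected runtime is more
than an order of magnitude worse than the staged one, which is exactly the
"very large average run-times" versus "incremental savings" trade-off above.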