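On chaining, please refer to ChainReducer:
http://hadoop.apache.org/common/docs/r0.20.2/api/org/apache/hadoop/mapred/lib/ChainReducer.html
Also check out ChainMapper in the same package. Together they let a single
job run as [MAP+ / REDUCE MAP*], which covers the M -> M -> R case.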
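
To make the M -> M -> R shape concrete, here is a rough sketch in plain Java.
It only models the data flow and does not use the Hadoop API at all; the
names ChainSketch, runMaps and reduceCount are made up for illustration, and
the word-count logic is just an assumed example workload.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;
import java.util.function.Function;

// Hypothetical stand-in for a chained job: M -> M -> R, JDK only.
class ChainSketch {

    // Feed every record through each map stage in order.
    // A stage may emit zero, one, or many records per input record.
    static List<String> runMaps(List<String> input,
                                List<Function<String, List<String>>> stages) {
        List<String> current = input;
        for (Function<String, List<String>> stage : stages) {
            List<String> next = new ArrayList<>();
            for (String record : current) {
                next.addAll(stage.apply(record));
            }
            current = next;
        }
        return current;
    }

    // Reduce step: group identical keys and count them.
    // TreeMap keeps the keys sorted so the output order is stable.
    static Map<String, Integer> reduceCount(List<String> keys) {
        Map<String, Integer> counts = new TreeMap<>();
        for (String key : keys) {
            counts.merge(key, 1, Integer::sum);
        }
        return counts;
    }

    static Map<String, Integer> run(List<String> lines) {
        // First mapper: split a line into whitespace-separated tokens.
        Function<String, List<String>> tokenize =
            line -> Arrays.asList(line.trim().split("\\s+"));
        // Second mapper: lower-case each token, dropping empty ones.
        Function<String, List<String>> lower =
            word -> word.isEmpty()
                ? Collections.<String>emptyList()
                : Collections.singletonList(word.toLowerCase());

        List<String> mapped = runMaps(lines, Arrays.asList(tokenize, lower));
        return reduceCount(mapped);
    }

    public static void main(String[] args) {
        // prints {map=3, reduce=2}
        System.out.println(run(Arrays.asList("Map map Reduce", "reduce MAP")));
    }
}
```

In Hadoop itself the equivalent wiring would go through ChainMapper.addMapper
and ChainReducer.setReducer / ChainReducer.addMapper on the JobConf; see the
Javadoc linked above for the exact arguments.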

See also
http://www.mail-archive.com/[email protected]/msg00541.html

On Tue, Jun 8, 2010 at 6:02 PM, Torsten Curdt <[email protected]> wrote:

> At Cocoon we have a construct that we called pipelines.
> And frankly speaking, I am currently missing something similar in Hadoop.
>
> It would be so great if the API were to allow things like this
>
> M -> M -> R
>
> M -> R -> R
>
> M --> R
>    \-> R
>
> M --> R
> M -/
>
> Of course also supporting multiple inputs and outputs.
>
> The current Hadoop processing model feels overly restrictive to me.
> But it could just be me not knowing better.
>
> Any comments?
>
> cheers
> --
> Torsten
>
