Re: Sorting values of a key in reduce phase

Toby DiPasquale Tue, 07 Aug 2007 17:05:12 -0700

On 8/7/07, novice user <[EMAIL PROTECTED]> wrote:
>
> Hi,
>    In the reduce phase, with outputValueGroupingComparator, we can sort all
> keys and then group the values of a particular key together and send them to
> the reduce() method. Is there a way to sort the values of a particular key
> efficiently before they reach the reduce() method?


I'm not sure if this is what you want, but Google's MapReduce
framework has the concept of an optional second key parameter for
subsorting of records (saw it in some slides by Jeff Dean). Perhaps
you could integrate this into Hadoop as a patch and submit it?
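
To make the second-key idea concrete, here is a rough sketch in plain
Python. It is not Hadoop's or Google's API, only an illustration of the
principle: sort on the pair (key, secondary key), then group on the key
alone, so each group's values arrive already ordered. The function name,
the tuple layout and the sample records are all made up for this example.

```python
from itertools import groupby
from operator import itemgetter


def group_with_sorted_values(records):
    """Simulate a shuffle that sorts on (key, secondary) but groups on key only.

    `records` is an iterable of (key, secondary, value) tuples. The name and
    the tuple layout are hypothetical, chosen only for this illustration.
    """
    # Sort on the composite (key, secondary) pair, as a framework sort might.
    ordered = sorted(records, key=itemgetter(0, 1))
    # Group on the primary key alone, so each simulated reduce call sees its
    # values already ordered by the secondary key.
    for key, group in groupby(ordered, key=itemgetter(0)):
        yield key, [value for _, _, value in group]


# Made-up sample records: (key, secondary key, value).
records = [
    ("b", 2, "b-two"),
    ("a", 3, "a-three"),
    ("a", 1, "a-one"),
    ("b", 1, "b-one"),
    ("a", 2, "a-two"),
]
result = list(group_with_sorted_values(records))
```

The point of the design is that the sort does the ordering work once for
the whole data set, so the reduce side never has to buffer and sort a
key's values itself.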

-- 
Toby DiPasquale
