You can specify that a group of keys should go to the same reducer (and so
end up on the same host), but I have never encountered a situation where you
need to know beforehand exactly which host a particular key should go to, and
I am not sure that can be done. Just out of curiosity, why do you need this
kind of control over reduction?
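
To illustrate the grouping part, here is a minimal sketch in plain Java. It
assumes keys are strings of the form "group:rest"; the class name
KeyGroupPartitioner, the groupOf helper and the ':' separator are all
hypothetical. In a real job the same logic would go in a subclass of
org.apache.hadoop.mapreduce.Partitioner, inside its
getPartition(key, value, numPartitions) method, registered with
job.setPartitionerClass(...). Note that this only picks the reduce partition;
the framework still decides which host runs each reduce task.

```java
// Hypothetical stand-alone helper that mirrors the contract of
// Partitioner.getPartition without depending on the Hadoop jars.
final class KeyGroupPartitioner {
    private KeyGroupPartitioner() {}

    // Assumed key layout: "group:rest". Keys without ':' form their own group.
    static String groupOf(String key) {
        int sep = key.indexOf(':');
        return sep < 0 ? key : key.substring(0, sep);
    }

    // Every key in the same group maps to the same partition number.
    // Masking with Integer.MAX_VALUE clears the sign bit so the result
    // is never negative.
    static int getPartition(String key, int numPartitions) {
        if (numPartitions <= 0) {
            throw new IllegalArgumentException("numPartitions must be positive");
        }
        return (groupOf(key).hashCode() & Integer.MAX_VALUE) % numPartitions;
    }
}
```

With this, "user42:click" and "user42:view" always reach the same reduce
task, whichever host that task happens to be scheduled on.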

Hari

On Sat, Dec 18, 2010 at 11:54 PM, Jane Chen <jxchen_us_1...@yahoo.com> wrote:

> But how does this help me request which *host* to schedule the reduce task
> to?
> Thanks,
> Jane
>
> --- On *Sat, 12/18/10, Hari Sreekumar <hsreeku...@clickable.com>* wrote:
>
>
> From: Hari Sreekumar <hsreeku...@clickable.com>
> Subject: Re: How to Influence Reduce Task Location.
> To: mapreduce-user@hadoop.apache.org
> Date: Saturday, December 18, 2010, 10:16 AM
>
>
> Hi Jane,
>
> The Partitioner class can be used to achieve this:
> http://hadoop.apache.org/mapreduce/docs/r0.21.0/api/org/apache/hadoop/mapreduce/Partitioner.html
>
> Thanks,
> Hari
>
> On Sat, Dec 18, 2010 at 11:13 PM, Jane Chen <jxchen_us_1...@yahoo.com> wrote:
>
> Hi All,
>
> Is there any way to influence where a reduce task is run?  We have a case
> where we'd like to choose the host to run the reduce task based on the
> task's input key.
>
> Any suggestion is greatly appreciated.
>
> Thanks,
> Jane