And, as a follow-up, yes, we use the partitioner class to achieve this. Our partioner runs a hashing algorithm which ensures that a given user key will always map to a specific shard #.

DR

On 12/18/2010 01:16 PM, Hari Sreekumar wrote:
Hi Jane,

          The partitioner class can be used to achieve this. (
http://hadoop.apache.org/mapreduce/docs/r0.21.0/api/org/apache/hadoop/mapreduce/Partitioner.html
).

Thanks,
Hari

On Sat, Dec 18, 2010 at 11:13 PM, Jane Chen<jxchen_us_1...@yahoo.com>wrote:

Hi All,

Is there anyway to influence where a reduce task is run?  We have a case
where we'd like to choose the host to run the reduce task based on the
task's input key.

Any suggestion is greatly appreciated.

Thanks,
Jane

Reply via email to