To exclude the data destined for a specific reducer, should I write a partitioner that does this? Or should I have a map function that checks which reduce task each output record would go to?

Can anyone give me some suggestions?

By the way, I really do want to exclude the data for one reduce task. So I will run more than one reducer, even if one of them does not get any input data.
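To make the map-side option concrete, here is a rough sketch of what I have in mind. It is plain Java with no Hadoop dependency, and it assumes the job uses the default hash partitioning, which computes `(key.hashCode() & Integer.MAX_VALUE) % numReduceTasks`. The class and method names (`ReducerExclusionSketch`, `partitionFor`, `keysToEmit`) are hypothetical, and the keys are plain `String`s for illustration; in a real job the key would be a `Writable` such as `Text`, whose `hashCode()` is not the same as `String.hashCode()`. As far as I understand, a partitioner has to return a valid partition index for every record, so it can redirect records but not drop them; the dropping would have to happen in the map function.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class ReducerExclusionSketch {

    // Same formula as Hadoop's default hash partitioning:
    // clear the sign bit, then take the remainder by the number of reducers.
    static int partitionFor(String key, int numReduceTasks) {
        return (key.hashCode() & Integer.MAX_VALUE) % numReduceTasks;
    }

    // Hypothetical map-side filter: keep only the keys whose partition
    // is NOT the excluded one. In a real mapper this check would guard
    // the call that emits the record.
    static List<String> keysToEmit(List<String> keys, int numReduceTasks,
                                   int excludedPartition) {
        List<String> kept = new ArrayList<>();
        for (String key : keys) {
            if (partitionFor(key, numReduceTasks) != excludedPartition) {
                kept.add(key);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        // Two reducers; partitions are zero-indexed, so "reduce task 2"
        // is assumed here to be partition 1.
        List<String> keys = Arrays.asList("a", "b", "c", "d");
        System.out.println(keysToEmit(keys, 2, 1));
    }
}
```

With this approach the job can still be configured with two reduce tasks; the excluded one simply receives no records.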


On 11-03-2015 10:28, xeonmailinglist-gmail wrote:
Hi,

I have a job with 3 map tasks and 2 reduce tasks, but I want to exclude the data that would go to reduce task 2. This means that only reducer 1 will produce output, and the other one will be empty, or will not even execute.

How can I do this in MapReduce?

Example Job Execution


Thanks,
