To keep data from reaching a specific reducer, should I write a custom
partitioner that does this? Or should I have the map function check which
reduce task the output would go to?
Can anyone give me some suggestions?
By the way, I really do want to exclude the data for one reduce task. So I
will run more than 1 reducer, even if one of them gets no input data.
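
To make the second option concrete, here is a minimal sketch of the map-side
check, written in plain Java without Hadoop dependencies so it can be run on
its own. It reuses the formula of Hadoop's default HashPartitioner,
(key.hashCode() & Integer.MAX_VALUE) % numReduceTasks, to predict the
partition and skips the record when it would land on the excluded reducer.
The names ReducerFilter, EXCLUDED_PARTITION, partitionFor and shouldEmit are
hypothetical, and I am assuming "reduce task 2" means partition index 1,
since partitions are numbered from 0. In a real job the same check would sit
in the map function just before context.write, and the hash would come from
the actual key type (for example Text), which may differ from
String.hashCode.

```java
import java.util.ArrayList;
import java.util.List;

class ReducerFilter {
    // Hypothetical setting: 0-based index of the reduce task that should
    // receive no data ("reduce task 2" in the question, by assumption).
    static final int EXCLUDED_PARTITION = 1;

    // Same arithmetic as Hadoop's default HashPartitioner, applied here to
    // a plain String key for illustration only.
    static int partitionFor(String key, int numReduceTasks) {
        return (key.hashCode() & Integer.MAX_VALUE) % numReduceTasks;
    }

    // True if the record may be emitted, false if it would go to the
    // excluded reduce task and should be dropped on the map side.
    static boolean shouldEmit(String key, int numReduceTasks) {
        return partitionFor(key, numReduceTasks) != EXCLUDED_PARTITION;
    }

    public static void main(String[] args) {
        int numReduceTasks = 2;
        List<String> emitted = new ArrayList<>();
        for (String key : new String[] {"a", "b", "c", "d"}) {
            // In a mapper, this is where context.write(key, value) would
            // be called only for keys that pass the check.
            if (shouldEmit(key, numReduceTasks)) {
                emitted.add(key);
            }
        }
        System.out.println(emitted); // prints [b, d]
    }
}
```

As far as I understand, a partitioner alone cannot drop records, because
getPartition has to return an index between 0 and numPartitions - 1; it can
only redirect them to another reducer. That is why the filtering is sketched
on the map side here.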
On 11-03-2015 10:28, xeonmailinglist-gmail wrote:
Hi,
I have a job with 3 map tasks and 2 reduce tasks, but I want to exclude
the data that would go to reduce task 2. This means that only reducer 1
will produce output, and the other one will be empty or will not even
execute.
How can I do this in MapReduce?
Example Job Execution
Thanks,
--