Re: Prune out data to a specific reduce task

xeonmailinglist-gmail Fri, 13 Mar 2015 07:49:18 -0700

Hi,

The only obstacle is to know to which partition the map output would go.
1 ~ From the map method, how can I know to which partition the output go?

2 ~ Can I call |getPartition(K key, V value, int numReduceTasks)| fromthe map function?


Thanks,




On 13-03-2015 03:25, Naganarasimha G R (Naga) wrote:

I think Drake's comment
"In the map method, records would be ignored with no output.collect()or context.write()."is most valid way to do it as it will avoid further processingdownstream and hence less resources would be consumed, as unwantedrecords are pruned at the source itself.
Is there any obstacle from doing this in your map method ?

Regards,
Naga
------------------------------------------------------------------------
*From:* xeonmailinglist-gmail [[email protected]]
*Sent:* Thursday, March 12, 2015 22:17
*To:* [email protected]
*Subject:* Fwd: Re: Prune out data to a specific reduce task
If I use the partitioner, I must be able to tell map reduce to notexecute values from a certain reduce tasks.
The method |public int getPartition(K key, V value, intnumReduceTasks)| must always return a partition. I can’t return -1.Thus, I don’ t know how to tell Mapreduce to not execute data from apartition. Any suggestion?
———— Forwarded Message ————

Subject: Re: Prune out data to a specific reduce task

Date: Thu, 12 Mar 2015 12:40:04 -0400

From: Fei Hu [email protected] <http://mailto:[email protected]>

Reply-To: [email protected]

To: [email protected]

Maybe you could use Partitioner.class to solve your problem.
On Mar 11, 2015, at 6:28 AM, xeonmailinglist-gmail<[email protected] <mailto:[email protected]>> wrote:
Hi,
I have this job that has 3 map tasks and 2 reduce tasks. But, I wantto excludes data that will go to the reduce task 2. This means that,only reducer 1 will produce data, and the other one will be empty, oreven it doesn't execute.
How can I do this in MapReduce?

<ExampleJobExecution.png>


Thanks,

--
--


--
--

Re: Prune out data to a specific reduce task

Reply via email to