Re: Map reduce classes

Arun C Murthy Tue, 15 Apr 2008 22:46:55 -0700

Implement a custom OutputFormat (http://hadoop.apache.org/core/docs/current/mapred_tutorial.html#Job+Output) and a custom RecordWriter(http://hadoop.apache.org/core/docs/current/mapred_tutorial.html#RecordWriter). In the write() method of the yourRecordWriter you can do the filtering based on keys.


Arun


On Apr 15, 2008, at 3:20 PM, Aayush Garg wrote:

HI,
Could you please suggest what classes and another better way toachieve
this:-

I am getting outputcollector in my reduce function as:

 void reduce(....)
{
   output.collect(key,value);
}

Here key is Text,
and value is Custom class type that I generated from rcc.
1. After all calls are complete to reduce function, I need toeliminatecertain rows in this outputformat based on keys. I guess I need tostorethis outputformat in some static Map(declared in Reduce class) andneed to
do required operations from the Main function. Is this right approach?
2. This stored outputformat I want to use for another Map Reducejob. Whatclasses and format should I use in the previous step so that I caneasily
use this as input in another program invoking MR job.

Regards,
Garg

Re: Map reduce classes

Reply via email to