HI,

The current structure of my program is::
Upper class{
class Reduce{
  reduce function(K1,V1,K2,V2){
        // I count the frequency for each key
     // Add output in  HashMap(Key,value)  instead  of  output.collect()
   }
 }

void run()
 {
      runjob();
     // Now eliminate top frequency keys in HashMap built in reduce function
here because only now hashmap is complete.
     // Write this hashmap to a file in such a format so that I can use this
hashmap in next MapReduce job and key of this hashmap is taken as key in
mapper function of that Map Reduce. ?? How and which format should I
choose??? Is this design and approach ok?

  }

  public static void main() {}
}
I hope you have got my question.

Thanks,


On Wed, Apr 16, 2008 at 8:33 AM, Amar Kamat <[EMAIL PROTECTED]> wrote:

> Aayush Garg wrote:
>
> > Hi,
> >
> > Are you sure that another MR is required for eliminating some rows?
> > Can't I
> > just somehow eliminate from main() when I know the keys which are needed
> > to
> > remove?
> >
> >
> >
> Can you provide some more details on how exactly are you filtering?
> Amar
>
>
>

Reply via email to