Then just do what I said: set the number of reducers to zero. That should
run only the map phase.
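
A rough sketch of what I mean, using the old org.apache.hadoop.mapred API.
The class name MapOnlyJob, the job name, and the use of IdentityMapper as a
stand-in for your own mapper are placeholders of mine, not something from
your job, and the input/output paths are assumed to come from the command
line:

```java
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.mapred.FileInputFormat;
import org.apache.hadoop.mapred.FileOutputFormat;
import org.apache.hadoop.mapred.JobClient;
import org.apache.hadoop.mapred.JobConf;
import org.apache.hadoop.mapred.lib.IdentityMapper;

// Hypothetical driver class; rename to fit your project.
public class MapOnlyJob {
    public static void main(String[] args) throws Exception {
        JobConf conf = new JobConf(MapOnlyJob.class);
        conf.setJobName("map-only");

        // Placeholder mapper: replace with the mapper that does your parallel work.
        conf.setMapperClass(IdentityMapper.class);

        // Zero reduce tasks: no sort, no shuffle, no reduce phase.
        // Each map task writes its output straight to the output path.
        conf.setNumReduceTasks(0);

        // Assumed: args[0] is the input directory, args[1] the output directory.
        FileInputFormat.setInputPaths(conf, new Path(args[0]));
        FileOutputFormat.setOutputPath(conf, new Path(args[1]));

        JobClient.runJob(conf);
    }
}
```

With zero reducers you get one output file per map task, so expect many
part files in the output directory rather than a few merged ones.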

2008/7/21 Zhou, Yunqing <[EMAIL PROTECTED]>:

> Since the whole data set is 5 TB, the identity reducer still costs a lot
> of time.
>
> On Mon, Jul 21, 2008 at 5:09 PM, Christian Ulrik Søttrup <[EMAIL PROTECTED]>
> wrote:
>
> > Hi,
> >
> > You can simply use the built-in reducer that just copies the map output:
> >
> > conf.setReducerClass(org.apache.hadoop.mapred.lib.IdentityReducer.class);
> >
> > Cheers,
> > Christian
> >
> >
> > Zhou, Yunqing wrote:
> >
> >> I only use it to do something in parallel, but the reduce step will cost
> >> me several additional days. Is it possible to make Hadoop skip the reduce
> >> step?
> >>
> >> Thanks
> >>
> >>
> >>
> >
> >
>



-- 
The University of Edinburgh is a charitable body, registered in Scotland,
with registration number SC005336.
