then just do what i said --set the number of reducers to zero. this should just run the mapper phase
2008/7/21 Zhou, Yunqing <[EMAIL PROTECTED]>: > since the whole data is 5TB. the Identity reducer still cost a lot of > time. > > On Mon, Jul 21, 2008 at 5:09 PM, Christian Ulrik Søttrup <[EMAIL PROTECTED]> > wrote: > > > Hi, > > > > you can simply use the built in reducer that just copies the map output: > > > > conf.setReducerClass(org.apache.hadoop.mapred.lib.IdentityReducer.class); > > > > Cheers, > > Christian > > > > > > Zhou, Yunqing wrote: > > > >> I only use it to do something in parallel,but the reduce step will cost > me > >> additional several days, is it possible to make hadoop do not use a > reduce > >> step? > >> > >> Thanks > >> > >> > >> > > > > > -- The University of Edinburgh is a charitable body, registered in Scotland, with registration number SC005336.
