Or you can output the data in the keys and NullWritable as the value. That ways you'll get only unique data...
On 9/4/09, zhang jianfeng <[email protected]> wrote: > Hi Sugandha , > > If you only want to the value, you need to set the key as NullWritable in > reduce. > > e.g. > output.collect(NullWritable.get(), value); > > > > On Fri, Sep 4, 2009 at 12:46 AM, Sugandha Naolekar > <[email protected]>wrote: > >> Hello! >> >> Running a simple MR job, and setting a replication factor of 2. >> Now, >> after its execution, the output is split in files named as part-00000 and >> so >> on. I want to ask is, can't we avoid these keys or key values to get >> printed >> in output files? I mean, I am getting the output in the files in key-value >> pair. I want just the data and not the keys(integers) in it. >> >> >> >> >> -- >> Regards! >> Sugandha >> > -- Amandeep Khurana Computer Science Graduate Student University of California, Santa Cruz
