I'm using SequenceFileInputFormat, but then what do I write in my mappers? Each mapper takes a split from the sequence file input and then sorts only its own split?! I don't want that.
Thanks,
Mark

On Wed, May 25, 2011 at 2:09 AM, Luca Pireddu <[email protected]> wrote:

> On May 25, 2011 01:43:22 Mark question wrote:
> > Thanks Luca, but what other way to sort a directory of sequence files?
> >
> > I don't plan to write a sorting algorithm in mappers/reducers, but hoping
> > to use the sequenceFile.sorter instead.
> >
> > Any ideas?
> >
> > Mark
>
> Maybe this class can help?
>
>   org.apache.hadoop.mapreduce.lib.input.SequenceFileInputFormat
>
> With it you should be able to read (key,value) records from your sequence
> files and then do whatever you need with them.
>
> --
> Luca Pireddu
> CRS4 - Distributed Computing Group
> Loc. Pixina Manna Edificio 1
> Pula 09010 (CA), Italy
> Tel: +39 0709250452
