M/R job with a single reducer would do the job. This way you can utilize distributed sort and merge/combine/dedupe key/values as you wish.
On 5/11/11, 丛林 <congli...@gmail.com> wrote: > Hi all, > > There is lots of SequenceFile in HDFS, how can I merge them into one > SequenceFile? > > Thanks for you suggestion. > > -Lin >