The three ideas that I had are not options in my company. (1) I cannot upgrade their Hadoop system. (2) I cannot change that the job must run in mapreduce, and not mapred. (3) And I cannot change that I receive multiple small file inputs. Are there any other utilities or contrib items that might be my last option, other than cracking open each of my input sequence files and writing out larger ones manually? Thank you,
Anna On Thu, Sep 27, 2012 at 4:12 PM, Bejoy Ks <[email protected]> wrote: > Hi Anna > > CombineFileInputFormat is included within in the mapreduce package in the > latest releases > > > http://hadoop.apache.org/docs/r1.0.3/api/org/apache/hadoop/mapreduce/lib/input/CombineFileInputFormat.html > > > Regards > Bejoy KS >
