Re: Read MapFileOutputFormat output in ascending key order

Andrzej Bialecki Wed, 13 Feb 2008 11:04:13 -0800

Doug Cutting wrote:

Would one of the SequenceFile#merge() methods suffice?
http://hadoop.apache.org/core/docs/current/api/org/apache/hadoop/io/SequenceFile.Sorter.html#merge(java.util.List,%20org.apache.hadoop.fs.Path)

Hmm ... the idea was to avoid the cost of additional I/O, and read theparts directly as they are. If I understand it correctly, theSorter.merge() needs to rewrite the files in order to merge them, whichmeans a lot of I/O.



--
Best regards,
Andrzej Bialecki     <><
 ___. ___ ___ ___ _ _   __________________________________
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com

Re: Read MapFileOutputFormat output in ascending key order

Reply via email to