Using SequenceFile instead of TextFiles

maha Fri, 04 Mar 2011 19:34:00 -0800

Hi, 

I have 2 questions:


1) Is a  SequenceFile more efficient than TextFiles for input?  ... I think 
TextFiles will be processed by TextInputFormat into sequenceFiles inside 
hadoop. So will SequenceFiles (ie.binary input Files) be more efficient ?

2) If I decided to use SequenceFiles as InputFormat, Do I need to stick to the 
header protocol defined in 
http://hadoop.apache.org/common/docs/current/api/org/apache/hadoop/io/SequenceFile.html
 ?


Thanks everyone,

Maha

Using SequenceFile instead of TextFiles

Reply via email to