Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Lucene-hadoop Wiki" for 
change notification.

The following page has been changed by mingyang:
http://wiki.apache.org/lucene-hadoop/FAQ

------------------------------------------------------------------------------
   
  E.g. 
'''org.apache.hadoop.mapred.Sort``Validator.Record``Stats``Checker.Non``Splitable``Sequence``File``Input``Format'''
 in 
[http://svn.apache.org/viewvc/lucene/hadoop/trunk/src/test/org/apache/hadoop/mapred/SortValidator.java
 src/test/org/apache/hadoop/mapred/SortValidator.java]
  
+ In addition to implementing the InputFormat interface and having 
isSplitable(...) returning false, it is also necessary to implement the 
RecordReader interface for returning the whole content of the input file. 
(default is LineRecordReader, which splits the file into separate lines)
+ 
  The other, quick-fix option, is to set 
[http://lucene.apache.org/hadoop/hadoop-default.html#mapred.min.split.size 
mapred.min.split.size] to large enough value.
  
  

Reply via email to