Currently Nutch creates a Lucene multifile index, and makes sure any
existing compound index is converted  to multifile by using the
IndexWriter.setUseCompoundFile(false) method.

 

This is done whenever an IndexWriter is opened in the following methods:

org.apache.nutch.indexer.Indexer.getRecordWriter

org.apache.nutch.indexer.IndexSorter.sort

org.apache.nutch.indexer.IndexMerger.merge

 

Is there a technical constraint as to why Nutch should ensure usage of
multifile (or prevent compound) and not allow the type to be set by a
property setting?

 

Does anyone object to/support  a patch to allow this to be configurable?

 

Best regards,

Alan

_________________________

Alan Tanaman

iDNA Solutions

 

-------------------------------------------------------------------------
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
_______________________________________________
Nutch-developers mailing list
Nutch-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nutch-developers

Reply via email to