Currently when you index files in nutch The entire file contents are saved .This should not happen .
The segments folder is almost the same size of the folder which you are indexing . Is there a way to turn off saving local file content There is a thing called file.content.ignored But there is a NO-Implement sign . Is this right that the file contents are also saved while indexing considering the fact that the file is local and i can access it without nutch storing the entire content . Is there a patch which can turn off this feature
