If I'm not wrong, segments are used by nutch to store parsed data, and after update the crawldb, and finally build an index.
But when the crawl is finished, for a next recrawl nutch only need the last crawldb? so not my old segments. And for building the new index, it only needs my new indexes and the old index, not the old segs. (and it seems for the search engine part segment are used just for "show page cache copy" ?) It could be nice space saved to delete the segments, but do my argument is right? -- View this message in context: http://www.nabble.com/When-can-I-delete-segments--%28still-usefull-after-indexing-%29-tf3413479.html#a9511359 Sent from the Nutch - User mailing list archive at Nabble.com. ------------------------------------------------------------------------- Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT & business topics through brief surveys-and earn cash http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV _______________________________________________ Nutch-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-general
