if I crawl with the -noParsing tag can I trash the fetcher output folder after I parse that segment?
searching, is there any way to limit the results to only english, or only websites ending in extensions I define(.com.edu.org.net.tv.info.gov.biz.us.cc.name.bz)? thanks, -J ------------------------------------------------------- SF.Net email is sponsored by: Discover Easy Linux Migration Strategies from IBM. Find simple to follow Roadmaps, straightforward articles, informative Webcasts and more! Get everything you need to get up to speed, fast. http://ads.osdn.com/?ad_id=7477&alloc_id=16492&op=click _______________________________________________ Nutch-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-general
