I am running a fetch process on a P4 2.6ghz HT with 1 gig ram and 4x120 gig drives in a raid 0 (stripped) format.
nutch fetch process was started from a fresh index with the entire dmoz rdf process imported. The first 12 hours or so of the fetch seemed to sustain 3.5 to 4.5mpbs and now it seems to be swapped out about 1.5 gigs and having a 10-15 minute pause after a 10-15 minute fetch (while kswapd appears to go nuts swapping) Is there some tweaking i can do to fix this? Too many threads going?? (pretty much a default nutch config). If i kill the process and restart - i know i will have to touch the fetch.done files and such, but will i have to re-inject the db with the urls to spider or will they be picked up the next time i restart after a few db analyze processes? ------------------------------------------------------- This SF.Net email is sponsored by: IBM Linux Tutorials Free Linux tutorial presented by Daniel Robbins, President and CEO of GenToo technologies. Learn everything from fundamentals to system administration.http://ads.osdn.com/?ad_id=1470&alloc_id=3638&op=click _______________________________________________ Nutch-developers mailing list [EMAIL PROTECTED] https://lists.sourceforge.net/lists/listinfo/nutch-developers
