I am running a fetch process on a P4 2.6ghz HT with 1
gig ram and 4x120 gig drives in a raid 0 (stripped)
format.

nutch fetch process was started from a fresh index
with the entire dmoz rdf process imported.  The first
12 hours or so of the fetch seemed to sustain 3.5 to
4.5mpbs and now it seems to be swapped out about 1.5
gigs and having a 10-15 minute pause after a 10-15
minute fetch (while kswapd appears to go nuts
swapping)

Is there some tweaking i can do to fix this? Too many
threads going?? (pretty much a default nutch config).

If i kill the process and restart - i know i will have
to touch the fetch.done files and such, but will i
have to re-inject the db with the urls to spider or
will they be picked up the next time i restart after a
few db analyze processes?


-------------------------------------------------------
This SF.Net email is sponsored by: IBM Linux Tutorials
Free Linux tutorial presented by Daniel Robbins, President and CEO of
GenToo technologies. Learn everything from fundamentals to system
administration.http://ads.osdn.com/?ad_id=1470&alloc_id=3638&op=click
_______________________________________________
Nutch-developers mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/nutch-developers

Reply via email to