On May 7, 2007, at 6:34 PM, Brian Whitman wrote:

> OK. I got the crash again today on different urls. It's strange
> because I've been crawling quite regularly with the same nutch
> setup for a while. It's possible that a recent system-level change
> is getting in the way (I'm running debian with a recent full upgrade.)
>
> After googling the culprit for a while I found this trick:
>
> -Djava.net.preferIPv4Stack=true
>
> I'm running a large crawl with it now and will let you know if I
> don't see it in a while!
Just a note: after adding that flag in bin/nutch, I've crawled 500K pages over a couple of days without a problem, using the same start URL set that had been crashing it. So if anyone else gets the segfault, that flag might be the fix.

-Brian

_______________________________________________
Nutch-developers mailing list
Nutch-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nutch-developers
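For anyone wanting to try the same workaround, here is a rough sketch of one way to pass the flag to the JVM. It assumes your copy of bin/nutch forwards a NUTCH_OPTS variable to the java command line; that is an assumption worth checking against your own script, and the message above does not say exactly where in bin/nutch the flag was added.

```shell
# Sketch only: append the IPv4 flag to the JVM options before starting a crawl.
# NUTCH_OPTS is assumed to be the variable bin/nutch hands to java; verify
# this in your own bin/nutch before relying on it.
NUTCH_OPTS="${NUTCH_OPTS:-} -Djava.net.preferIPv4Stack=true"
export NUTCH_OPTS

# Show what will be passed along, so the setting can be confirmed by eye.
echo "JVM options: $NUTCH_OPTS"
```

If your script does not read such a variable, the same -D option can instead be added by hand to the line in bin/nutch that invokes java.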