On May 7, 2007, at 6:34 PM, Brian Whitman wrote:
> OK. I got the crash again today on different URLs. It's strange
> because I've been crawling quite regularly with the same Nutch
> setup for a while. It's possible that a recent system-level change
> is getting in the way (I'm running Debian with a recent full upgrade.)
> After googling the culprit for a while I found this trick:
> -Djava.net.preferIPv4Stack=true
> I'm running a large crawl with it now and will let you know if I
> don't see it in a while!
Just a note: after adding that flag in bin/nutch, I've crawled 500K
pages over a couple of days without a problem, on the same start URL
set that had been crashing it.
So if anyone else gets the segfault, this flag might be worth trying.
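For anyone who wants to try it, here is a minimal sketch of one way to pass the flag. It assumes your copy of bin/nutch hands the NUTCH_OPTS environment variable through to the java command; check the script first, and if it doesn't, add the flag directly to the java invocation inside bin/nutch instead.

```shell
# Sketch: append the IPv4 flag to the JVM options before running a crawl.
# NUTCH_OPTS is an assumption: verify that your bin/nutch passes it to java.
NUTCH_OPTS="${NUTCH_OPTS:-} -Djava.net.preferIPv4Stack=true"
export NUTCH_OPTS

# Show what will be handed to the JVM.
echo "NUTCH_OPTS=$NUTCH_OPTS"
```

Run this in the same shell session before starting the crawl so the setting is inherited by bin/nutch.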
-Brian