Hi

i am running into the exact same problem using Nutch1.2


2011-05-28 13:48:42,019 WARN  parse.ParseUtil - TIMEOUT parsing XXX with
org.apache.nutch.parse.html.HtmlParser@424c414
2011-05-28 13:48:42,019 WARN  parse.ParseUtil - Unable to successfully parse
content XXX of type text/html
2011-05-28 13:48:42,352 WARN  parse.ParseUtil - TIMEOUT parsing XXX with
org.apache.nutch.parse.html.HtmlParser@424c414
2011-05-28 13:48:42,352 WARN  parse.ParseUtil - Unable to successfully parse
content XXX of type text/html
2011-05-28 13:48:42,492 INFO  fetcher.Fetcher - -activeThreads=10,
spinWaiting=6, fetchQueues.totalSize=500
2011-05-28 13:48:43,492 INFO  fetcher.Fetcher - -activeThreads=10,
spinWaiting=6, fetchQueues.totalSize=500
2011-05-28 13:48:44,492 INFO  fetcher.Fetcher - -activeThreads=10,
spinWaiting=6, fetchQueues.totalSize=500
2011-05-28 13:48:44,493 WARN  fetcher.Fetcher - Aborting with 10 hung
threads.

This happened around depth 2-3 of 8. (undeterministically)
Have u resolved this problem at your end?

I've read thru all the previous email threads related to this topic, also
tried some of the suggested solution. None of them work (those are on older
versions of nutch anyways, and fixes have already been commited to the
branch)

Does any one else have any clue? this is a blocker!
Thx!

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Nutch-1-2-fetcher-aborting-with-N-hung-threads-tp2411724p2997311.html
Sent from the Nutch - User mailing list archive at Nabble.com.

Reply via email to