Michael Cafarella wrote:
A few people have mentioned (I saw this in the archives) that they've seen
the fetcher hang. I ran into this several times recently.
The problem seems to be in the PDF-parsing plugin.
Here's a recent stack trace:
"http://www.ehdo.go.jp/kyoto/antore/anto.pdf" prio=1 tid=0xa78446a0
This document seems to hang PDFBox, even with the latest version.
I've added it to a PDFBox bug report:
http://sourceforge.net/tracker/index.php?func=detail&aid=918220&group_id=78314&atid=552832
I've temporarily disabled the parse-pdf plugin until this is fixed.
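
A longer-term option, short of disabling the plugin, might be to run each
parse under a time limit so that one bad document cannot stall the fetcher.
The following is only a rough sketch under stated assumptions: the class name
ParseWithTimeout and the method runWithTimeout are hypothetical and not part
of Nutch or PDFBox, it uses java.util.concurrent and lambdas from newer Java
releases, and it only helps if the stuck parser responds to thread interrupts
(otherwise the worker thread is simply abandoned as a daemon).

```java
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;

// Hypothetical helper, not part of Nutch or PDFBox.
public class ParseWithTimeout {

    /**
     * Runs parseTask on a worker thread and waits at most timeoutMillis.
     * Returns the task's result, or null if the time limit was exceeded.
     * Exceptions thrown by the task surface as ExecutionException.
     */
    public static <T> T runWithTimeout(Callable<T> parseTask, long timeoutMillis)
            throws Exception {
        ExecutorService executor = Executors.newSingleThreadExecutor(r -> {
            Thread t = new Thread(r, "parse-worker");
            t.setDaemon(true); // a stuck parser should not keep the JVM alive
            return t;
        });
        Future<T> future = executor.submit(parseTask);
        try {
            return future.get(timeoutMillis, TimeUnit.MILLISECONDS);
        } catch (TimeoutException e) {
            // Requests an interrupt; has no effect on code that ignores interrupts.
            future.cancel(true);
            return null;
        } finally {
            executor.shutdownNow();
        }
    }
}
```

A caller would wrap the actual parse call in the Callable and treat a null
result as "skip this document and log the URL".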
Doug
_______________________________________________
Nutch-general mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/nutch-general