Michael Cafarella wrote:
A few people have mentioned (I saw this in the archives) that they've seen the fetcher hang. I ran into this several times recently. The problem seems to be in the PDF-parsing plugin.

  Here's a recent stack trace:

"http://www.ehdo.go.jp/kyoto/antore/anto.pdf"; prio=1 tid=0xa78446a0

This document seems to hang PDFBox, including the latest version.

I added this to a bug report for PDFBox.

http://sourceforge.net/tracker/index.php?func=detail&aid=918220&group_id=78314&atid=552832

I've temporarily disabled the parse-pdf plugin until this is fixed.

Doug

