I had the following error when crawling on pdf files (it happened on 2 pdf files):
http://lyra:85/ExternalDocumentation/BusinessComponentApproach_Chapter2.pdf: failed(2,0): Can't be handled as pdf document. java.io.EOFException: Unexpected end of ZLIB input stream Any idea? -- View this message in context: http://www.nabble.com/Unexpected-end-of-ZLIB-input-stream-when-parsing-pdf-files-tp20223893p20223893.html Sent from the Nutch - User mailing list archive at Nabble.com.
