[
https://issues.apache.org/jira/browse/PDFBOX-1202?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13180286#comment-13180286
]
Ilija Pavlic commented on PDFBOX-1202:
--------------------------------------
I have edited the issue to include information about the encryption. I have
used the inserted snippet to decrypt the document. Either way, I was able to
extract text from pages 1-396 without problems. Therefore I do not think the
issue should be closed.
> org.apache.pdfbox.filter.FlateFilter decode SEVERE: Stop reading corrupt
> stream
> -------------------------------------------------------------------------------
>
> Key: PDFBOX-1202
> URL: https://issues.apache.org/jira/browse/PDFBOX-1202
> Project: PDFBox
> Issue Type: Bug
> Components: Text extraction
> Affects Versions: 1.6.0
> Reporter: Ilija Pavlic
> Priority: Critical
> Attachments: IATAUnitedStates.pdf
>
>
> Error "org.apache.pdfbox.filter.FlateFilter decode SEVERE: Stop reading
> corrupt stream" thrown when extracting text.
> The document was loaded with the following snippet:
> document = PDDocument.load("C:/Users/ilija.pavlic/Downloads/TestInput.pdf");
> if (document.isEncrypted()) {
> try {
> document.decrypt("");
> } catch (InvalidPasswordException e) {
> System.err.println("Error: Document is encrypted with a password.");
> System.exit(1);
> }
> }
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira