[
https://issues.apache.org/jira/browse/PDFBOX-256?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Andreas Lehmkühler updated PDFBOX-256:
--------------------------------------
Attachment: PDFBOX256-ELERAP_100_cfl.pdf
> Error decrypting document
> -------------------------
>
> Key: PDFBOX-256
> URL: https://issues.apache.org/jira/browse/PDFBOX-256
> Project: PDFBox
> Issue Type: Bug
> Components: Text extraction
> Priority: Minor
> Attachments: PDFBOX256-ELERAP_100_cfl.pdf
>
>
> [imported from SourceForge]
> http://sourceforge.net/tracker/index.php?group_id=78314&atid=552832&aid=1682201
> Originally submitted by nobody on 2007-03-16 08:15.
> I get the following exception:
> WARNING: IOException while extracting full-text of
> file:/home/sintek/papers/baseweb/ECRA/ELERAP_100_cfl.pdf
> java.io.IOException: Error decrypting document, details: Error: The supplied
> password does not match either the owner or user password in the document.
> at org.pdfbox.util.PDFTextStripper.writeText(PDFTextStripper.java:208)
> at org.pdfbox.util.PDFTextStripper.getText(PDFTextStripper.java:149)
> [...]
> when trying to extract the text from the attached PDF. It has no password as
> far as I can tell, it opens fine in acrobat, gpdf, pdftotext, etc. pdfinfo
> tells me it's encrypted though.
> Any ideas?
> [attachment on SourceForge]
> http://sourceforge.net/tracker/download.php?group_id=78314&atid=552832&aid=1682201&file_id=220882
> ELERAP_100_cfl.pdf (application/pdf), 9371 bytes
> "encrypted" PDF file
> [comment on SourceForge]
> Originally sent by nobody.
> Logged In: NO
> I noticed something regarding this, the PDDocument.decrypt("") does not set
> the encryption dictionary to null. This results in PDFTextStripper to try and
> decrypt the document again (I think it looks at the encryptionDictionary?).
> If allready decrypted, this results in a simular error as stated in this
> ticket.
> I now decrypt with the following code:
> pdDoc.decrypt( passWord ); //password is mostly an empty string
> pdDoc.setEncryptionDictionary(null);
> pdDoc.getDocument().getTrailer().setItem("Encrypt",null);
> That goes fine.
> [comment on SourceForge]
> Originally sent by gromgull.
> Logged In: YES
> user_id=185674
> Originator: NO
> Ah - indeed it is. I was confused since both acrobat and gpdf, etc. were able
> to show the content without prompting me for a password. So you can encrypte
> PDFs for text-extraction? Isn't that a hopeless idea? Oh well :)
> [comment on SourceForge]
> Originally sent by ng_aldridge.
> Logged In: YES
> user_id=1111818
> Originator: NO
> This file *is* encrypted. I just loaded it up into Acrobat and it's secured
> with a password.
> [comment on SourceForge]
> Originally sent by gromgull.
> Logged In: YES
> user_id=185674
> Originator: NO
> Ah yes - that was me reporting that. Sorry for not logging in.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.