[ https://issues.apache.org/jira/browse/TIKA-223?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12730186#action_12730186 ]
Chris A. Mattmann commented on TIKA-223: ---------------------------------------- Hi All: Is there a patch for this issue, which includes e.g., a unit test for verification? I'm trying to get the 0.4 RC together and this is one of the 2 only remaining open issues. Please let me know. I'll use the same approach as for the other open issue. If I don't hear back from anyone in the next 48 hrs, I'll assume it's OK to push this to 0.5. If I do hear back and there is significant support to push this to 0.5, I'll do so sooner. If not, can we get a patch together ASAP? I'd like to cut an RC this week and call for a vote? My vote is -1 that this is a blocker for 0.4 and +1 to move this to 0.5. Cheers, Chris > PDFParser causes Problems when using encrypted PDF documents > ------------------------------------------------------------ > > Key: TIKA-223 > URL: https://issues.apache.org/jira/browse/TIKA-223 > Project: Tika > Issue Type: Bug > Components: parser > Affects Versions: 0.3 > Environment: Java 1.5.x on MAC, WIN, LIN > Reporter: Joachim Zittmayr > Fix For: 0.4 > > Original Estimate: 2h > Remaining Estimate: 2h > > The PDFParser.parse() method decrypts the document for the metadata already > and then passes it over to PDF2XHTML.process(), which in turn calls the > inherited getText(). This calls writeText(), which tries to decrypt the > PDDocument again, but this will fail as it is already decrypted. The solution > would be to override writeText(), without the document.isEncrypted check. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.