[ https://issues.apache.org/jira/browse/TIKA-2121?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15590306#comment-15590306 ]
Tim Allison commented on TIKA-2121: ----------------------------------- Y, sorry, again, PDFBox's ExtractText doesn't exercise PDAnnotations. I couldn't tell from this description where the ClassCastException originated from. > ClassCastException on a valid PDF (fixed in PDFBox) > --------------------------------------------------- > > Key: TIKA-2121 > URL: https://issues.apache.org/jira/browse/TIKA-2121 > Project: Tika > Issue Type: Bug > Components: parser > Affects Versions: 1.13 > Environment: Windows 7 x64, JVM 1.8.0_101 > Reporter: Seva Alekseyev > > When parsing the following valid PDF file: > https://dl.dropboxusercontent.com/u/92341073/Protheroe%20Clin%20Gastr%202009.pdf > the Tika parses throws a ClassCastException with a message that > "org.apache.pdfbox.cos.COSString cannot be cast to > org.apache.pdfbox.cos.COSDictionary" -- This message was sent by Atlassian JIRA (v6.3.4#6332)