Martin Deutsch created PDFBOX-4132:
--------------------------------------
Summary: Unknown dir object c=')' cInt=41 peek=')' peekInt=41 at
offset 2701
Key: PDFBOX-4132
URL: https://issues.apache.org/jira/browse/PDFBOX-4132
Project: PDFBox
Issue Type: Bug
Components: Parsing
Affects Versions: 2.0.8
Reporter: Martin Deutsch
Attachments: buggy.pdf
The attached document gives an "IOException: Unknown dir object..." when
parsing it, stack trace:
{code:java}
// code placeholder
java.io.IOException: Unknown dir object c=')' cInt=41 peek=')' peekInt=41 at
offset 2701
at
org.apache.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:959)
at org.apache.pdfbox.pdfparser.BaseParser.parseCOSArray(BaseParser.java:631)
at
org.apache.pdfbox.pdfparser.PDFStreamParser.parseNextToken(PDFStreamParser.java:174)
at
org.apache.pdfbox.contentstream.PDFStreamEngine.processStreamOperators(PDFStreamEngine.java:502)
at
org.apache.pdfbox.contentstream.PDFStreamEngine.processStream(PDFStreamEngine.java:469)
at
org.apache.pdfbox.contentstream.PDFStreamEngine.processPage(PDFStreamEngine.java:150)
at
org.apache.pdfbox.text.LegacyPDFStreamEngine.processPage(LegacyPDFStreamEngine.java:139)
at
org.apache.pdfbox.text.PDFTextStripper.processPage(PDFTextStripper.java:391)
at
org.apache.pdfbox.text.PDFTextStripper.processPages(PDFTextStripper.java:319)
at
org.apache.pdfbox.text.PDFTextStripper.writeText(PDFTextStripper.java:266)
at org.apache.pdfbox.text.PDFTextStripper.getText(PDFTextStripper.java:227)
{code}
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]