[
https://issues.apache.org/jira/browse/PDFBOX-4132?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Martin Deutsch updated PDFBOX-4132:
-----------------------------------
Description:
The attached document gives an "IOException: Unknown dir object..." when
parsing it, stack trace:
{code:java}
java.io.IOException: Unknown dir object c=')' cInt=41 peek=')' peekInt=41 at
offset 2701
at
org.apache.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:959)
at org.apache.pdfbox.pdfparser.BaseParser.parseCOSArray(BaseParser.java:631)
at
org.apache.pdfbox.pdfparser.PDFStreamParser.parseNextToken(PDFStreamParser.java:174)
at
org.apache.pdfbox.contentstream.PDFStreamEngine.processStreamOperators(PDFStreamEngine.java:502)
at
org.apache.pdfbox.contentstream.PDFStreamEngine.processStream(PDFStreamEngine.java:469)
at
org.apache.pdfbox.contentstream.PDFStreamEngine.processPage(PDFStreamEngine.java:150)
at
org.apache.pdfbox.text.LegacyPDFStreamEngine.processPage(LegacyPDFStreamEngine.java:139)
at
org.apache.pdfbox.text.PDFTextStripper.processPage(PDFTextStripper.java:391)
at
org.apache.pdfbox.text.PDFTextStripper.processPages(PDFTextStripper.java:319)
at
org.apache.pdfbox.text.PDFTextStripper.writeText(PDFTextStripper.java:266)
at org.apache.pdfbox.text.PDFTextStripper.getText(PDFTextStripper.java:227)
{code}
was:
The attached document gives an "IOException: Unknown dir object..." when
parsing it, stack trace:
{code:java}
// code placeholder
java.io.IOException: Unknown dir object c=')' cInt=41 peek=')' peekInt=41 at
offset 2701
at
org.apache.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:959)
at org.apache.pdfbox.pdfparser.BaseParser.parseCOSArray(BaseParser.java:631)
at
org.apache.pdfbox.pdfparser.PDFStreamParser.parseNextToken(PDFStreamParser.java:174)
at
org.apache.pdfbox.contentstream.PDFStreamEngine.processStreamOperators(PDFStreamEngine.java:502)
at
org.apache.pdfbox.contentstream.PDFStreamEngine.processStream(PDFStreamEngine.java:469)
at
org.apache.pdfbox.contentstream.PDFStreamEngine.processPage(PDFStreamEngine.java:150)
at
org.apache.pdfbox.text.LegacyPDFStreamEngine.processPage(LegacyPDFStreamEngine.java:139)
at
org.apache.pdfbox.text.PDFTextStripper.processPage(PDFTextStripper.java:391)
at
org.apache.pdfbox.text.PDFTextStripper.processPages(PDFTextStripper.java:319)
at
org.apache.pdfbox.text.PDFTextStripper.writeText(PDFTextStripper.java:266)
at org.apache.pdfbox.text.PDFTextStripper.getText(PDFTextStripper.java:227)
{code}
> Unknown dir object c=')' cInt=41 peek=')' peekInt=41 at offset 2701
> -------------------------------------------------------------------
>
> Key: PDFBOX-4132
> URL: https://issues.apache.org/jira/browse/PDFBOX-4132
> Project: PDFBox
> Issue Type: Bug
> Components: Parsing
> Affects Versions: 2.0.8
> Reporter: Martin Deutsch
> Priority: Major
> Attachments: buggy.pdf
>
>
> The attached document gives an "IOException: Unknown dir object..." when
> parsing it, stack trace:
> {code:java}
> java.io.IOException: Unknown dir object c=')' cInt=41 peek=')' peekInt=41 at
> offset 2701
> at
> org.apache.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:959)
> at
> org.apache.pdfbox.pdfparser.BaseParser.parseCOSArray(BaseParser.java:631)
> at
> org.apache.pdfbox.pdfparser.PDFStreamParser.parseNextToken(PDFStreamParser.java:174)
> at
> org.apache.pdfbox.contentstream.PDFStreamEngine.processStreamOperators(PDFStreamEngine.java:502)
> at
> org.apache.pdfbox.contentstream.PDFStreamEngine.processStream(PDFStreamEngine.java:469)
> at
> org.apache.pdfbox.contentstream.PDFStreamEngine.processPage(PDFStreamEngine.java:150)
> at
> org.apache.pdfbox.text.LegacyPDFStreamEngine.processPage(LegacyPDFStreamEngine.java:139)
> at
> org.apache.pdfbox.text.PDFTextStripper.processPage(PDFTextStripper.java:391)
> at
> org.apache.pdfbox.text.PDFTextStripper.processPages(PDFTextStripper.java:319)
> at
> org.apache.pdfbox.text.PDFTextStripper.writeText(PDFTextStripper.java:266)
> at
> org.apache.pdfbox.text.PDFTextStripper.getText(PDFTextStripper.java:227)
> {code}
>
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]