[
https://issues.apache.org/jira/browse/PDFBOX-5289?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17425082#comment-17425082
]
Michael Klink commented on PDFBOX-5289:
---------------------------------------
That syntax error apparently has been caused by someone removing the *Producer*
value without knowing exactly what they do.
Also there are other peculiarities, e.g. illegal values in the trailer
dictionary. Very likely the PDF originally had a cross reference stream, not a
cross reference table, at least that would explain those entries.
> java.io.IOException: Unknown dir object c='>' cInt=62 peek='>' peekInt=62 at
> offset 13377272 (start offset: 13377272)
> ---------------------------------------------------------------------------------------------------------------------
>
> Key: PDFBOX-5289
> URL: https://issues.apache.org/jira/browse/PDFBOX-5289
> Project: PDFBox
> Issue Type: Bug
> Components: Parsing
> Affects Versions: 2.0.24
> Reporter: Stephen
> Priority: Major
> Attachments: Diplomacy by Henry Kissinger (1).pdf
>
>
> {code:java}
> java.io.IOException: Unknown dir object c='>' cInt=62 peek='>' peekInt=62 at
> offset 13377272 (start offset: 13377272)java.io.IOException: Unknown dir
> object c='>' cInt=62 peek='>' peekInt=62 at offset 13377272 (start offset:
> 13377272) at
> org.apache.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:913) at
> org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryValue(BaseParser.java:154)
> at
> org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryNameValuePair(BaseParser.java:288)
> at
> org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionary(BaseParser.java:218)
> at
> org.apache.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:857) at
> org.apache.pdfbox.pdfparser.COSParser.parseFileObject(COSParser.java:907) at
> org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:876)
> at
> org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:796)
> at
> org.apache.pdfbox.pdfparser.COSParser.parseTrailerValuesDynamically(COSParser.java:2858)
> at org.apache.pdfbox.pdfparser.PDFParser.initialParse(PDFParser.java:175) at
> org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:226) at
> org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1228) at
> org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1128)
> {code}
> Please find the problematic PDF attached.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]