[ 
https://issues.apache.org/jira/browse/PDFBOX-5245?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17387086#comment-17387086
 ] 

funaiy commented on PDFBOX-5245:
--------------------------------

[~tilman], hi!

I have updated to 3.0.0-RC1, but it also fails. Do you have any other
suggestions for this issue?

 

Caused by: java.io.IOException: Unknown dir object c=')' cInt=41 peek=')' peekInt=41 at offset 5020
      at org.apache.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:865) ~[pdfbox-3.0.0-RC1.jar!/:3.0.0-RC1]
      at org.apache.pdfbox.pdfparser.BaseParser.parseCOSArray(BaseParser.java:634) ~[pdfbox-3.0.0-RC1.jar!/:3.0.0-RC1]
      at org.apache.pdfbox.pdfparser.PDFStreamParser.parseNextToken(PDFStreamParser.java:129) ~[pdfbox-3.0.0-RC1.jar!/:3.0.0-RC1]
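
In case it helps with narrowing this down, here is a minimal sketch of a helper that pulls the "at offset N" value out of the exception message and prints the bytes around that position. It uses only the JDK, not PDFBox. The class name OffsetDump and its methods are hypothetical, and it assumes the reported offset is a position in the raw file. That may hold for the 2.0.24 trace (raised while loading the document), but the 3.0.0-RC1 trace goes through PDFStreamParser, so that offset may instead be relative to a decoded content stream.

```java
import java.io.IOException;
import java.io.RandomAccessFile;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical diagnostic helper; not part of PDFBox.
public class OffsetDump {
    // Matches the "at offset N" part of the parser message quoted above.
    private static final Pattern OFFSET = Pattern.compile("at offset (\\d+)");

    /** Returns the offset named in the message, or -1 if none is present. */
    public static long offsetFrom(String message) {
        Matcher m = OFFSET.matcher(message);
        return m.find() ? Long.parseLong(m.group(1)) : -1L;
    }

    /**
     * Reads the bytes in [offset - radius, offset + radius), clamped to the
     * file bounds, and renders non-printable bytes as '.'.
     * Assumption: the offset is a raw file position.
     */
    public static String context(String path, long offset, int radius) throws IOException {
        try (RandomAccessFile raf = new RandomAccessFile(path, "r")) {
            long start = Math.max(0, offset - radius);
            long end = Math.min(raf.length(), offset + radius);
            if (end <= start) {
                return "";
            }
            byte[] buf = new byte[(int) (end - start)];
            raf.seek(start);
            raf.readFully(buf);
            StringBuilder sb = new StringBuilder(buf.length);
            for (byte b : buf) {
                sb.append(b >= 32 && b < 127 ? (char) b : '.');
            }
            return sb.toString();
        }
    }

    public static void main(String[] args) throws IOException {
        if (args.length < 2) {
            System.err.println("usage: OffsetDump <file.pdf> <exception message>");
            return;
        }
        long offset = offsetFrom(args[1]);
        if (offset < 0) {
            System.err.println("no offset found in message");
            return;
        }
        System.out.println(context(args[0], offset, 40));
    }
}
```

Run against the failing file with the message text as the second argument, this should show whatever surrounds the unexpected ')' character, which could be attached to the issue if the PDF itself cannot be shared.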

> IOException: Unknown dir object c=')' cInt=41 peek=')' peekInt=41 at offset 
> 8571 
> ---------------------------------------------------------------------------------
>
>                 Key: PDFBOX-5245
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-5245
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Parsing
>    Affects Versions: 2.0.24
>            Reporter: funaiy
>            Priority: Major
>
> We use PDFBox to extract the text and image content from PDF files, but some
> PDF files throw an IOException. The PDFBox version is 2.0.24; please help check.
>   
> {code:java}
> Caused by: java.io.IOException: Unknown dir object c=')' cInt=41 peek=')' peekInt=41 at offset 8571 (start offset: 8571)
>       at org.apache.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:913) ~[pdfbox-2.0.24.jar!/:2.0.24]
>       at org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryValue(BaseParser.java:154) ~[pdfbox-2.0.24.jar!/:2.0.24]
>       at org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryNameValuePair(BaseParser.java:288) ~[pdfbox-2.0.24.jar!/:2.0.24]
>       at org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionary(BaseParser.java:218) ~[pdfbox-2.0.24.jar!/:2.0.24]
>       at org.apache.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:857) ~[pdfbox-2.0.24.jar!/:2.0.24]
>       at org.apache.pdfbox.pdfparser.COSParser.parseFileObject(COSParser.java:907) ~[pdfbox-2.0.24.jar!/:2.0.24]
>       at org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:876) ~[pdfbox-2.0.24.jar!/:2.0.24]
>       at org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:796) ~[pdfbox-2.0.24.jar!/:2.0.24]
>       at org.apache.pdfbox.pdfparser.COSParser.parseTrailerValuesDynamically(COSParser.java:2858) ~[pdfbox-2.0.24.jar!/:2.0.24]
>       at org.apache.pdfbox.pdfparser.PDFParser.initialParse(PDFParser.java:175) ~[pdfbox-2.0.24.jar!/:2.0.24]
>       at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:226) ~[pdfbox-2.0.24.jar!/:2.0.24]
>       at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1228) ~[pdfbox-2.0.24.jar!/:2.0.24]
>       at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1128) ~[pdfbox-2.0.24.jar!/:2.0.24]
> {code}
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
