[ 
https://issues.apache.org/jira/browse/PDFBOX-773?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13080588#comment-13080588
 ] 

Marin Nozhchev commented on PDFBOX-773:
---------------------------------------

I re-tested with PDFBox 1.6.0 . Still fails but the error is different after 
some "ignoring object" output:

Caused by: java.io.IOException: Unknown dir object c=')' cInt=41 peek=')' 
peekInt=41 org.apache.pdfbox.io.PushBackInputStream@63ec239f
        at 
org.apache.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:1098)
        at 
org.apache.pdfbox.pdfparser.BaseParser.parseCOSArray(BaseParser.java:802)
        at 
org.apache.pdfbox.pdfparser.PDFStreamParser.parseNextToken(PDFStreamParser.java:262)
        at 
org.apache.pdfbox.pdfparser.PDFStreamParser.access$000(PDFStreamParser.java:46)
        at 
org.apache.pdfbox.pdfparser.PDFStreamParser$1.tryNext(PDFStreamParser.java:175)
        ... 17 more

See also attached log output - test_with_1.6.0_full.txt

> expected='obj' actual='o' error while parsing the attached PDF
> --------------------------------------------------------------
>
>                 Key: PDFBOX-773
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-773
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Parsing
>    Affects Versions: 1.2.0, 1.3.1
>         Environment: Sun JDK 6u21, Windows 7 x86
>            Reporter: Marin Nozhchev
>         Attachments: Andersens_Fairy_Tales.zip
>
>
> Parsing the attached PDF fails with the following error:
> Caused by: java.io.IOException: expected='obj' actual='o' 
> org.apache.pdfbox.io.PushBackInputStream@11d75b9
>       at org.apache.pdfbox.pdfparser.PDFParser.parseObject(PDFParser.java:509)
>       at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:179)
>       at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:859)
>       at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:826)
>         ...
> The same errors appears with the 1.1, 1.2 releases and the 1.3 latest trunk 
> so far - svn rev. 962879 .
> The file opens without warnings or any visible issues in the latest versions 
> of Foxit Reader and Acrobat Reader on Windows. The parsing was done via the 
> Apache Tika Parser.
> Thank you

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to