[ https://issues.apache.org/jira/browse/PDFBOX-980?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13007389#comment-13007389 ]

Timo Boehme commented on PDFBOX-980:
------------------------------------

Your PDF file is corrupt: it contains HTML code after the final %%EOF
marker. You can still parse the file if you specify 'force' parsing:
PDDocument.load(InputStream, new RandomAccessBuffer(), true).
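
To confirm this kind of corruption before resorting to force parsing, something
like the following might help. It is only a minimal sketch using the plain JDK:
the class and method names (TrailingDataCheck, lastEofIndex, trailingLength,
truncateAfterEof) are made up for illustration and are not part of PDFBox. It
assumes the whole file fits in memory and that the trailing junk does not itself
contain the string %%EOF.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

// Hypothetical helper, not part of PDFBox: inspects raw PDF bytes for
// data that follows the final %%EOF marker.
final class TrailingDataCheck {
    private static final byte[] EOF_MARKER =
            "%%EOF".getBytes(StandardCharsets.US_ASCII);

    // Returns the offset of the last %%EOF marker, or -1 if there is none.
    static int lastEofIndex(byte[] data) {
        for (int i = data.length - EOF_MARKER.length; i >= 0; i--) {
            boolean match = true;
            for (int j = 0; j < EOF_MARKER.length; j++) {
                if (data[i + j] != EOF_MARKER[j]) {
                    match = false;
                    break;
                }
            }
            if (match) {
                return i;
            }
        }
        return -1;
    }

    // Returns the number of bytes after the last %%EOF marker, ignoring
    // the CR/LF characters that directly follow it; -1 if no marker exists.
    static int trailingLength(byte[] data) {
        int idx = lastEofIndex(data);
        if (idx < 0) {
            return -1;
        }
        int end = idx + EOF_MARKER.length;
        while (end < data.length && (data[end] == '\r' || data[end] == '\n')) {
            end++;
        }
        return data.length - end;
    }

    // Returns a copy of the data cut off right after the last %%EOF marker.
    // If no marker is found, the input is returned unchanged.
    static byte[] truncateAfterEof(byte[] data) {
        int idx = lastEofIndex(data);
        if (idx < 0) {
            return data;
        }
        return Arrays.copyOf(data, idx + EOF_MARKER.length);
    }
}
```

A result greater than zero from trailingLength means there is extra content
behind the final marker, as in the attached file. In that case either use the
force-parsing call above, or wrap the result of truncateAfterEof in a
ByteArrayInputStream and load it as usual.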


> Expected an integer type, actual='Active'
> -----------------------------------------
>
>                 Key: PDFBOX-980
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-980
>             Project: PDFBox
>          Issue Type: Bug
>    Affects Versions: 1.5.0
>            Reporter: Franck Valentin
>         Attachments: prot_dna_long.pdf
>
>
> Hi,
> When I try to parse the file I've attached (prot_dna_long.pdf)
> I get the following exception:
> Caused by: java.io.IOException: Error: Expected an integer type, actual='Active'
>         at org.apache.pdfbox.pdfparser.BaseParser.readInt(BaseParser.java:1384) ~[ebinocle-indexer-1.0-SNAPSHOT-jar-with-dependencies.jar:na]
>         at org.apache.pdfbox.pdfparser.PDFParser.parseObject(PDFParser.java:507) ~[ebinocle-indexer-1.0-SNAPSHOT-jar-with-dependencies.jar:na]
>         at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:180) ~[ebinocle-indexer-1.0-SNAPSHOT-jar-with-dependencies.jar:na]
>         at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:862) ~[ebinocle-indexer-1.0-SNAPSHOT-jar-with-dependencies.jar:na]
>         at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:829) ~[ebinocle-indexer-1.0-SNAPSHOT-jar-with-dependencies.jar:na]

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira
