Some PDFs cannot be processed yet are valid PDFs
------------------------------------------------
Key: PDFBOX-562
URL: https://issues.apache.org/jira/browse/PDFBOX-562
Project: PDFBox
Issue Type: Bug
Components: Parsing
Affects Versions: 0.8.0-incubator, 0.7.3, 0.7.2, 0.7.1, 0.7.0, 1.0.0
Environment: Windows 7, JDK 1.6, Eclipse
Reporter: Matt Hicks
Priority: Critical
The following exception is thrown when attempting to process certain PDFs.
This occurs in the latest code from SVN and causes PDFBox to be useful in any
production processing system since not all PDFs are supported. I have uploaded
an example PDF that reproduces the problem here:
http://captiveimagination.com/download/test02.pdf
This occurs upon invocation of PDFDocument.load(...):
Exception in thread "main" org.apache.pdfbox.exceptions.WrappedIOException
at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:237)
at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:841)
at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:808)
at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:733)
at test.Test.main(Test.java:21)
Caused by: java.lang.ClassCastException: org.apache.pdfbox.cos.COSDictionary
cannot be cast to org.apache.pdfbox.cos.COSStream
at
org.apache.pdfbox.cos.COSDocument.dereferenceObjectStreams(COSDocument.java:456)
at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:207)
... 4 more
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.