[ https://issues.apache.org/jira/browse/PDFBOX-81?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12990032#comment-12990032 ]
Tilman Hausherr commented on PDFBOX-81:
---------------------------------------
I have a similar problem with 1.4.0: some PDF files are JBIG2 encoded, and all I
get when extracting the images are white pages, without even a log error or an
exception. Including levigo-jbig2-imageio-1.1.jar in the lib list doesn't
help. Neither does using the jbig2.jar from the JPedal site, nor copying it
into the lib/ext directory of the JRE.
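One way to narrow this down is to check whether the JVM sees a JBIG2 ImageIO reader at all once the jar is on the classpath. The sketch below uses only the standard javax.imageio API. The class name Jbig2PluginCheck and the format name "JBIG2" are assumptions for illustration; a given plugin may register under a different name. It only shows whether a reader is registered, not whether PDFBox 1.4.0 actually routes JBIG2Decode streams through ImageIO.

```java
import java.util.ArrayList;
import java.util.Iterator;
import java.util.List;
import javax.imageio.ImageIO;
import javax.imageio.ImageReader;

// Hypothetical helper class, not part of PDFBox.
public class Jbig2PluginCheck {

    // Returns the class names of all ImageIO readers registered for formatName.
    static List<String> readersFor(String formatName) {
        // Re-scan the classpath in case a plugin jar was added after
        // ImageIO was first initialised.
        ImageIO.scanForPlugins();
        List<String> names = new ArrayList<>();
        Iterator<ImageReader> it = ImageIO.getImageReadersByFormatName(formatName);
        while (it.hasNext()) {
            names.add(it.next().getClass().getName());
        }
        return names;
    }

    public static void main(String[] args) {
        // "JBIG2" is an assumed format name; adjust it to whatever the
        // plugin in use actually registers.
        List<String> readers = readersFor("JBIG2");
        if (readers.isEmpty()) {
            System.out.println("No JBIG2 ImageIO reader is registered");
        } else {
            System.out.println("JBIG2 ImageIO readers: " + readers);
        }
    }
}
```

If no reader is listed, the jar is not being picked up by the JVM. If one is listed and the pages are still white, the problem more likely lies in how the decoded data is used.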
About the "how should we proceed" question: why not simply document it on the
dependency page?
http://pdfbox.apache.org/dependencies.html
> Exception while extracting images
> ---------------------------------
>
> Key: PDFBOX-81
> URL: https://issues.apache.org/jira/browse/PDFBOX-81
> Project: PDFBox
> Issue Type: New Feature
> Components: PDFReader
> Affects Versions: 0.8.0-incubator
> Priority: Minor
> Attachments: COSName.java.diff, FilterManager.java.diff,
> JBIG2Filter.java, jbig2_src.zip, pdfbox-81.PDXObjectImage.patch
>
>
> [imported from SourceForge]
> http://sourceforge.net/tracker/index.php?group_id=78314&atid=552832&aid=1259747
> Originally submitted by guzzil on 2005-08-15 02:40.
> When trying to extract images from a PDF, I get exceptions
> like:
> Exception in thread "main" java.io.IOException: Unknown stream filter:COSName{JBIG2Decode}
>     at org.pdfbox.filter.FilterManager.getFilter(FilterManager.java:116)
>     at org.pdfbox.cos.COSStream.doDecode(COSStream.java:276)
>     at org.pdfbox.cos.COSStream.doDecode(COSStream.java:240)
>     at org.pdfbox.cos.COSStream.getUnfilteredStream(COSStream.java:173)
>     at org.pdfbox.pdmodel.common.PDStream.createInputStream(PDStream.java:205)
>     at org.pdfbox.pdmodel.common.PDStream.getByteArray(PDStream.java:458)
>     at org.pdfbox.pdmodel.graphics.xobject.PDPixelMap.getRGBImage(PDPixelMap.java:131)
>     at org.pdfbox.pdmodel.graphics.xobject.PDPixelMap.write2OutputStream(PDPixelMap.java:153)
>     at org.pdfbox.pdmodel.graphics.xobject.PDXObjectImage.write2file(PDXObjectImage.java:117)
>     at org.pdfbox.ExtractImages.extractImages(ExtractImages.java:169)
>     at org.pdfbox.ExtractImages.main(ExtractImages.java:73)
>
> The PDFs are scanned images that were afterwards
> optimized with Adobe Acrobat's "optimize" function.
>
> pdfimages from xpdf can extract the images.
>
> I can send you a PDF with this error (it is too big for an
> upload).
> [comment on SourceForge]
> Originally sent by benlitchfield.
> Logged In: YES
> user_id=601708
> Yes, please upload the PDF to ftp.pdfbox.org and I will take a
> look at it.
> Ben Litchfield
--
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira