[jira] Commented: (PDFBOX-81) Excetion while extracting images

Tilman Hausherr (JIRA) Thu, 03 Feb 2011 01:52:56 -0800

    [ 
https://issues.apache.org/jira/browse/PDFBOX-81?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12990032#comment-12990032
 ]


Tilman Hausherr commented on PDFBOX-81:
---------------------------------------

I have a similar problem with 1.4.0, some PDF files are JBIG2 encoded and all I 
get when extracting the images are white pages, not even a log error or an 
exception. Including the levigo-jbig2-imageio-1.1.jar in the lib list doesn't 
help. Same when using the jbig2.jar from the JPedal site. Same when copying it 
in the lib/ext directory of the JRE.

About the "how should we proceed" question - why not simple document it on the 
dependency page?
http://pdfbox.apache.org/dependencies.html

> Excetion while extracting images
> --------------------------------
>
>                 Key: PDFBOX-81
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-81
>             Project: PDFBox
>          Issue Type: New Feature
>          Components: PDFReader
>    Affects Versions: 0.8.0-incubator
>            Priority: Minor
>         Attachments: COSName.java.diff, FilterManager.java.diff, 
> JBIG2Filter.java, jbig2_src.zip, pdfbox-81.PDXObjectImage.patch
>
>
> [imported from SourceForge]
> http://sourceforge.net/tracker/index.php?group_id=78314&atid=552832&aid=1259747
> Originally submitted by guzzil on 2005-08-15 02:40.
> when trying to extract images from I pdf, i get exceptions  
> like  
> Exception in thread "main" java.io.IOException: Unknown  
> stream filter:COSName{JBIG2Decode}  
>         at  
> org.pdfbox.filter.FilterManager.getFilter(FilterManager.java:116)  
>         at  
> org.pdfbox.cos.COSStream.doDecode(COSStream.java:276)  
>         at  
> org.pdfbox.cos.COSStream.doDecode(COSStream.java:240)  
>         at  
> org.pdfbox.cos.COSStream.getUnfilteredStream(COSStream.java:173)  
>         at  
> org.pdfbox.pdmodel.common.PDStream.createInputStream(PDStream.java:205)  
>         at  
> org.pdfbox.pdmodel.common.PDStream.getByteArray(PDStream.java:458)  
>         at  
> org.pdfbox.pdmodel.graphics.xobject.PDPixelMap.getRGBImage(PDPixelMap.java:131)
>   
>         at  
> org.pdfbox.pdmodel.graphics.xobject.PDPixelMap.write2OutputStream(PDPixelMap.java:153)
>   
>         at  
> org.pdfbox.pdmodel.graphics.xobject.PDXObjectImage.write2file(PDXObjectImage.java:117)
>   
>         at  
> org.pdfbox.ExtractImages.extractImages(ExtractImages.java:169)  
>         at  
> org.pdfbox.ExtractImages.main(ExtractImages.java:73)  
>   
> The pdfs are scanned images, which are afterwards  
> optimized with Adobe Acrobats "optimize" function.  
>   
> pdfimages from xpdf can extract the images.  
>  
> I can send you a pdf with this error (it is to big for an 
> upload). 
> [comment on SourceForge]
> Originally sent by benlitchfield.
> Logged In: YES 
> user_id=601708
> yes please upload the pdf to ftp.pdfbox.org and I will take a 
> look at it.
> Ben Litchfield

-- 
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] Commented: (PDFBOX-81) Excetion while extracting images

Reply via email to