[jira] Commented: (PDFBOX-457) PDF to Image doesn't show correctly the document

JIRA Tue, 06 Apr 2010 00:20:00 -0700

    [ https://issues.apache.org/jira/browse/PDFBOX-457?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12853781#action_12853781 ]


Andreas Lehmkühler commented on PDFBOX-457:
-------------------------------------------

The given example 580505.PR00003.000003.PDF uses the CCITTFaxDecode filter as 
its compression algorithm (common for PDFs created by a fax machine). PDFBox has 
no built-in support for that filter, which is why getRGBImage returns null. To 
read such files, the JAI ImageIO library [1] has to be added to the 
classpath.


[1] https://jai-imageio.dev.java.net/
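
A quick way to see whether such a library is actually visible on the classpath is to ask the standard javax.imageio registry which readers it knows about. The sketch below is only an illustration: the class name ImageReaderCheck and the helper hasReader are made up for this example, and probing for a "tiff" reader assumes that the CCITT fax data ends up being decoded through a TIFF-capable ImageIO plugin, which the comment above does not spell out. Newer JDKs may also ship a TIFF reader of their own, so a positive result does not prove that JAI ImageIO specifically is installed.

```java
import java.util.Iterator;
import javax.imageio.ImageIO;
import javax.imageio.ImageReader;

// Hypothetical helper, not part of PDFBox.
final class ImageReaderCheck {

    // Returns true if at least one ImageIO reader is registered for the format name.
    static boolean hasReader(String formatName) {
        Iterator<ImageReader> readers = ImageIO.getImageReadersByFormatName(formatName);
        return readers.hasNext();
    }

    public static void main(String[] args) {
        // Re-scan the classpath so plugins added as extra jars are registered.
        ImageIO.scanForPlugins();
        // Assumption: a TIFF-capable reader is what the fax-encoded images need.
        System.out.println("TIFF reader available: " + hasReader("tiff"));
    }
}
```

If this reports false while the extra jars are supposed to be present, the classpath passed to the JVM is the first thing to check.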

> PDF to Image doesn't show correctly the document
> ------------------------------------------------
>
>                 Key: PDFBOX-457
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-457
>             Project: PDFBox
>          Issue Type: Bug
>    Affects Versions: 0.8.0-incubator
>            Reporter: Marcelo Tavares
>            Assignee: Daniel Wilson
>         Attachments: 580505.PR00003.000003.PDF, pdfbox-457-as_fax.pdf, 
> pdfbox-457-Scan_from_a_Xerox_WorkCentre_Pro.PDF, pdfbox-457.PNG, 
> testPDFToImage1.png
>
>
> I tried to convert the following document to an image, but I got the attached 
> result. 
> It parsed just the text. I also tried different formats like JPG. I ran it 
> using the PDFToImage class, passing the document path as a parameter. 
> I've read that sometimes a document is not created in accordance with the PDF 
> standard. But is there a possibility to ignore that? In fact, it's very 
> important to me, so could I use PDFBox despite those "errors"? 
> Thank you
> Marcelo

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
