[jira] [Commented] (PDFBOX-457) PDF to Image doesn't show correctly the document

John Hewson (JIRA) Sat, 29 Mar 2014 10:59:27 -0700

    [ 
https://issues.apache.org/jira/browse/PDFBOX-457?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13954365#comment-13954365
 ]


John Hewson commented on PDFBOX-457:
------------------------------------

Tilman, I presume you're talking about "580505.PR00003.000003.PDF", I don't see 
any problems with the CCITT or Flate filters, the /Filters array gives them in 
the order which they should be decoded, which is correct, so the /Length 
matches the Flate filter, which is also correct.

{code}
I also traced through the filter and noticed that it had a wrong length. The 
stream is encoded twice, and the ccitt filter comes second, but gets the 
"/Length". This makes no sense and no other filter does this, so I deleted it 
in rev 1582785 for the trunk and rev 1582786 for the 1.8 branch.
{code}

Not sure about this.

> PDF to Image doesn't show correctly the document
> ------------------------------------------------
>
>                 Key: PDFBOX-457
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-457
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Rendering
>    Affects Versions: 0.8.0-incubator
>            Reporter: Marcelo Tavares
>            Assignee: Daniel Wilson
>              Labels: CCITTFaxDecode, TIFF, ccitt
>         Attachments: 580505.PR00003.000003.PDF, 
> pdfbox-457-Scan_from_a_Xerox_WorkCentre_Pro.PDF, pdfbox-457-as_fax.pdf, 
> pdfbox-457.PNG, testPDFToImage1.png
>
>
> I tried to convert the following document to image, but I got the attached 
> result. 
> It parsed just the text. I also tried different formats like JPG.  I ran it 
> using the PDFToImage class passing the document path as parameter. 
> I've read that sometimes the document is not created respecting the PDF 
> standard. But, is there a possibility to ignore it?! In fact, it's very 
> important to me, so, could I use PDF Box despite of those "errors"? 
> Thank you
> Marcelo



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Commented] (PDFBOX-457) PDF to Image doesn't show correctly the document

Reply via email to