[ https://issues.apache.org/jira/browse/PDFBOX-457?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13954398#comment-13954398 ]
John Hewson commented on PDFBOX-457:
------------------------------------
OK, great. I wasn't sure whether you were saying that the PDF file was corrupt
or that PDFBox had a bug. As you say, the /Length can be inferred from the
stream data, so it looks like there is still a bug in our CCITT decoder?
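
A minimal sketch of what "inferring /Length from the stream data" can mean,
assuming the stream data is delimited by the `stream` and `endstream` keywords.
This is not PDFBox's parser: the class StreamLengthSketch and its methods are
hypothetical names used only for illustration. The scan is a heuristic, so
binary data that happens to contain the bytes "endstream", or whose last real
byte is a CR or LF, would be measured wrongly.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical helper, not part of PDFBox.
public class StreamLengthSketch {

    private static final byte[] STREAM =
            "stream".getBytes(StandardCharsets.ISO_8859_1);
    private static final byte[] ENDSTREAM =
            "endstream".getBytes(StandardCharsets.ISO_8859_1);

    /** Index of the first occurrence of needle at or after from, or -1. */
    static int indexOf(byte[] data, byte[] needle, int from) {
        for (int i = Math.max(from, 0); i <= data.length - needle.length; i++) {
            int j = 0;
            while (j < needle.length && data[i + j] == needle[j]) {
                j++;
            }
            if (j == needle.length) {
                return i;
            }
        }
        return -1;
    }

    /**
     * Infers the number of stream data bytes by scanning from the "stream"
     * keyword to the next "endstream" keyword. Returns -1 if no complete
     * stream is found at or after the given offset.
     */
    static int inferLength(byte[] data, int from) {
        int keyword = indexOf(data, STREAM, from);
        if (keyword < 0) {
            return -1;
        }
        int start = keyword + STREAM.length;
        // The keyword is followed by CRLF or LF before the data begins.
        if (start < data.length && data[start] == '\r') {
            start++;
        }
        if (start < data.length && data[start] == '\n') {
            start++;
        }
        int end = indexOf(data, ENDSTREAM, start);
        if (end < 0) {
            return -1;
        }
        // An end-of-line marker right before "endstream" is not stream data.
        if (end > start && data[end - 1] == '\n') {
            end--;
        }
        if (end > start && data[end - 1] == '\r') {
            end--;
        }
        return end - start;
    }
}
```

A caller could compare the inferred value with the /Length entry in the stream
dictionary and fall back to the inferred one when the two disagree.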
> PDF to Image doesn't show correctly the document
> ------------------------------------------------
>
> Key: PDFBOX-457
> URL: https://issues.apache.org/jira/browse/PDFBOX-457
> Project: PDFBox
> Issue Type: Bug
> Components: Rendering
> Affects Versions: 0.8.0-incubator
> Reporter: Marcelo Tavares
> Assignee: Daniel Wilson
> Labels: CCITTFaxDecode, TIFF, ccitt
> Attachments: 580505.PR00003.000003.PDF,
> pdfbox-457-Scan_from_a_Xerox_WorkCentre_Pro.PDF, pdfbox-457-as_fax.pdf,
> pdfbox-457.PNG, testPDFToImage1.png
>
>
> I tried to convert the following document to an image, but I got the attached
> result.
> It parsed only the text. I also tried other formats, such as JPG. I ran it
> using the PDFToImage class, passing the document path as a parameter.
> I've read that documents are sometimes created without respecting the PDF
> standard. Is it possible to ignore such violations? This is very important
> to me, so could I use PDFBox despite those "errors"?
> Thank you
> Marcelo