[
https://issues.apache.org/jira/browse/PDFBOX-958?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Andreas Lehmkühler closed PDFBOX-958.
-------------------------------------
Resolution: Fixed
Reopened to replace a missing attachment
> convertToImage mangles images which were in the PDF
> ---------------------------------------------------
>
> Key: PDFBOX-958
> URL: https://issues.apache.org/jira/browse/PDFBOX-958
> Project: PDFBox
> Issue Type: Bug
> Affects Versions: 1.2.1, 1.4.0, 1.5.0
> Environment: RHEL5 and WinXP, java version "1.6.0_23"
> Reporter: Eric Schwarzenbach
> Assignee: Andreas Lehmkühler
> Priority: Critical
> Fix For: 1.6.0
>
> Attachments: Image of Page 13.jpeg, Image of Page 13.png,
> PDFBOX958-WrycanLoremIpsumTest.pdf
>
>
> Of the PDFs we've tried running through PDFBox and generating page images, a
> number of them (coming from disparate sources and method of creation) seem to
> produce images where an image that was embedded in the page of the PDF shows
> somewhat mangled. It seems to be divided by horizontal stripes, where some
> stripes look normal, others seem to have some kind of "smearing" effect going
> on. See attached images and original PDF (image is of page 13).
> I marked this as critical as we are trying to use PDFBox in a project where
> page images are crucial, and inability to produce reasonable looking page
> images is pretty much a deal breaker.
> The code we use to extract the images looks more or less like the following:
> BufferedImage image =
> page.convertToImage();
>
> SmartDeferredFileOutputStream outStream
> = new SmartDeferredFileOutputStream();
> String[] writerFormatNames =
> ImageIO.getWriterFormatNames();
> ImageIO.write(image, "jpeg", outStream);
> outStream.close()
> We've also tried specifying "png". In both "jpg" and "png" cases we get an
> image file that is indeed the correct format, and both images look exactly
> the same.
--
This message was sent by Atlassian JIRA
(v6.2#6252)