[jira] [Commented] (PDFBOX-2195) Missing text when converting PDF to image

Maruan Sahyoun (JIRA) Tue, 08 Jul 2014 14:00:37 -0700

    [ 
https://issues.apache.org/jira/browse/PDFBOX-2195?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14055519#comment-14055519
 ]


Maruan Sahyoun commented on PDFBOX-2195:
----------------------------------------

it might be better to import the PDF page as an PDFormXObject and scale that 
down leaving space for the OMR marks. Add these and convert the resulting PDF 
to TIFF. Take a look at 
org.apache.pdfbox.util.LayerUtility.java#importPageAsForm for a sample.

> Missing text when converting PDF to image
> -----------------------------------------
>
>                 Key: PDFBOX-2195
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-2195
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Rendering
>    Affects Versions: 2.0.0
>         Environment: Win8.1 (JRE 1.7)
>            Reporter: A.D. Kent
>         Attachments: Claim AA011332 Diagram and Estimates.pdf, Claim AA011332 
> Diagram and Estimates.tif, Claim AA011332 Diagram and Estimates_p10.pdf, 
> Claim AA011332 Diagram and Estimates_p10_jai.tif
>
>
> Attempting to convert a PDF to image using latest 2.0.0 from SVN.  PDF 
> utilizes Tahoma, Tahoma,Bold, and Tahoma,Italic (non-embedded).  Upon calling 
> PDFRenderer.renderImageWithDPI, I get the following output:
> Jul 08, 2014 9:50:01 AM org.apache.fontbox.util.SystemFontManager 
> findTTFontname
> WARNING: Font not found: Tahoma,Italic
> Resultant image is missing text where Tahoma,Italic is used.  Have also 
> reverted to 1.8.6 and used PDPage.convertToImage with same results.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Commented] (PDFBOX-2195) Missing text when converting PDF to image

Reply via email to