[
https://issues.apache.org/jira/browse/PDFBOX-2195?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14055519#comment-14055519
]
Maruan Sahyoun commented on PDFBOX-2195:
----------------------------------------
it might be better to import the PDF page as an PDFormXObject and scale that
down leaving space for the OMR marks. Add these and convert the resulting PDF
to TIFF. Take a look at
org.apache.pdfbox.util.LayerUtility.java#importPageAsForm for a sample.
> Missing text when converting PDF to image
> -----------------------------------------
>
> Key: PDFBOX-2195
> URL: https://issues.apache.org/jira/browse/PDFBOX-2195
> Project: PDFBox
> Issue Type: Bug
> Components: Rendering
> Affects Versions: 2.0.0
> Environment: Win8.1 (JRE 1.7)
> Reporter: A.D. Kent
> Attachments: Claim AA011332 Diagram and Estimates.pdf, Claim AA011332
> Diagram and Estimates.tif, Claim AA011332 Diagram and Estimates_p10.pdf,
> Claim AA011332 Diagram and Estimates_p10_jai.tif
>
>
> Attempting to convert a PDF to image using latest 2.0.0 from SVN. PDF
> utilizes Tahoma, Tahoma,Bold, and Tahoma,Italic (non-embedded). Upon calling
> PDFRenderer.renderImageWithDPI, I get the following output:
> Jul 08, 2014 9:50:01 AM org.apache.fontbox.util.SystemFontManager
> findTTFontname
> WARNING: Font not found: Tahoma,Italic
> Resultant image is missing text where Tahoma,Italic is used. Have also
> reverted to 1.8.6 and used PDPage.convertToImage with same results.
--
This message was sent by Atlassian JIRA
(v6.2#6252)