[ https://issues.apache.org/jira/browse/PDFBOX-1038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13049239#comment-13049239 ]
Funfel edited comment on PDFBOX-1038 at 6/14/11 3:35 PM: --------------------------------------------------------- I've attached the original pdf (one page only) and generated html was (Author: funfel): I've atached the originale pdf (one page only) and generated html > Strange signs after pdftohtml parsing. > -------------------------------------- > > Key: PDFBOX-1038 > URL: https://issues.apache.org/jira/browse/PDFBOX-1038 > Project: PDFBox > Issue Type: Bug > Components: Text extraction > Affects Versions: 1.5.0 > Environment: windows vista > Reporter: Funfel > Attachments: pg0007.html, pg0007.pdf > > > After parsing pdf to html I've got a strange signs which supposed to be nice > letter (not chinese or japanese). I've noticed that font description for them > is UniversPro-Roman-Identity-H. > How can get it generated properly? -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira