[ https://issues.apache.org/jira/browse/PDFBOX-1216?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Andreas Lehmkühler updated PDFBOX-1216:
---------------------------------------

    Attachment: PDFBOX1216-a1.png

> Arabic / Farsi (Persian) text appear disconnected when PDF is converted to
> image
> --------------------------------------------------------------------------------
>
>                 Key: PDFBOX-1216
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-1216
>             Project: PDFBox
>          Issue Type: Bug
>    Affects Versions: 1.6.0
>            Reporter: Hamed Iravanchi
>            Assignee: Andreas Lehmkühler
>         Attachments: PDFBOX1216-a1.png, a.pdf
>
>
> When a PDF file contains Arabic / Farsi text, the letters appear disconnected
> when pages are converted to images.
> Arabic / Farsi letters are connected to each other when written.
> Additionally, the error message "Changing font on <?> from <B Lotus> to the
> default font" appears on the console.
> From my debugging, the cause appears to be that PDFBox looks in the embedded
> font for the "isolated" variation of each character, whereas the embedded
> font only includes the "connected" variation.
> If the embedded font contains the isolated form too, the embedded font is
> used (the warning message doesn't appear for that character), but the
> character is displayed in the incorrect variation (i.e. isolated instead of
> connected).
> This happens in both the 1.6.0 release and the latest trunk code (as of
> today). I didn't test previous versions.
> The difference is that in 1.6.0, the default font (which is substituted as
> mentioned above) contains the Arabic / Persian characters, but in the trunk,
> the replaced characters are displayed as squares.
> I will attach a PDF as an input for reproducing the issue.
> Note: this might be related to issue PDFBOX-1127, but that one concerns text
> extraction.

--
This message is automatically generated by JIRA.
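
For readers unfamiliar with the "isolated" versus "connected" variations mentioned in the report, the following is a minimal sketch that uses only the Java standard library. It is not PDFBox code and does not reproduce the bug. The class name PresentationForms and its methods are hypothetical, chosen for illustration. It assumes the variations in question correspond to the Unicode Arabic Presentation Forms blocks, where one base letter (here BEH, U+0628) has separate isolated, final, initial, and medial code points. Compatibility normalization (NFKC) folds all four back to the base letter, which shows how a lookup keyed on one form can miss a font that only carries the others.

```java
import java.text.Normalizer;

public class PresentationForms {

    // Folds Arabic presentation-form characters back to their base letters
    // using Unicode compatibility normalization (NFKC).
    public static String baseLetter(String s) {
        return Normalizer.normalize(s, Normalizer.Form.NFKC);
    }

    // True if the character lives in one of the Arabic Presentation Forms
    // blocks, i.e. it encodes a specific positional shape of a letter.
    public static boolean isPresentationForm(char c) {
        Character.UnicodeBlock block = Character.UnicodeBlock.of(c);
        return block == Character.UnicodeBlock.ARABIC_PRESENTATION_FORMS_A
            || block == Character.UnicodeBlock.ARABIC_PRESENTATION_FORMS_B;
    }

    public static void main(String[] args) {
        // Isolated, final, initial, and medial forms of the letter BEH.
        char[] behForms = {'\uFE8F', '\uFE90', '\uFE91', '\uFE92'};
        for (char c : behForms) {
            String base = baseLetter(String.valueOf(c));
            // Print the code point, its Unicode name, and the base letter it folds to.
            System.out.printf("U+%04X (%s) -> U+%04X%n",
                    (int) c, Character.getName(c), (int) base.charAt(0));
        }
    }
}
```

A renderer that is handed only the base letter, or only the isolated form, has to choose the correct positional shape itself. Whether PDFBox should do this through the font's encoding or through a shaping step is a design question the report leaves open.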