[ 
https://issues.apache.org/jira/browse/PDFBOX-2076?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hong-Thai Nguyen updated PDFBOX-2076:
-------------------------------------

    Attachment: page0010.html

> Arabic not well converted on PDF document
> -----------------------------------------
>
>                 Key: PDFBOX-2076
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-2076
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Text extraction
>    Affects Versions: 1.8.5
>            Reporter: Hong-Thai Nguyen
>         Attachments: page0010.html, page0010.pdf
>
>
> I'm using PDFBox 1.8.5 to convert this pdf file to text. Seem that arabic 
> content is not well converted.
> Here's option:
> {code}
> -html
> E:\page0010.pdf.pdf
> E:\page0010.html
> {code}



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to