[ https://issues.apache.org/jira/browse/PDFBOX-664?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
John Hewson updated PDFBOX-664:
-------------------------------
Component/s: Text extraction
             (was: FontBox)
> Incorrect rendering
> -------------------
>
> Key: PDFBOX-664
> URL: https://issues.apache.org/jira/browse/PDFBOX-664
> Project: PDFBox
> Issue Type: Bug
> Components: Text extraction
> Affects Versions: 1.1.0
> Reporter: Villu Ruusmann
> Attachments: frontpage.png
>
>
> Peter Zavadsky reported on the PDFBox users' mailing list that text
> extraction gives unsatisfactory results for the following Slovak-language
> PDF document:
> http://www.justice.gov.sk/kop/ovest/ov10/03/050/OV050A.pdf
> While I'm not expert enough to say anything about text extraction, I can
> clearly see numerous rendering problems. Please take a look at the image
> attachment frontpage.png.
> Quite obviously, the Slovak text makes use of custom character encoding
> schemes.
--
This message was sent by Atlassian JIRA
(v6.1.5#6160)