[ 
https://issues.apache.org/jira/browse/PDFBOX-664?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

John Hewson updated PDFBOX-664:
-------------------------------

    Component/s:     (was: FontBox)
                 Text extraction

> Incorrect rendering
> -------------------
>
>                 Key: PDFBOX-664
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-664
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Text extraction
>    Affects Versions: 1.1.0
>            Reporter: Villu Ruusmann
>         Attachments: frontpage.png
>
>
> Peter Zavadsky reported unsatisfactory results on the PDFBox users' mailing
> list when trying to extract text from the following Slovak-language PDF
> document:
> http://www.justice.gov.sk/kop/ovest/ov10/03/050/OV050A.pdf
> While I'm not expert enough to say anything about text extraction, I can
> clearly see numerous rendering problems. Please take a look at the image
> attachment frontpage.png.
> Quite obviously, the Slovak text makes use of custom character encoding
> schemes.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)
