[ 
https://issues.apache.org/jira/browse/PDFBOX-664?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tilman Hausherr resolved PDFBOX-664.
------------------------------------

       Resolution: Fixed
    Fix Version/s: 2.0.0

The attached file is from a different URL of the same website:
http://www.justice.gov.sk/ovest/ov11/050/ov050a.pdf

It renders fine with the unreleased 2.0 version, and text extraction is fine 
too.

> Incorrect rendering of Slovak language PDF
> ------------------------------------------
>
>                 Key: PDFBOX-664
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-664
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Rendering, Text extraction
>    Affects Versions: 1.1.0
>            Reporter: Villu Ruusmann
>             Fix For: 2.0.0
>
>         Attachments: frontpage.png, ov050a.pdf-1.png
>
>
> Peter Zavadsky reported to PDFBox users' mailing list about unsatisfiable 
> results when trying to perform text extraction from the following Slovak 
> language PDF document:
> http://www.justice.gov.sk/kop/ovest/ov10/03/050/OV050A.pdf
> While I'm not expert enough to say anything about text extraction, I clearly 
> see numerous rendering problems. Please take a look at the image attachment 
> frontpage.png
> Quite obviously, Slovak language makes use of custom character encoding 
> schemes.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

Reply via email to