[ 
https://issues.apache.org/jira/browse/PDFBOX-1793?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler closed PDFBOX-1793.
--------------------------------------

    Resolution: Duplicate
      Assignee: Andreas Lehmkühler

Closed as it duplicates PDFBOX-1783

> Failure to extract custom encoded text
> --------------------------------------
>
>                 Key: PDFBOX-1793
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-1793
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Text extraction
>    Affects Versions: 1.8.3
>            Reporter: Tim Allison
>            Assignee: Andreas Lehmkühler
>            Priority: Minor
>         Attachments: gaat fout.pdf, gaat fout.txt
>
>
> PDFBox extracts a binary garble from this file.  Adobe Reader does the same.  
> Linux's pdftotext extracts text fairly well.  I suspect there's a custom 
> font/encoding node that isn't being processed, but I could be wrong.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

Reply via email to