[ 
https://issues.apache.org/jira/browse/PDFBOX-3066?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14985014#comment-14985014
 ] 

Maruan Sahyoun commented on PDFBOX-3066:
----------------------------------------

Added note:

The PDF 2.0 spec has this

{quote}
If the font is a simple font and the glyph selection algorithm (see 9.6.5, 
"Character encoding") uses a glyph name, that name can be looked up in the 
Adobe Glyph List to obtain the corresponding Unicode value.
{quote}

Which would give the same result we have as well as Acrobats Save As

> Text getting garbled in this file, was Ok in 1.8
> ------------------------------------------------
>
>                 Key: PDFBOX-3066
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-3066
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Text extraction
>    Affects Versions: 2.0.0
>            Reporter: Joel Hirsh
>             Fix For: 2.1.0
>
>         Attachments: PDFBOX-3066-reduced.pdf, garbled.pdf
>
>
> Attached file, PrintTextLocations shows text garbled, like *,%-))’)) 
> Acrobat copy/paste shows accurate text, and was also fine in 1.8.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to