[ 
https://issues.apache.org/jira/browse/PDFBOX-3066?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14985009#comment-14985009
 ] 

Maruan Sahyoun commented on PDFBOX-3066:
----------------------------------------

AFAIK Save As Text follows the definitions in the spec where Cut & Paste 
doesn't. It's more evidence based as there were already other issues where the 
text extraction using Save As Text was more inline with our results than Cut & 
Paste (I'll look up these later). In addition I try to get an opinion from 
other sources on that topic.

> Text getting garbled in this file, was Ok in 1.8
> ------------------------------------------------
>
>                 Key: PDFBOX-3066
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-3066
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Text extraction
>    Affects Versions: 2.0.0
>            Reporter: Joel Hirsh
>             Fix For: 2.1.0
>
>         Attachments: PDFBOX-3066-reduced.pdf, garbled.pdf
>
>
> Attached file, PrintTextLocations shows text garbled, like *,%-))’)) 
> Acrobat copy/paste shows accurate text, and was also fine in 1.8.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to