[
https://issues.apache.org/jira/browse/PDFBOX-3066?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14985009#comment-14985009
]
Maruan Sahyoun commented on PDFBOX-3066:
----------------------------------------
AFAIK Save As Text follows the definitions in the spec where Cut & Paste
doesn't. It's more evidence based as there were already other issues where the
text extraction using Save As Text was more inline with our results than Cut &
Paste (I'll look up these later). In addition I try to get an opinion from
other sources on that topic.
> Text getting garbled in this file, was Ok in 1.8
> ------------------------------------------------
>
> Key: PDFBOX-3066
> URL: https://issues.apache.org/jira/browse/PDFBOX-3066
> Project: PDFBox
> Issue Type: Bug
> Components: Text extraction
> Affects Versions: 2.0.0
> Reporter: Joel Hirsh
> Fix For: 2.1.0
>
> Attachments: PDFBOX-3066-reduced.pdf, garbled.pdf
>
>
> Attached file, PrintTextLocations shows text garbled, like *,%-))’))
> Acrobat copy/paste shows accurate text, and was also fine in 1.8.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]