[
https://issues.apache.org/jira/browse/PDFBOX-3067?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14980805#comment-14980805
]
Tilman Hausherr commented on PDFBOX-3067:
-----------------------------------------
Here's what I get with the current version:
{code}
String[363.94,491.34 fs=-12.0 xscale=7.44 height=6.3265433 space=4.464
width=4.4639893]-
String[368.404,491.34 fs=-12.0 xscale=7.44 height=6.3265433 space=4.464
width=4.4639893]2
String[372.86798,491.34 fs=-12.0 xscale=7.44 height=6.3265433 space=4.464
width=4.4639893]0
String[377.33197,491.34 fs=-12.0 xscale=7.44 height=6.3265433 space=4.464
width=4.4639893]0
String[381.79596,491.34 fs=-12.0 xscale=7.44 height=6.3265433 space=4.464
width=4.4639893],
String[386.25995,491.34 fs=-12.0 xscale=7.44 height=6.3265433 space=4.464
width=4.4639893]0
String[390.72394,491.34 fs=-12.0 xscale=7.44 height=6.3265433 space=4.464
width=4.4639893]0
String[395.18793,491.34 fs=-12.0 xscale=7.44 height=6.3265433 space=4.464
width=4.4639893]0
String[399.65192,491.34 fs=-12.0 xscale=7.44 height=6.3265433 space=4.464
width=4.4639893].
String[404.1159,491.34 fs=-12.0 xscale=7.44 height=6.3265433 space=4.464
width=4.4639893]0
String[408.5799,491.34 fs=-12.0 xscale=7.44 height=6.3265433 space=4.464
width=4.4639893]0
String[363.94,522.33997 fs=-12.0 xscale=7.44 height=6.3265433 space=4.464
width=4.4639893]-
String[368.404,522.33997 fs=-12.0 xscale=7.44 height=6.3265433 space=4.464
width=4.4639893]2
String[372.86798,522.33997 fs=-12.0 xscale=7.44 height=6.3265433 space=4.464
width=4.4639893]0
String[377.33197,522.33997 fs=-12.0 xscale=7.44 height=6.3265433 space=4.464
width=4.4639893]0
String[381.79596,522.33997 fs=-12.0 xscale=7.44 height=6.3265433 space=4.464
width=4.4639893],
String[386.25995,522.33997 fs=-12.0 xscale=7.44 height=6.3265433 space=4.464
width=4.4639893]0
String[390.72394,522.33997 fs=-12.0 xscale=7.44 height=6.3265433 space=4.464
width=4.4639893]0
String[395.18793,522.33997 fs=-12.0 xscale=7.44 height=6.3265433 space=4.464
width=4.4639893]0
String[399.65192,522.33997 fs=-12.0 xscale=7.44 height=6.3265433 space=4.464
width=4.4639893].
String[404.1159,522.33997 fs=-12.0 xscale=7.44 height=6.3265433 space=4.464
width=4.4639893]0
String[408.5799,522.33997 fs=-12.0 xscale=7.44 height=6.3265433 space=4.464
width=4.4639893]0
{code}
And I also downloaded from the link you meant, and did get "-200,000.00" with
ExtractText. So unless I'm terribly mistaken somewhere, I suspect you've mixed
up your libraries...
> Text strings being returned as single characters, regression from version 1.8
> -----------------------------------------------------------------------------
>
> Key: PDFBOX-3067
> URL: https://issues.apache.org/jira/browse/PDFBOX-3067
> Project: PDFBox
> Issue Type: Bug
> Components: Text extraction
> Affects Versions: 2.0.0
> Reporter: Joel Hirsh
> Assignee: Tilman Hausherr
> Labels: regression
> Fix For: 2.0.0
>
> Attachments: singlecharacters.pdf
>
>
> PrintTextLocations writestring() is returning individual characters on this
> file, rather than a complete string. Was returning strings with '-200,000'
> in version 1.8
> Also note that textposition.getWidthOfSpace() is getting a negative value
> (-4.464) for each character. Don't know if that is symptom or a cause.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]