[
https://issues.apache.org/jira/browse/PDFBOX-3067?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14980730#comment-14980730
]
Tilman Hausherr commented on PDFBOX-3067:
-----------------------------------------
I got this with ExtractText for the file singlecharacters.pdf:
{code}
-200,000.00
-200,000.00
{code}
Is that what you meant?
> Text strings being returned as single characters, regression from version 1.8
> -----------------------------------------------------------------------------
>
> Key: PDFBOX-3067
> URL: https://issues.apache.org/jira/browse/PDFBOX-3067
> Project: PDFBox
> Issue Type: Bug
> Components: Text extraction
> Affects Versions: 2.0.0
> Reporter: Joel Hirsh
> Assignee: Tilman Hausherr
> Labels: regression
> Fix For: 2.0.0
>
> Attachments: singlecharacters.pdf
>
>
> PrintTextLocations writestring() is returning individual characters for this
> file rather than complete strings. In version 1.8 it returned strings such as
> '-200,000'.
> Also note that textposition.getWidthOfSpace() returns a negative value
> (-4.464) for each character. I don't know whether that is a symptom or a cause.
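
The following is a minimal, self-contained sketch of why a negative width-of-space value could plausibly be a cause rather than only a symptom. It is not PDFBox's actual grouping logic. The `Glyph` class, the `group` and `layout` helpers, the fixed glyph width of 5 units, and the "gap larger than half a space width" threshold are all hypothetical and chosen only for illustration. Under those assumptions, any separation rule that compares the gap between glyphs against a fraction of the space width will split on every glyph once that width is negative, because even a zero gap exceeds a negative threshold.

```java
import java.util.ArrayList;
import java.util.List;

public class GlyphGrouping {

    /** Hypothetical stand-in for a positioned glyph; this is NOT PDFBox's TextPosition. */
    static final class Glyph {
        final String text;
        final float x;            // left edge of the glyph
        final float width;        // advance width of the glyph
        final float widthOfSpace; // reported width of a space in the glyph's font

        Glyph(String text, float x, float width, float widthOfSpace) {
            this.text = text;
            this.x = x;
            this.width = width;
            this.widthOfSpace = widthOfSpace;
        }
    }

    /**
     * Joins glyphs into strings, starting a new string whenever the gap to the
     * previous glyph exceeds half the reported space width (an assumed threshold).
     */
    static List<String> group(List<Glyph> glyphs) {
        List<String> out = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        Glyph prev = null;
        for (Glyph g : glyphs) {
            if (prev != null) {
                float gap = g.x - (prev.x + prev.width);
                // With a negative widthOfSpace, a gap of 0 already passes this test.
                if (gap > 0.5f * g.widthOfSpace) {
                    out.add(current.toString());
                    current.setLength(0);
                }
            }
            current.append(g.text);
            prev = g;
        }
        if (current.length() > 0) {
            out.add(current.toString());
        }
        return out;
    }

    /** Lays out each character of s side by side with no gap, 5 units wide (assumed). */
    static List<Glyph> layout(String s, float widthOfSpace) {
        List<Glyph> glyphs = new ArrayList<>();
        float x = 0f;
        for (char c : s.toCharArray()) {
            glyphs.add(new Glyph(String.valueOf(c), x, 5f, widthOfSpace));
            x += 5f;
        }
        return glyphs;
    }

    public static void main(String[] args) {
        // Positive space width: adjacent glyphs stay together in one string.
        System.out.println(group(layout("-200,000", 4.464f)));
        // Negative space width: every glyph becomes its own string.
        System.out.println(group(layout("-200,000", -4.464f)));
    }
}
```

If the real text stripper applies a comparison of this general shape, taking the absolute value of the space width or correcting its sign at the source would restore whole strings. Whether that is what happens for singlecharacters.pdf would need to be confirmed against the PDFBox code.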