[ https://issues.apache.org/jira/browse/PDFBOX-3195?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Tilman Hausherr updated PDFBOX-3195:
------------------------------------
Attachment: PDFBOX-3195-reduced.txt
PDFBOX-3195-reduced-marked-1.png
PDFBOX-3195-reduced.pdf
Here's a reduced file that isolates the problem. By the way, the "space" was
not an ordinary space but hex A0, a non-breaking space.
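
To illustrate why a hex A0 character is easy to mistake for a plain space, here is a minimal sketch in plain Java. It does not use PDFBox, and the class and method names (NbspDemo, stripWithNbsp, isBlank) are hypothetical helpers invented for this example. It only shows how the JDK classifies U+00A0, and is not a proposed fix for the issue.

```java
// Hypothetical helper for illustration only; not part of PDFBox.
final class NbspDemo {
    // U+00A0, the non-breaking space mentioned above (hex A0).
    static final char NBSP = '\u00A0';

    // True for ordinary whitespace and for Unicode space separators such as U+00A0.
    static boolean isBlank(char c) {
        return Character.isWhitespace(c) || Character.isSpaceChar(c);
    }

    // Removes leading and trailing characters for which isBlank() is true.
    static String stripWithNbsp(String text) {
        int start = 0;
        int end = text.length();
        while (start < end && isBlank(text.charAt(start))) {
            start++;
        }
        while (end > start && isBlank(text.charAt(end - 1))) {
            end--;
        }
        return text.substring(start, end);
    }

    public static void main(String[] args) {
        String line = NBSP + "Section 1";

        // Java does not treat U+00A0 as whitespace, so trim() leaves it in place.
        System.out.println(Character.isWhitespace(NBSP)); // false
        System.out.println(Character.isSpaceChar(NBSP));  // true
        System.out.println(line.trim().equals(line));     // true

        // The helper removes it because it also checks isSpaceChar().
        System.out.println(stripWithNbsp(line).equals("Section 1")); // true
    }
}
```

A caller that wants to ignore such characters when comparing extracted text could pass each line through stripWithNbsp first. Whether the extractor itself should emit the character is a separate question for this issue.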
> ExtractText add space at start of text
> --------------------------------------
>
> Key: PDFBOX-3195
> URL: https://issues.apache.org/jira/browse/PDFBOX-3195
> Project: PDFBox
> Issue Type: Bug
> Components: Text extraction
> Affects Versions: 2.0.0
> Reporter: simon steiner
> Attachments: Insurance_Form.pdf, PDFBOX-3195-reduced-marked-1.png,
> PDFBOX-3195-reduced.pdf, PDFBOX-3195-reduced.txt
>
>
> java -jar ~/pdf-box-svn/app/target/pdfbox-app-2.0.0-SNAPSHOT.jar ExtractText
> Insurance_Form.pdf
> Output has extra space " Section 1 – Owner Details:"