[ 
https://issues.apache.org/jira/browse/PDFBOX-4480?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tilman Hausherr resolved PDFBOX-4480.
-------------------------------------
       Resolution: Fixed
         Assignee: Tilman Hausherr
    Fix Version/s: 3.0.0 PDFBox
                   2.0.15

> Problem extracting text in newline characters and spaces between words
> ----------------------------------------------------------------------
>
>                 Key: PDFBOX-4480
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-4480
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Text extraction
>    Affects Versions: 2.0.13
>         Environment: macOs
>            Reporter: ANIL SANGHANI
>            Assignee: Tilman Hausherr
>            Priority: Major
>              Labels: textextraction
>             Fix For: 2.0.15, 3.0.0 PDFBox
>
>         Attachments: Document.txt, Narasimhan S.pdf, 
> PDFBOX-4480-huge-CapHeight.pdf.txt
>
>
>  
> I have a PDF file. When I try to extract its text, some line breaks between
> lines are ignored, so the last word of a line and the first word of the next
> line appear as one word with no space between them.
> For example, in the attached PDF:
> "main Bsk" is extracted as "mainBsk"
> "narasimhan1...@gmail.com Bangalore" is extracted as
> "narasimhan1989@gmail.comBangalore"



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

