ANIL SANGHANI created PDFBOX-4480:
-------------------------------------

             Summary: Problem extracting text in newline characters and spaces 
beetween words
                 Key: PDFBOX-4480
                 URL: https://issues.apache.org/jira/browse/PDFBOX-4480
             Project: PDFBox
          Issue Type: Bug
          Components: Text extraction
    Affects Versions: 2.0.13
         Environment: macOs
            Reporter: ANIL SANGHANI
             Fix For: 2.0.13
         Attachments: Document.txt, Narasimhan S.pdf

 

I have a PDF file , when I try to extract its text using

It ignores some Enter characters between lines, so the last word in the line 
and the first word in the next line appear as 1 word without spaces between 
them !!

For Example, In Attached Pdf

main Bsk as mainBsk

[[email protected] Bangalore|mailto:[email protected]] 
as [email protected]



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to