Niyati wawre created PDFBOX-4232: ------------------------------------ Summary: Spaces getting added in between a word in scanned documents Key: PDFBOX-4232 URL: https://issues.apache.org/jira/browse/PDFBOX-4232 Project: PDFBox Issue Type: Bug Components: Text extraction Reporter: Niyati wawre
*As a consumer of* this API I am facing issue when i am trying to extra the PDf which are scanned. In the extracted output can see spaces between of and few places to ave 2 spaces added in between two words. *Following* *are the examples* *In the below* *example* *space is getting added between of* *In PDF:* {code:java} is made as of October 13,2015 between XYZ and ABC.{code} *After extraction:* {code:java} is made as o f October 13,2015 between XYZ and ABC.{code} *Even, in the below example two spaces are getting added in after* *In PDF:* WHEREAS, Navigation *After extraction:* W h e r e a s ,{color:#f6c342} {color}Navigation -- This message was sent by Atlassian JIRA (v7.6.3#76005) --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org For additional commands, e-mail: dev-h...@pdfbox.apache.org