Hi,

Does this also happen with the current version (1.8.4)?

Tilman

On 25.03.2014 13:53, Hesham G. wrote:
Hello,

When reading a PDF with PDFBox 1.7.1, many spaces are ignored, so words are merged together in the extracted text.

You can test with a 1-page sample PDF from here: http://www.4shared.com/office/yqJGUZn2ce/wrong_space_parsed_sample.html

You can see wrongly read words such as: aboutmidnight, andbefore, CountyDonegal, ...

I have tried using PDFTextStripper.setAverageCharTolerance(...) to control the space sensitivity, but it didn’t make any difference.

Any idea why this happens and how to fix it?

Best regards,
Hesham
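For readers following the thread, here is a rough sketch of why a tolerance setting can decide whether two words end up merged. PDF pages often store no space characters at all, so an extractor has to guess word breaks from the horizontal gap between pieces of text. The class below is a simplified, self-contained model of that idea and is NOT PDFBox's actual implementation: the names `SpaceGapSketch`, `Glyph` and `join`, the coordinates, and the rule "insert a space when the gap exceeds tolerance times the average width" are all assumptions made for illustration.

```java
import java.util.Arrays;
import java.util.List;

public class SpaceGapSketch {

    /** Hypothetical positioned piece of text: content, left x coordinate, and width. */
    static final class Glyph {
        final String text;
        final double x;
        final double width;

        Glyph(String text, double x, double width) {
            this.text = text;
            this.x = x;
            this.width = width;
        }
    }

    /**
     * Joins the pieces of one line, inserting a space wherever the horizontal gap
     * to the previous piece is larger than tolerance * average piece width.
     * This threshold rule is an assumption for illustration, not PDFBox's formula.
     */
    static String join(List<Glyph> glyphs, double tolerance) {
        if (glyphs.isEmpty()) {
            return "";
        }
        double totalWidth = 0.0;
        for (Glyph g : glyphs) {
            totalWidth += g.width;
        }
        double avgWidth = totalWidth / glyphs.size();

        StringBuilder out = new StringBuilder();
        Glyph prev = null;
        for (Glyph g : glyphs) {
            if (prev != null) {
                // Gap between the right edge of the previous piece and the left edge of this one.
                double gap = g.x - (prev.x + prev.width);
                if (gap > tolerance * avgWidth) {
                    out.append(' ');
                }
            }
            out.append(g.text);
            prev = g;
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // Made-up coordinates: the two words are only 1.5 units apart.
        List<Glyph> line = Arrays.asList(
                new Glyph("about", 0.0, 25.0),
                new Glyph("midnight", 26.5, 40.0));

        System.out.println(join(line, 0.3));   // prints "aboutmidnight"
        System.out.println(join(line, 0.01));  // prints "about midnight"
    }
}
```

In this toy model a small gap is swallowed when the tolerance is high and becomes a space when the tolerance is low. If a real PDF behaves like the sample above, it may be worth trying smaller tolerance values, and checking whether the PDFBox version in use offers a separate spacing tolerance setter in addition to `setAverageCharTolerance(...)`; whether that helps for this particular file is not something the sketch can tell.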

