In some right-to-left languages, the parts of a compound word are separated by a "semi-space" character (see the Unicode space characters). When the input document contains such words, PDFTextStripper drops the semi-space character and concatenates the parts together.
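To make the expected behaviour concrete, here is a minimal sketch of one possible approach: re-inserting the separator based on the horizontal gap between neighbouring glyphs. It rests on several assumptions that are not confirmed above: that the "semi-space" is the zero-width non-joiner (U+200C), that it never reaches the extracted text, and that the gap thresholds shown are reasonable. The Glyph class and the threshold parameters are hypothetical stand-ins so the code runs without PDFBox. In PDFBox itself the same logic would presumably go in an override of the protected writeString(String, List<TextPosition>) method, but that has not been tested.

```java
import java.util.Arrays;
import java.util.List;

public class SemiSpaceSketch {
    // Assumption: the "semi-space" is the zero-width non-joiner.
    public static final char ZWNJ = '\u200C';

    // Hypothetical stand-in for a positioned glyph (not a PDFBox class).
    public static final class Glyph {
        public final String text;
        public final float x;      // left edge of the glyph
        public final float width;  // advance width of the glyph

        public Glyph(String text, float x, float width) {
            this.text = text;
            this.x = x;
            this.width = width;
        }
    }

    // Joins glyphs in reading order. A gap of at least wordSpaceMin becomes
    // a normal space; a smaller gap of at least semiSpaceMin becomes ZWNJ.
    // Both thresholds are illustrative and would need tuning per font size.
    public static String join(List<Glyph> glyphs, float semiSpaceMin, float wordSpaceMin) {
        StringBuilder sb = new StringBuilder();
        Glyph prev = null;
        for (Glyph g : glyphs) {
            if (prev != null) {
                // Distance between the two glyph boxes; taking the larger of
                // the two differences works for x increasing or decreasing.
                float gap = Math.max(g.x - (prev.x + prev.width),
                                     prev.x - (g.x + g.width));
                if (gap >= wordSpaceMin) {
                    sb.append(' ');
                } else if (gap >= semiSpaceMin) {
                    sb.append(ZWNJ);
                }
            }
            sb.append(g.text);
            prev = g;
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        // Glyphs laid out right to left: x decreases in reading order.
        List<Glyph> glyphs = Arrays.asList(
                new Glyph("a", 20f, 5f),
                new Glyph("b", 15f, 5f),
                new Glyph("c", 9f, 5f),   // small gap before this glyph
                new Glyph("d", 0f, 5f));  // large gap before this glyph
        String joined = join(glyphs, 0.5f, 3f);
        System.out.println("contains ZWNJ: " + (joined.indexOf(ZWNJ) >= 0));
    }
}
```

If the semi-space character is in fact present in the page content and is only being dropped later, a gap heuristic like this would be unnecessary, and the right place to intervene would be wherever the character is discarded.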
Could you please give me a hint about which method of PDFTextStripper I should override to handle semi-space characters?

Kind regards,
Amir

