In some right-to-left languages, compound words are separated using 
"semi-space" (please take a look at Unicode spaces). When the input document 
contains these words, PDFTextStripper neglects semi-space character and 
concatenates words together. 

Would you please give me some hint to extend which function of PDFTextStripper 
to manage semi-space characters?
Kind regards,
Amir

Reply via email to