How to manage semi-space characters in PDFTextStripper?

Amir H. Jadidinejad Tue, 05 Aug 2014 02:28:36 -0700

In some right-to-left languages, compound words are separated using 
"semi-space" (please take a look at Unicode spaces). When the input document 
contains these words, PDFTextStripper neglects semi-space character and 
concatenates words together.


Would you please give me some hint to extend which function of PDFTextStripper 
to manage semi-space characters?
Kind regards,
Amir

How to manage semi-space characters in PDFTextStripper?

Reply via email to