I need to be able to adjust the size of the gap which Tesseract uses to 
determine the separation between words.

Here is my setup:

- Tesseract for Windows, version 5.0.0, from UB-Mannheim
- Image cleaning and resizing using OpenCV (I have put much effort into 
  getting this as good as I can)
- Parameters: --psm 6 --oem 1 (LSTM engine)

In my case, I need to make that gap a bit smaller than Tesseract seems to 
use.  An example is the following:
[image: OneOfThree.png]
For this image, Tesseract returns "1of3", essentially treating this as one 
word with no spaces.

It seems that the configuration parameters whose names start with "tosp_" 
should be the ones that control this inter-word spacing, but I have 
experimented extensively with them and seen no change in the output.  Perhaps 
these parameters are only relevant to the legacy engine.
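
For reference, here is a minimal sketch of how such "tosp_" overrides can be 
passed alongside the setup above, using the command-line "-c name=value" 
option. The helper name build_tesseract_command is hypothetical, and the value 
given for tosp_min_sane_kn_sp is illustrative only, not a known-good setting. 
Whether the LSTM engine honors these variables at all is exactly the open 
question.

```python
import shlex


def build_tesseract_command(image_path, psm=6, oem=1, variables=None):
    """Build the argument list for a tesseract run that prints text to stdout.

    `variables` maps config variable names to values; each one is passed
    as its own "-c name=value" pair.
    """
    cmd = ["tesseract", image_path, "stdout", "--psm", str(psm), "--oem", str(oem)]
    for name, value in (variables or {}).items():
        cmd += ["-c", f"{name}={value}"]
    return cmd


# Illustrative value only: an assumption, not a recommended setting.
cmd = build_tesseract_command(
    "OneOfThree.png",
    variables={"tosp_min_sane_kn_sp": 1.0},
)
print(shlex.join(cmd))
```

The resulting list can be handed to subprocess.run(cmd, capture_output=True, 
text=True) on a machine where the tesseract executable is on the PATH.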

So is there anyone who can tell me how to configure the size of the gap 
between words when using the LSTM engine?

Thanks,

Dave
