Tom, 

I wasn't aware of the new possibility to specify user words on the command 
line. Instead, I used the config file method, with the following command 
lines and outputs:

tesseract.exe --version 
tesseract 3.05.00dev
 leptonica-1.72
  libgif 4.1.6(?) : libjpeg 8d (libjpeg-turbo 1.3.1) : libpng 1.6.17 : libtiff 4.0.3 : zlib 1.2.8 : libwebp 0.4.3

tesseract.exe test.jpg stdout -l deu --print-parameters bazaar | grep load_system\|load_freq\|user_
load_system_dawg    0    Load system word dawg.
load_freq_dawg    0    Load frequent word dawg.
user_words_file        A filename of user-provided words.
user_words_suffix    user-words    A suffix of user-provided words located in tessdata.
user_patterns_file        A filename of user-provided patterns.
user_patterns_suffix    user-patterns    A suffix of user-provided patterns located in tessdata.


tesseract.exe test.jpg stdout -l deu --print-parameters | grep load_system\|load_freq\|user_
load_system_dawg    1    Load system word dawg.
load_freq_dawg    1    Load frequent word dawg.
user_words_file        A filename of user-provided words.
user_words_suffix        A suffix of user-provided words located in tessdata.
user_patterns_file        A filename of user-provided patterns.
user_patterns_suffix        A suffix of user-provided patterns located in tessdata.
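
To compare the two parameter dumps without reading them side by side, something like the following rough sketch might help. It assumes that --print-parameters writes one parameter per line as name, value and description separated by tabs (the columns above suggest this, but I have not checked it against the source). The function names parse_parameters and diff_parameters are made up for illustration.

```python
def parse_parameters(text):
    """Map parameter name -> value from a --print-parameters dump.

    Assumption: each line is "name<TAB>value<TAB>description".
    Lines with fewer than three fields are skipped.
    """
    params = {}
    for line in text.splitlines():
        fields = line.split("\t")
        if len(fields) >= 3:
            params[fields[0]] = fields[1]
    return params


def diff_parameters(first, second):
    """Return {name: (value_in_first, value_in_second)} for differing values."""
    names = sorted(set(first) | set(second))
    return {
        name: (first.get(name), second.get(name))
        for name in names
        if first.get(name) != second.get(name)
    }


# Two shortened dumps modelled on the output above (tabs assumed).
with_config = (
    "load_system_dawg\t0\tLoad system word dawg.\n"
    "user_words_suffix\tuser-words\tA suffix of user-provided words located in tessdata.\n"
)
without_config = (
    "load_system_dawg\t1\tLoad system word dawg.\n"
    "user_words_suffix\t\tA suffix of user-provided words located in tessdata.\n"
)

changes = diff_parameters(parse_parameters(with_config),
                          parse_parameters(without_config))
for name, (with_value, without_value) in changes.items():
    print(f"{name}: with config={with_value!r}, without={without_value!r}")
```

In practice the two strings would come from the redirected output of the two commands above.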

My bazaar config file:

load_system_dawg     F
load_freq_dawg       F
user_words_suffix    user-words
user_patterns_suffix user-patterns
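
In case someone wants to reproduce this, here is a minimal sketch that writes the same config file and assembles the command line used above. It only builds the argument list and does not run Tesseract. The helper names write_config and build_command are hypothetical, and the file is written to a temporary directory here; where Tesseract actually looks for config files depends on the installation.

```python
import os
import tempfile
from pathlib import Path

# The four settings from the bazaar config file above.
BAZAAR = {
    "load_system_dawg": "F",
    "load_freq_dawg": "F",
    "user_words_suffix": "user-words",
    "user_patterns_suffix": "user-patterns",
}


def write_config(path, params):
    """Write one "name value" pair per line and return the lines written."""
    lines = [f"{name} {value}" for name, value in params.items()]
    Path(path).write_text("\n".join(lines) + "\n", encoding="utf-8")
    return lines


def build_command(image, lang, config_name, exe="tesseract"):
    """Build the argument list in the same order as the command shown above."""
    return [exe, image, "stdout", "-l", lang, "--print-parameters", config_name]


# Write to a temporary directory (illustration only).
config_path = os.path.join(tempfile.mkdtemp(), "bazaar")
written = write_config(config_path, BAZAAR)
command = build_command("test.jpg", "deu", "bazaar")
print(" ".join(command))
```

The resulting list could be passed to subprocess.run once the config file sits where the local Tesseract installation expects it.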

For the time being, I solved my problem by increasing the scan resolution 
from 300 dpi to 600 dpi, which ensures that everything is recognized 
correctly with the default (system) settings.
