Thanks for this clarification.

Stef

On Wednesday, 30 September 2015 20:30:51 UTC+2, Tom Morris wrote:
>
> Perhaps this is just a misunderstanding or bad documentation.  The 
> --print-parameters dump shows the input parameters, and the user_words_file 
> / user_patterns_file parameters, if they're not set on the command line, 
> will always be empty.
>
> The actual file name that gets loaded is computed on the fly here:
>
> https://github.com/tesseract-ocr/tesseract/blob/master/dict/dict.cpp#L274
> but the result isn't saved back into the user_words_file parameter.
>
> Tom
>
> On Monday, September 28, 2015 at 4:02:15 AM UTC-4, Stef wrote:
>>
>> Tom, 
>>
>> I wasn't aware of the new option to specify user words on the command 
>> line. Instead, I used the config file method. The commands I ran and 
>> their output follow:
>>
>> tesseract.exe --version 
>> tesseract 3.05.00dev
>>  leptonica-1.72
>>   libgif 4.1.6(?) : libjpeg 8d (libjpeg-turbo 1.3.1) : libpng 1.6.17 : libtiff 4.0.3 : zlib 1.2.8 : libwebp 0.4.3
>>
>> tesseract.exe test.jpg stdout -l deu --print-parameters bazaar | grep load_system\|load_freq\|user_
>> load_system_dawg    0    Load system word dawg.
>> load_freq_dawg    0    Load frequent word dawg.
>> user_words_file        A filename of user-provided words.
>> user_words_suffix    user-words    A suffix of user-provided words located in tessdata.
>> user_patterns_file        A filename of user-provided patterns.
>> user_patterns_suffix    user-patterns    A suffix of user-provided patterns located in tessdata.
>>
>>
>> tesseract.exe test.jpg stdout -l deu --print-parameters | grep load_system\|load_freq\|user_
>> load_system_dawg    1    Load system word dawg.
>> load_freq_dawg    1    Load frequent word dawg.
>> user_words_file        A filename of user-provided words.
>> user_words_suffix        A suffix of user-provided words located in tessdata.
>> user_patterns_file        A filename of user-provided patterns.
>> user_patterns_suffix        A suffix of user-provided patterns located in tessdata.
>>
>> My bazaar config file:
>>
>> load_system_dawg     F
>> load_freq_dawg       F
>> user_words_suffix    user-words
>> user_patterns_suffix user-patterns
>>
>> For the time being, I solved my problem by increasing the scan resolution 
>> from 300 dpi to 600 dpi, which lets everything be recognized correctly 
>> with the default (system) settings.
>>
>>
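
The file-name resolution Tom describes can be sketched roughly as follows. This is a simplified illustration in Python, not the actual code in dict.cpp: the function name is hypothetical, and it assumes that an explicit user_words_file takes precedence and that the suffix is otherwise appended to a per-language data path prefix (the "tessdata/deu." value below is an assumed example). It also shows why --print-parameters can report an empty user_words_file even when a file is loaded: the computed name is only returned, never written back into the parameter.

```python
def resolve_user_words_path(user_words_file, user_words_suffix,
                            language_data_path_prefix):
    """Return the path that would be loaded, or None if nothing is configured.

    Hypothetical helper; the parameter names mirror the Tesseract
    parameters discussed in this thread, but the logic is an assumption.
    """
    if user_words_file:
        # An explicitly given file name wins.
        return user_words_file
    if user_words_suffix:
        # Otherwise build the name from the language prefix and the suffix.
        return language_data_path_prefix + user_words_suffix
    # Neither parameter is set, so no user word list is loaded.
    return None


if __name__ == "__main__":
    # Values from the "bazaar" config above; the prefix is an assumed example.
    print(resolve_user_words_path("", "user-words", "tessdata/deu."))
```

Under these assumptions, the bazaar config would resolve to a file named deu.user-words inside tessdata, while user_words_file itself stays empty in the parameter dump.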
