[tesseract-ocr] Re: User-words with Tesseract 5

2020-03-31 Thread Gabriel de Oliveira
I'm not sure if user-words and/or whitelist characters are supported by LSTMs engines (versions>= 4.00) Last news I had about this it was only suported on legacy engines (v3.x) with the --oem 0 option. Maybe someone can prove correct me if I'm wrong? On Monday, March 23, 2020 at 11:38:46 AM

Re: [tesseract-ocr] Re: user-words

2017-05-31 Thread ShreeDevi Kumar
Samuel, Do the user-words work as expected after making this change? Which version of tesseract are you using? ShreeDevi भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com On Wed, May 31, 2017 at 2:35 AM, Samuel backus

[tesseract-ocr] Re: user-words

2017-05-31 Thread Samuel backus
I had to recompile tesseract after updating dict.h and dict.cpp for this change to take effect. On Monday, October 3, 2011 at 3:20:05 AM UTC-4, Slavko Kocjancic wrote: > > Dne 2.10.2011 1:36, pi�e B.J.: > > I ran into this problem recently. Here is the solution (I'm using > > Tesseract

[tesseract-ocr] Re: user-words / bazaar

2015-09-30 Thread Tom Morris
Perhaps this is just a misunderstanding or bad documentation. The --print-parameters dump shows the input parameters, and the user_words_file / user_patterns_file parameters, if they're not set on the command line, will always be empty. The actual file name that gets loaded gets computed on

[tesseract-ocr] Re: user-words / bazaar

2015-09-30 Thread Stef
Thanks for this clarification. Stef Am Mittwoch, 30. September 2015 20:30:51 UTC+2 schrieb Tom Morris: > > Perhaps this is just a misunderstanding or bad documentation. The > --print-parameters dump shows the input parameters, and the user_words_file > / user_patterns_file parameters, if

[tesseract-ocr] Re: user-words / bazaar

2015-09-28 Thread Stef
Tom, I wasn't aware of the new possiblity to specify user words on the command line. Instead I used the config file method with the following command lines and outputs: tesseract.exe --version tesseract 3.05.00dev leptonica-1.72 libgif 4.1.6(?) : libjpeg 8d (libjpeg-turbo 1.3.1) : libpng

[tesseract-ocr] Re: user-words / bazaar

2015-09-24 Thread Meh Hem
Hi Stef, They have indeed no effect as far as I have found. The idea is great, but unfortunately it just does not seem to work. I have found no working demonstrations of it after looking for quite an amount of time. We have instead found a strong ambiguous character set combined with

[tesseract-ocr] Re: user-words / bazaar

2015-09-24 Thread Tom Morris
On Monday, September 21, 2015 at 9:29:39 AM UTC-4, Stef wrote: > > I'm trying to use user wordlists with the bazaar config but it seems to > have no effect on the OCR result in my case. Therefore I printed the > current parameters to verify whether the user-words list is used. This > confirmed