Samuel,

Do the user-words work as expected after making this change?

Which version of tesseract are you using?

ShreeDevi
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

On Wed, May 31, 2017 at 2:35 AM, Samuel backus <[email protected]>
wrote:

> I had to recompile tesseract after updating dict.h and dict.cpp for this
> change to take effect.
>
> On Monday, October 3, 2011 at 3:20:05 AM UTC-4, Slavko Kocjancic wrote:
>>
>> Dne 2.10.2011 1:36, pi�e B.J.:
>> > I ran into this problem recently.  Here is the solution (I'm using
>> > Tesseract 3.01):
>> > to use user-words list, in dict.h and dict.cpp, find user_words_suffix
>> > and change the "" to "user-words"
>> > //dict.h
>> > STRING_VAR_H(user_words_suffix, "user-words", "A list of user-provided
>> > words.");
>> >
>> > //dict.cpp
>> > STRING_INIT_MEMBER(user_words_suffix, "user-words",
>> >    "A list of user-provided words.",
>> >    getImage()->getCCUtil()->params()),
>> >
>> > This assumes, then, that in your tessdata folder there is a file named
>> > "eng.user-words" with your user made word list.
>> >
>> > .bj.
>> >
>>
>> I have 3.01 from svn too.
>> And that field's are empty. So I modified as you suggest. But I see no
>> difference in OCR. The confidence is still low and missreaded word is
>> still missreaded.
>> And if I remove 'eng.user-words' then tess just abort execution with
>> missing eng.user-words statments so I assume that file is oppened and
>> used.
>>
>> So is there someone smart enought to explain how that
>> ('lang.user-words') works.
>> And other things.. Is there someone smart enought to change source on
>> svn to have that included but just to check if user-words exist not to
>> popup error? (as I know the lang.user-words is optional so keep is like
>> that.)
>>
>> Thanks...
>>
>> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to [email protected].
> To post to this group, send email to [email protected].
> Visit this group at https://groups.google.com/group/tesseract-ocr.
> To view this discussion on the web visit https://groups.google.com/d/
> msgid/tesseract-ocr/18a7aac6-cc5d-4904-985e-4bb6ea1bccde%
> 40googlegroups.com
> <https://groups.google.com/d/msgid/tesseract-ocr/18a7aac6-cc5d-4904-985e-4bb6ea1bccde%40googlegroups.com?utm_medium=email&utm_source=footer>
> .
> For more options, visit https://groups.google.com/d/optout.
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduUptO_NGUA6%3DeAbHzX4q6GcVSedW%3Dac_MfrvnmYFUxH3A%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to