Samuel, Do the user-words work as expected after making this change?
Which version of tesseract are you using? ShreeDevi ____________________________________________________________ भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com On Wed, May 31, 2017 at 2:35 AM, Samuel backus <[email protected]> wrote: > I had to recompile tesseract after updating dict.h and dict.cpp for this > change to take effect. > > On Monday, October 3, 2011 at 3:20:05 AM UTC-4, Slavko Kocjancic wrote: >> >> Dne 2.10.2011 1:36, pi�e B.J.: >> > I ran into this problem recently. Here is the solution (I'm using >> > Tesseract 3.01): >> > to use user-words list, in dict.h and dict.cpp, find user_words_suffix >> > and change the "" to "user-words" >> > //dict.h >> > STRING_VAR_H(user_words_suffix, "user-words", "A list of user-provided >> > words."); >> > >> > //dict.cpp >> > STRING_INIT_MEMBER(user_words_suffix, "user-words", >> > "A list of user-provided words.", >> > getImage()->getCCUtil()->params()), >> > >> > This assumes, then, that in your tessdata folder there is a file named >> > "eng.user-words" with your user made word list. >> > >> > .bj. >> > >> >> I have 3.01 from svn too. >> And that field's are empty. So I modified as you suggest. But I see no >> difference in OCR. The confidence is still low and missreaded word is >> still missreaded. >> And if I remove 'eng.user-words' then tess just abort execution with >> missing eng.user-words statments so I assume that file is oppened and >> used. >> >> So is there someone smart enought to explain how that >> ('lang.user-words') works. >> And other things.. Is there someone smart enought to change source on >> svn to have that included but just to check if user-words exist not to >> popup error? (as I know the lang.user-words is optional so keep is like >> that.) >> >> Thanks... >> >> -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to [email protected]. > To post to this group, send email to [email protected]. > Visit this group at https://groups.google.com/group/tesseract-ocr. > To view this discussion on the web visit https://groups.google.com/d/ > msgid/tesseract-ocr/18a7aac6-cc5d-4904-985e-4bb6ea1bccde% > 40googlegroups.com > <https://groups.google.com/d/msgid/tesseract-ocr/18a7aac6-cc5d-4904-985e-4bb6ea1bccde%40googlegroups.com?utm_medium=email&utm_source=footer> > . > For more options, visit https://groups.google.com/d/optout. > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduUptO_NGUA6%3DeAbHzX4q6GcVSedW%3Dac_MfrvnmYFUxH3A%40mail.gmail.com. For more options, visit https://groups.google.com/d/optout.

