On Wednesday, August 22, 2012 5:58:55 PM UTC+3, Nick White wrote:
>
> On Wed, Aug 22, 2012 at 05:50:06PM +0300, Jani Monoses wrote:
> > So there's no way of just adding new words to the existing dictionary
> > without redoing the whole training?
>
> There is a way, yes. Create a ron.user-words file in your tessdata
> directory, and a config file stating:
>
> user_words_suffix user-words
>
> (I think the config file is needed, but I'm not sure.) The
> ron.user-words file should have a list of words, one per line, UTF8
> encoded.
>

If I only do this I get:

Re-initializing document dictionary...
Error: word 'aerobuz/P' not in DAWG after adding it
Error: failed to load /usr/share/tesseract-ocr/tessdata/ron.user-words

So do I need to run the wordlist2dawg and recombination command sequence, as you suggested in your initial reply? I probably need to read the documentation in more detail than I anticipated :)

thanks
Jani
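
One thing that may be worth checking before redoing any training: the rejected word is 'aerobuz/P', which looks like an entry copied from a spell-checker dictionary with an affix flag ("/P") still attached. That reading is an assumption, not something the error message confirms. If it holds, a plain word list with the flags stripped might load. A minimal Python sketch of that cleanup follows; the helper names strip_affix_flags and write_user_words are made up for illustration and are not part of Tesseract.

```python
from pathlib import Path


def strip_affix_flags(lines):
    """Return unique words with any '/FLAGS' suffix removed, in input order.

    Assumption: everything after the first '/' on a line is an affix flag
    and not part of the word itself.
    """
    seen = set()
    words = []
    for line in lines:
        word = line.strip().split("/", 1)[0]
        if word and word not in seen:
            seen.add(word)
            words.append(word)
    return words


def write_user_words(words, path):
    """Write one word per line, UTF-8 encoded, as described in the quoted reply."""
    Path(path).write_text("".join(w + "\n" for w in words), encoding="utf-8")


if __name__ == "__main__":
    # Illustrative input only; the real list would come from your own word file.
    sample = ["aerobuz/P", "aerobuz", "", "casa/F"]
    for word in strip_affix_flags(sample):
        print(word)
```

The cleaned list would then be written with write_user_words(words, "ron.user-words") and copied into the tessdata directory. This does not handle a leading word-count line, which some spell-checker dictionaries have, so remove that by hand first.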

