Hi Mayce, The DAWG is kind of a black box (unless you want to read the source code) and the dictionaries are just kind of word lists to give higher probability to known valid English/Spanish/etc. words. Basically people just try things out and ask questions on this list if things behave differently than expected. You can also do searches of the list archives to see how several questions were answered.
http://groups.google.com/group/tesseract-ocr ("search this group") --Sven On Thu, Aug 11, 2011 at 9:30 AM, Mayce Al <[email protected]> wrote: > Hey everyone, > I am new to Tesseract. I would like to ask, if someone has more details > about how to use the dictionaries in Tesseract. > Is it possible to visualize them to see how do the words, punctuations, > digits models are represented in Tesseract? > In the link http://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract3, > it does not show in details about how to view the dictionaries dawg. > I appreciate any support, > Best Regards > Mayce > > -- > You received this message because you are subscribed to the Google > Groups "tesseract-ocr" group. > To post to this group, send email to [email protected] > To unsubscribe from this group, send email to > [email protected] > For more options, visit this group at > http://groups.google.com/group/tesseract-ocr?hl=en > -- ``All that is gold does not glitter, not all those who wander are lost; the old that is strong does not wither, deep roots are not reached by the frost. >From the ashes a fire shall be woken, a light from the shadows shall spring; renewed shall be blade that was broken, the crownless again shall be king.” -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en

