VietOCR has a training program that automates the steps, but I suspect that you should preprocess the images rather than retraining. Check the wiki for details of minimum letter height in pixels and you'll probably be able to use a free program like ImageMagick to produce images that will get good results. --Sven
On Sun, Oct 20, 2013 at 9:56 AM, Lehel Kovach <[email protected]> wrote: > I have screenshots from a game I wish to use tesseract for to read text > but the English set isn't reading the characters correctly. (the fonts come > out pixely in the screenshots). I believe I need to train tesseract for > the fonts in the screenshots. Correct? > > P.S. I was reading on the tesseract3 training wiki page that there is no > tool yet to automate all those steps. Has there been one developed by > anyone yet? > > > Thanks, > Lehel > > -- > -- > You received this message because you are subscribed to the Google > Groups "tesseract-ocr" group. > To post to this group, send email to [email protected] > To unsubscribe from this group, send email to > [email protected] > For more options, visit this group at > http://groups.google.com/group/tesseract-ocr?hl=en > > --- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to [email protected]. > For more options, visit https://groups.google.com/groups/opt_out. > -- ``All that is gold does not glitter, not all those who wander are lost; the old that is strong does not wither, deep roots are not reached by the frost. >From the ashes a fire shall be woken, a light from the shadows shall spring; renewed shall be blade that was broken, the crownless again shall be king.” -- -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en --- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. For more options, visit https://groups.google.com/groups/opt_out.

