Hi Mayur, For question #2 I believe Google is keeping the training datasets secret. But they are believed to be English books, from what I hear. I'm not sure about the classifiers. --Sven
On Wed, Mar 28, 2012 at 1:37 AM, Mayur Mudigonda <[email protected]> wrote: > Dear Tesseract Community, > > I am a computer vision scientist working on using Tesseract and some > computer vision algorithms to help blind/aging people with text/object > recognition. I have playing around with Tesseract for the past few months > and I am interested in understanding a bit more on how Tesseract is really > trained. > > Can anyone help me with the following questions? > > Can anyone tell me more about the classifiers that Tesseract uses? > I've read the page on Training Tesseract and want to know specifics of the > dataset on which the default (svn head english version) is trained on > > Thanks, > Mayur > > -- > You received this message because you are subscribed to the Google > Groups "tesseract-ocr" group. > To post to this group, send email to [email protected] > To unsubscribe from this group, send email to > [email protected] > For more options, visit this group at > http://groups.google.com/group/tesseract-ocr?hl=en -- ``All that is gold does not glitter, not all those who wander are lost; the old that is strong does not wither, deep roots are not reached by the frost. >From the ashes a fire shall be woken, a light from the shadows shall spring; renewed shall be blade that was broken, the crownless again shall be king.” -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en

