Could you advise on how to get better results for images like the attached? The Chinese characters are very clear, but Tesseract generates the wrong results.
<https://lh3.googleusercontent.com/-_n1Tl-UNvFY/VfBlbDYHFtI/AAAAAAAADxQ/9l7tU38KdUY/s1600/testout.png> Thanks very much for your help! On Friday, November 2, 2012 at 10:02:49 AM UTC-4, Sven Pedersen wrote: > > Preprocessing can help. Give us some example images and we may be able to > help. > --Sven > > On Fri, Nov 2, 2012 at 7:25 AM, Rong Xiao <[email protected] <javascript:>> > wrote: > > hi,I have tried tesseract-ocr on chinese,but I found that it can do well > on > > only few fonts. I want to know what kind of fonts are included in > > chi_sim.traineddata? If I expect better accuracy, need I train it by > myself > > ? > > > > thanks > > > > -- > > You received this message because you are subscribed to the Google > > Groups "tesseract-ocr" group. > > To post to this group, send email to [email protected] > <javascript:> > > To unsubscribe from this group, send email to > > [email protected] <javascript:> > > For more options, visit this group at > > http://groups.google.com/group/tesseract-ocr?hl=en > > > > -- > ``All that is gold does not glitter, > not all those who wander are lost; > the old that is strong does not wither, > deep roots are not reached by the frost. > From the ashes a fire shall be woken, > a light from the shadows shall spring; > renewed shall be blade that was broken, > the crownless again shall be king.” > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at http://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/51f7dfa6-32e4-4d27-bcb7-0afe1e4769d9%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.

