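Did you try the options -psm 7 (treat the image as a single text line) or -psm 8 (treat the image as a single word)? You will probably get better results with one of them, since the default page segmentation mode expects a full page of text rather than one short line.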
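In case it helps, here is a minimal sketch of how the suggested options could be passed to the tesseract command from a Python script. The file names, the `chi_sim` language code, and both helper function names are assumptions made for illustration and are not taken from the original post; use whichever Chinese traineddata you actually have installed. Tesseract 3.x spells the option `-psm`, while 4.x and later spell it `--psm`, so the flag is switchable.

```python
import shutil
import subprocess


def build_tesseract_command(image_path, output_base, lang="chi_sim", psm=7, legacy_flag=True):
    """Build the argument list for a tesseract run with an explicit page segmentation mode.

    lang="chi_sim" is an assumed language code; psm=7 means "single text line".
    legacy_flag=True uses the 3.x spelling (-psm); False uses the 4.x+ spelling (--psm).
    """
    psm_flag = "-psm" if legacy_flag else "--psm"
    return ["tesseract", image_path, output_base, "-l", lang, psm_flag, str(psm)]


def run_single_line_ocr(image_path, output_base, **kwargs):
    """Run tesseract on one image and return the path of the text file it writes."""
    if shutil.which("tesseract") is None:
        raise RuntimeError("tesseract executable not found on PATH")
    cmd = build_tesseract_command(image_path, output_base, **kwargs)
    subprocess.run(cmd, check=True)  # raises CalledProcessError if tesseract fails
    return output_base + ".txt"      # tesseract appends .txt to the output base
```

Calling `run_single_line_ocr("xingqisi.png", "out")` (hypothetical file name) would try mode 7; passing `psm=8` tries the single-word mode instead, which may suit a three-character image better.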
Paul

On Thursday, 31 July 2014 08:36:12 UTC+2, Sayang wrote:
>
> a) Tesseract correctly OCR'd eight (>30 character) lines of Chinese,
> scanned from a book
>
> b) Tesseract seemed to fail OCR'ing a single line image with three
> characters (xingqisi - Thursday)
>
> (i) Four different fonts were tried - so four different single line
> images - attached.
> (ii) The binary data produced by Tesseract for each of the four
> attempts was identical
> (iii) There were no error messages.
>
> Any suggestions would be greatly appreciated.

