On 2 August 2010 07:42, Eric.yang <[email protected]> wrote:

> Hi all. I'm currently using tesseract-2.04 to recognize Chinese, on
> Windows XP.

Tesseract 2 will be rubbish for Chinese. Tesseract 3 has specific support for Chinese/Japanese/Korean.

> I read the introduction at http://code.google.com/p/tesseract-ocr/w/list,
> but when I do my training I run into some problems. Here are the steps I
> did:
>
> 1. tesseract 1.tif 1 batch.nochop makebox   (makes a txt file)
> 2. Rename 1.txt to 1.box, then use bbtesseract to adjust it.
> 3. tesseract 1.tif junk nobatch box.train   (makes 1.tr and junk.txt)
> 4. mftraining scan.tr
> 5. cnTraining scan.tr
> 6. unicharset_extractor scan.box
>
> OK, there are inttemp / normproto / pffmtable / unicharset, but how do I
> use them?
> Did I do something wrong?

Err... you'd have to read further in the training document, where that's explained.

> Thanks a lot!
>
> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To post to this group, send email to [email protected].
> To unsubscribe from this group, send email to
> [email protected].
> For more options, visit this group at
> http://groups.google.com/group/tesseract-ocr?hl=en.

--
<Leftmost> jimregan, that's because deep inside you, you are evil.
<Leftmost> Also not-so-deep inside you.
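For anyone landing on this thread with the same "how do I use them?" question, here is a rough sketch of the step the training document covers next, as far as I recall it for Tesseract 2.x: the generated files are renamed with a language prefix and placed in the tessdata directory, after which the language is selected with `-l`. The file list, the `install_trained_files` helper, and the `chi` language code are my assumptions for illustration and do not come from this thread, so check them against the training document for your version.

```python
import shutil
import tempfile
from pathlib import Path

# Assumption: the set of per-language files Tesseract 2.x looks for in tessdata.
# Only the first four are mentioned in this thread; the rest are recalled from
# the 2.x training document and should be verified there.
TRAINED_FILES = [
    "inttemp", "normproto", "pffmtable", "unicharset",
    "freq-dawg", "word-dawg", "user-words", "DangAmbigs",
]


def install_trained_files(src_dir, tessdata_dir, lang):
    """Copy each trained file to tessdata_dir as '<lang>.<name>'.

    Returns the names from TRAINED_FILES that were not found in src_dir.
    """
    src, dst = Path(src_dir), Path(tessdata_dir)
    dst.mkdir(parents=True, exist_ok=True)
    missing = []
    for name in TRAINED_FILES:
        source = src / name
        if source.is_file():
            shutil.copyfile(source, dst / f"{lang}.{name}")
        else:
            missing.append(name)
    return missing


if __name__ == "__main__":
    # Demo in throwaway directories; "chi" is a hypothetical language code.
    with tempfile.TemporaryDirectory() as work:
        training_out = Path(work, "training_output")
        training_out.mkdir()
        for name in ("inttemp", "normproto", "pffmtable", "unicharset"):
            (training_out / name).write_bytes(b"")
        not_found = install_trained_files(training_out, Path(work, "tessdata"), "chi")
        print("still missing:", not_found)
```

With the files in place, recognition would then be run as something like `tesseract 1.tif out -l chi`. The helper reports missing files instead of failing, because the steps quoted above only produce four of them and the dictionary files come from a later part of the training procedure.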

