On 2 August 2010 07:42, Eric.yang <[email protected]> wrote:
> Hi all. I'm currently using tesseract-2.04 to recognize Chinese on
> Windows XP.

Tesseract 2 will be rubbish for Chinese. Tesseract 3 has specific
support for Chinese/Japanese/Korean.

> I read the introduction at http://code.google.com/p/tesseract-ocr/w/list,
> but I ran into a problem when doing my training. Here are the steps I
> followed:
>
> 1. tesseract 1.tif 1 batch.nochop makebox (makes a txt file)
> 2. Rename 1.txt to 1.box, then use bbtesseract to adjust the boxes.
> 3. tesseract 1.tif junk nobatch box.train (makes 1.tr and junk.txt)
> 4. mftraining scan.tr
> 5. cntraining scan.tr
> 6. unicharset_extractor scan.box
>
> OK, now I have inttemp, normproto, pffmtable and unicharset, but how do I
> use them?
> Did I do something wrong?
>

Err... you'd have to read further in the training document, where
that's explained.
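
For what it's worth, my recollection of that later part of the training
document (treat this as an assumption and check it against the document
itself) is that for Tesseract 2 the generated files are renamed with a
"lang." prefix and placed in the tessdata directory, after which the
language is selected with "-l lang". A rough Python sketch of the renaming
step follows; the function name, the "chi" code and the directory names are
made up for illustration and are not part of Tesseract:

```python
import shutil
from pathlib import Path

# The four files mentioned above. The training document lists further files
# (word lists and so on); they are not handled in this sketch.
TRAINING_OUTPUTS = ("inttemp", "normproto", "pffmtable", "unicharset")


def install_training_outputs(src_dir, tessdata_dir, lang):
    """Copy each training output to tessdata_dir as '<lang>.<name>'.

    Hypothetical helper: returns the list of paths written and raises
    FileNotFoundError if an expected output file is missing.
    """
    src = Path(src_dir)
    dest = Path(tessdata_dir)
    dest.mkdir(parents=True, exist_ok=True)
    written = []
    for name in TRAINING_OUTPUTS:
        source_file = src / name
        if not source_file.is_file():
            raise FileNotFoundError(f"missing training output: {source_file}")
        target = dest / f"{lang}.{name}"
        shutil.copyfile(source_file, target)
        written.append(target)
    return written
```

Something like install_training_outputs(".", "tessdata", "chi") followed by
"tesseract 1.tif out -l chi" would then be the idea, assuming the remaining
files the document asks for are also in place.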

> Thanks a lot!
>
> --
> You received this message because you are subscribed to the Google Groups 
> "tesseract-ocr" group.
> To post to this group, send email to [email protected].
> To unsubscribe from this group, send email to 
> [email protected].
> For more options, visit this group at 
> http://groups.google.com/group/tesseract-ocr?hl=en.
>
>



-- 
<Leftmost> jimregan, that's because deep inside you, you are evil.
<Leftmost> Also not-so-deep inside you.
