[tesseract-ocr] Training wrongly recognized text using Tesseract OCR training libraries

thomas Fri, 24 Jul 2015 00:35:57 -0700


I used Tesseract to recognize texts. Some texts are wrongly recognized. So 
I need to retrain them. I read the articles from here 
<https://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract3> and here 
<https://blog.cedric.ws/how-to-train-tesseract-301>. According to the 
discussions there, I understood as mftrainingand cntraining can accept only 
maximum of 64 tr files.


One tr file and one box file are produced from one jpg file.

So it means,one time training accepts only maximum of 64 jpg files, then 
final output is eng.traineddata (for example) for one time training of 64 
jpg files.

If I have 200 files to train, how can I make only one eng.traineddata file 
for all those 200 files.

Thanks

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at http://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/6fbb6f78-981a-40be-b15b-d5ea61d8d93c%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

[tesseract-ocr] Training wrongly recognized text using Tesseract OCR training libraries

Reply via email to