Recently I was using tesseract (4.0 alpha) to do Chinese OCR and it works
really great. Now I want to pick up a best model to use but I find several
versions. What is the difference between them?
1. chi_sim from https://github.com/tesseract-ocr/tesseract/wiki/Data-Files
2. chi_sim from https://github.com/tesseract-ocr/tessdata/tree/master/best
from https://github.com/tesseract-ocr/tessdata/tree/master/best (around 13M)
4. HanS from https://github.com/tesseract-ocr/tessdata/tree/master/best
All of them can work but the results are slightly different. From my own
evaluation #4 is the best, but I don't have any insight.
Appreciate for any help.
You received this message because you are subscribed to the Google Groups
To unsubscribe from this group and stop receiving emails from it, send an email
To post to this group, send email to firstname.lastname@example.org.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit
For more options, visit https://groups.google.com/d/optout.