To those who come across this old thread:

Training from single line images and their groundtruth is now possible using 
the makefile in tesstrain repo.

https://stackoverflow.com/questions/43352918/how-do-i-train-tesseract-4-with-image-data-instead-of-a-font-file

The above link has a good explanation.
The only change I would suggest is to download tessdata_best/eng.traineddata 
(or other language as needed) to use as startmodel individually using wget 
rather than clone the whole repo which is a few gigs of data.

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/ae20cd08-3f6c-420c-b897-1f069432e610o%40googlegroups.com.

Reply via email to