To those who come across this old thread: Training from single line images and their groundtruth is now possible using the makefile in tesstrain repo.
https://stackoverflow.com/questions/43352918/how-do-i-train-tesseract-4-with-image-data-instead-of-a-font-file The above link has a good explanation. The only change I would suggest is to download tessdata_best/eng.traineddata (or other language as needed) to use as startmodel individually using wget rather than clone the whole repo which is a few gigs of data. -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/ae20cd08-3f6c-420c-b897-1f069432e610o%40googlegroups.com.

