[tesseract-ocr] Training Tesseract 5.0.0 to recognize digital handwriting

'Fabio Lugli' via tesseract-ocr Tue, 14 Jan 2020 08:44:07 -0800

Hello everyone, i'm trying to train tesseract on handwriting, knowing that 
it's not the best option, using the latest version available for Windows. I 
have access to a huge amount of .tif files, lines of handwritten text, i'm 
able to obtain the .box files, which I later edit to be compliant to the 
latest requirements (boxes all over the line, spaces between words, tab at 
the end). After that i did not understand how to improve eng.traineddata or 
how to create an own .traineddata file, also following the instructions on 
https://github.com/tesseract-ocr/tesseract/wiki/TrainingTesseract-4.00. 
So which are the next passages to obtain a correct training dataset?


-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/b736f06c-0627-41ad-bd2a-6dcad01b4576%40googlegroups.com.

[tesseract-ocr] Training Tesseract 5.0.0 to recognize digital handwriting

Reply via email to