[tesseract-ocr] newbie on training tesseract

Istvan Kassai Wed, 28 Apr 2021 02:28:45 -0700

Hi everyone!

I've got a lot of scanned documents with a font type and font size 
combination the tesseract recognizes with very bad quality. The documents 
are authored by a governmental office, so the obvious solution isnt work 
(to recreate with other font type)
I decided to train the tesseract, but any articles I found lacks steps, 
explains or something what essential to the success.
Is there any comprehensive tutorial or step-by-step guide for training you 
can advice?


environment: tesseract4.1 on ubuntu focal, everything is installed from 
distrib repository.

thanks in advance
Istvan

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/e4db6202-7007-45a3-a709-6f911a756739n%40googlegroups.com.

[tesseract-ocr] newbie on training tesseract

Reply via email to