Thanks, @Zdenko. I have successfully trained two new fonts: Ubuntu and Inter. I am using Tesseract 3.0.5 and Tessdata-3.0.4.

1. Tesseract does not recognize the fonts and keeps returning a strange font name instead. For Ubuntu it returned the 1809_Homer font name, which left me wondering whether something is wrong with the training.
2. Tesseract does not seem to treat font-weight: 700 and font-weight: bold as equivalent. They are the same weight, but Tesseract reports font-weight: 700 as a normal font.

What can I do to remedy this?
On Friday, 20 May 2022 at 11:32:12 UTC+2 zdenop wrote:

> Can you please clarify what exactly you want to do / achieve? Training
> LSTM model or legacy model?
>
> Zdenko
>
> On Thu, 19 May 2022 at 16:12, Kehinde Adeoya <[email protected]> wrote:
>
>> Are there tutorials that detail how to train a new font using
>> the latest Tesseract-5 and Tessdata-3.0.5? I have not found any
>> to date, after searching for over 2 months.

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/0056318d-fd36-4838-bbc1-e66eaa76f2f7n%40googlegroups.com.

