Hi, I belong to a group who study an old Egyptian writing system called "Coptic". It's based mostly on Greek (with some variation).
Big majority of books written in Coptic where during the last century and were mostly the same [typewriter] font. Here is a sample picture: https://imgur.com/a/ILRw6vm And sample book: https://archive.org/download/pistissophiaopu00petegoog We need to add Coptic to languages supported by Tesseract but not sure how. I tried following this document https://github.com/tesseract-ocr/tesseract/wiki/TrainingTesseract-4.00 but it's very difficult to understand. We need someone help us with the initial setup so that we can dedicate our man power to training the system. We are none profit group so we are hoping for free help but we would also consider paid help since the alternative is hundreds of hours of man labor to digitalize just few books. Thanks everyone for contributing to this awesome project -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/08869d08-8b3a-4390-be79-fa811c78c0ca%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.

