---------- Forwarded message ---------- Date: Thu, Jan 12, 2017 at 10:36 PM Subject: [tesseract-dev] Calling all language issues for 4.00! To: tesseract-dev <[email protected]>
Update on progress of 4.00 alpha: In a training session over the holiday break, I tried 17 different network architectures to experiment with smaller, faster networks. The news is good! Exactly how it will work in 4.00 is currently up for debate, but I now have a set of traineddata files that deliver ~3x speed-up at a cost of almost no loss in accuracy for most languages! On a modern enough machine with multi-core +SSE/AVX-like SIMD instructions, these networks beat baseline tesseract for speed, even in Latin languages. This may be provided as a second tessdata repo for those that want speed, or maybe the current traineddata files will just get replaced with the faster ones, since the accuracy and speed are so good. Thanks to everyone who has contributed language-specific issues so far! The main purpose of this post is a rallying cry for more. Since the training cycle takes about 2 weeks, I'd like to fix as many language issues as possible before going back to training. -- You received this message because you are subscribed to the Google Groups "tesseract-dev" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/tesseract-dev. To view this discussion on the web visit https://groups.google.com/d/ msgid/tesseract-dev/cef8e5c7-1275-4dbc-a8c8-2dbe75c666f3%40googlegroups.com <https://groups.google.com/d/msgid/tesseract-dev/cef8e5c7-1275-4dbc-a8c8-2dbe75c666f3%40googlegroups.com?utm_medium=email&utm_source=footer> . For more options, visit https://groups.google.com/d/optout. -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduUjQnG68YvBSm5CgzwAUKj6dHcZ%2B-wzvMEc27vKWSMP_A%40mail.gmail.com. For more options, visit https://groups.google.com/d/optout.

