Shree, thank you, and yes, accented vowels would be fine, but right now I was talking of ' іѣѳѵ' set (U+0406,0462,0472,0474 uppercase and U+0456,0463,0473,0475 lowercase).
The 4.0.0.0 version from git definitely refuses to recognise those, and AFAICT there is no mention of the codes in the source files. I'm a complete noob at git, how could I know when the PR you mentioned becomes available in git as downloads? -Yury -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/316cc856-a996-43af-87c1-ca6b3314d8e3%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.

