It seems 'fas' is for Persian, but there are no cube files, resulting in poor results. Arabic language files work much better for Persian images. There is another 'per' folder for Persian, but there isn't even '.traieddata' file for it. Does anyone know if 'Google Doc' has used 'Tesseract' for its OCR engine? Google Docs performs OCR for Persian images with good accuracy!
On Saturday, July 18, 2015 at 8:14:07 AM UTC+4:30, Jeff Breidenbach wrote: > > I think 'fas' is the language code for Persian. > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at http://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/edd64e28-9e52-4b44-80cc-0aaa442caa85%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.

