hi, everyone. I am new to tesseract. I want add a new language -- Traditional Mongolian Script to tesseract. Traditional Mongolian Script have a different layout from any language. that write from top to bottom in line and lines from left to right. I know that have supported from tom to bottom layout in tesseract 3.02, and supported complex text such as Arabic, Hindi, Thai and so on. So i want to add traditional mongolian to tesseract. I create a image that contains traditional mongolian text, and use commant to generate box file such as: tesseract mvf.baiti.exp0.tif mvf.baiti.exp0 -l mvf batch.nochop makebox I get error:
Error opening data file E:\tesseract-OCR\tessdata/mvf.traineddata Please make sure the TESSDATA_PREFIX environment variable is set to the parent directory of your "tessdata" directory. Failed loading language 'mvf' Tesseract could not initialize tesseract. mvf is Traditional Mongolian code in ISO 639-3 standard. so I have question: If I can not add a new language that do not exist in tesseract traineddata download list? Thanks. -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to tesseract-ocr@googlegroups.com To unsubscribe from this group, send email to tesseract-ocr+unsubscr...@googlegroups.com For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en