Hello,
I am training tesseract for malayalam. The tif and the box files and the tesstrain log are shared here https://drive.google.com/drive/folders/0Bz8Xp0bwrlkdblNWMEZnaGpWTEk?usp=sharing Surprisingly i get errors for only the blobs which have the character മ in them. These blobs are: മ് മ മം മാ മി മീ മു മൂ മ്മ മ്മം മ്മാ മ്മി മ്മീ മ്മേ മ്മ്യ മ്മ്യാ I checked the tif and box file using http://zdenop.github.io/qt-box-editor/ and it all looks fine. Any idea why this could happen. -Raman -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/93d19385-c115-4151-9c16-2b648d31d9ca%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.

