Hello, 

I am training tesseract for malayalam.  The tif and the box files and the 
tesstrain log are shared 
here 
https://drive.google.com/drive/folders/0Bz8Xp0bwrlkdblNWMEZnaGpWTEk?usp=sharing 

Surprisingly i get errors for only the blobs which have the character മ  in 
them. 

These blobs are:

മ്
മ
മം
മാ
മി
മീ
മു
മൂ
മ്മ
മ്മം
മ്മാ
മ്മി
മ്മീ
മ്മേ
മ്മ്യ
മ്മ്യാ

I checked the tif and box file using http://zdenop.github.io/qt-box-editor/ 
and it all looks fine. 

Any idea why this could happen. 

-Raman

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/93d19385-c115-4151-9c16-2b648d31d9ca%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to