i have trained my own model for urdu language using jtessboxeditor to 
create tiff/box file and then used Serak tesseract trainer for creating 
trainedata file, my model is recognizing urdu language but there are 2 
issues mainly other than accuracy(accuracy will be test after solving 
following 2 issues).

   1. model is not recognizing the spaces b/w the words.
   2. model is showing the text in LTR form (Urdu is RTL language, similar 
   to arabic)

thanx in advance.

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/b759d0c7-2ad5-4159-9d8d-63bd953e83d2%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to