listen anniqa i need help in urdu tesseract please reply On Wednesday, March 2, 2011 at 4:01:26 PM UTC+5, Aniqa Dilawari wrote: > > I have trained tesseract for Urdu image (which is a multipage tif image > having 20 pages: C.tif(it is zipped in .rar)) and boxfile (C.box) > After training the data, i gave image Urdu4.tif for recognition. The > output of the file is as outputC4.txt > In this file all the characters are not recognized. At position 2 the > recognized id should be 665664 instead of 665663. > > How is it possible to find out which characters are not recognized by > Tesseract? > >
-- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/4a8bee53-a7c7-4306-8e63-a68435826ceb%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.

