> PS: I've used other Tamil Unicode fonts such as Latha, Akshar and TheeneeUni; they all worked perfectly.
As you note in the statement above, the problem is with the font, not Tesseract.

On Tuesday, August 14, 2018 at 12:03:42 AM UTC+5:30, Mugunthan wrote:
> I've been making tif/box files for Tamil character recognition using text2image on Windows. I came across some issues with text2image for the Unicode font Sundaram-0807.
>
> Issue 1: Some characters in the tif file don't match the text file.
>
> Issue 2: Some characters in the tif file match the text file only in some instances.
>
> I tried changing the size of the generated tif file, but it doesn't help. Please see the attached screenshots and the files for the font Sundaram-0807, size 12.
>
> PS: I've used other Tamil Unicode fonts such as Latha, Akshar and TheeneeUni; they all worked perfectly.
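One way to confirm that the font is at fault is to render the same training text with a known-good font and with the problem font, then compare the resulting tif/box pairs. Here is a minimal sketch that only builds the text2image command lines and does not run them. The text file name, the output base pattern, the fonts directory and the helper name `text2image_commands` are assumptions for illustration, and the exact name text2image expects for Sundaram-0807 may differ from the one shown.

```python
def text2image_commands(text_file, fonts, fonts_dir, ptsize=12):
    """Build one text2image command (as an argument list) per font.

    Nothing is executed here; pass each list to subprocess.run to render.
    File names and the output base pattern are illustrative assumptions.
    """
    commands = []
    for font in fonts:
        # Hypothetical naming scheme: tam.<font>.exp0
        outputbase = "tam." + font.replace(" ", "_") + ".exp0"
        commands.append([
            "text2image",
            "--text=" + text_file,
            "--outputbase=" + outputbase,
            "--font=" + font,
            "--fonts_dir=" + fonts_dir,
            "--ptsize=" + str(ptsize),
        ])
    return commands


if __name__ == "__main__":
    # Latha is reported as working, Sundaram-0807 as failing.
    for cmd in text2image_commands(
        "tam.training_text.txt",          # assumed file name
        ["Latha", "Sundaram-0807"],       # font names as given in the thread
        "C:/Windows/Fonts",               # assumed fonts directory
    ):
        print(" ".join(cmd))
```

If the Latha output matches the text file and the Sundaram-0807 output does not, with all other options identical, that points to the font rather than to text2image or Tesseract.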

