Sir,

I have not received any reply. How was the problem described below resolved for the Bangla language?

I have tried to train Tesseract OCR for a new language. Like the Bangla script, my language has a large character set. I was able to train Tesseract for the vowels and consonants, and as a beginner I edited the box file manually.

However, for a character made of a consonant with a dependent modifier on its right side, the generated box file identifies the character but also produces an extra character (two characters for one). Here are a few lines of the box file; the lines in bold (wrapped in asterisks) are the correct ones. I use Tesseract on Ubuntu.

*କା 376 3125 407 3162 0*
ଛା 412 3125 417 3161 0
*ଖା 441 3123 472 3161 0*
ଶା 479 3124 484 3159 0
*ଗା 508 3123 539 3160 0*
ସା 546 3123 550 3158 0

So I cannot proceed further. Please help me as soon as possible.

Eagerly waiting for your reply.

Thank you

On Mon, Apr 15, 2013 at 8:38 PM, Sven Pedersen <[email protected]> wrote:

> Please stop sending this message repeatedly. Someone will give you a reply
> when they have time to figure it out.
> Thanks,
> Sven
>
> On Mon, Apr 15, 2013 at 9:45 AM, mama <[email protected]> wrote:
>
>> Sir
>>
>> I have tried to train Tesseract OCR for a new language. Like the Bangla
>> script, my language has a large character set. I was able to train
>> Tesseract for the vowels and consonants, and as a beginner I edited the
>> box file manually.
>>
>> However, for a character made of a consonant with a dependent modifier on
>> its right side, the generated box file identifies the character but also
>> produces an extra character (two characters for one). Here are a few
>> lines of the box file; the lines in bold (wrapped in asterisks) are the
>> correct ones. I use Tesseract on Ubuntu.
>>
>> *କା 376 3125 407 3162 0*
>> ଛା 412 3125 417 3161 0
>> *ଖା 441 3123 472 3161 0*
>> ଶା 479 3124 484 3159 0
>> *ଗା 508 3123 539 3160 0*
>> ସା 546 3123 550 3158 0
>>
>> So I cannot proceed further. Please help me as soon as possible.
>>
>> Eagerly waiting for your reply.
>>
>> Thank you
>>
>> On Tuesday, February 26, 2013 12:32:51 AM UTC+5:30, Nick White wrote:
>>>
>>> Hi tesseract folks,
>>>
>>> Just a note to let you know that an article I wrote about training
>>> tesseract for Ancient Greek has now been published. It is aimed to
>>> be generally useful for people training tesseract with other
>>> languages too, so anybody thinking about training may well find it
>>> worth perusing.
>>>
>>> Find it here:
>>> http://eutypon.gr/eutypon/pdf/e2012-29/e29-a01.pdf
>>>
>>> Any questions or comments would be warmly received.
>>>
>>> Nick
>>>
>> --
>> You received this message because you are subscribed to the Google
>> Groups "tesseract-ocr" group.
>> To post to this group, send email to [email protected]
>> To unsubscribe from this group, send email to
>> [email protected]
>> For more options, visit this group at
>> http://groups.google.com/group/tesseract-ocr?hl=en
>>
>> ---
>> To unsubscribe from this group and stop receiving emails from it, send an
>> email to [email protected].
>> For more options, visit https://groups.google.com/groups/opt_out.
>
> --
> "All that is gold does not glitter,
> not all those who wander are lost;
> the old that is strong does not wither,
> deep roots are not reached by the frost.
> From the ashes a fire shall be woken,
> a light from the shadows shall spring;
> renewed shall be blade that was broken,
> the crownless again shall be king."
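
The box-file lines quoted in this thread follow Tesseract's layout of a glyph followed by left, bottom, right, top and page number. Below is a minimal Python sketch, assuming only that layout, which lists each box's width so the unmarked entries can be compared with the bold ones. The names `Box`, `parse_box_line` and `narrow_boxes`, and the 0.5 ratio, are illustrative assumptions and not part of Tesseract.

```python
from typing import List, NamedTuple


class Box(NamedTuple):
    """One line of a Tesseract box file: glyph, pixel coordinates, page."""
    glyph: str
    left: int
    bottom: int
    right: int
    top: int
    page: int

    @property
    def width(self) -> int:
        return self.right - self.left


def parse_box_line(line: str) -> Box:
    # Split from the right so the glyph field is kept whole.
    glyph, left, bottom, right, top, page = line.rsplit(None, 5)
    return Box(glyph, int(left), int(bottom), int(right), int(top), int(page))


def narrow_boxes(boxes: List[Box], ratio: float = 0.5) -> List[Box]:
    """Return boxes narrower than `ratio` times the median width.

    The ratio is an arbitrary illustrative threshold (an assumption).
    """
    widths = sorted(b.width for b in boxes)
    median = widths[len(widths) // 2]
    return [b for b in boxes if b.width < ratio * median]


# The six lines from the message, without the bold markers.
SAMPLE = """\
କା 376 3125 407 3162 0
ଛା 412 3125 417 3161 0
ଖା 441 3123 472 3161 0
ଶା 479 3124 484 3159 0
ଗା 508 3123 539 3160 0
ସା 546 3123 550 3158 0
"""

if __name__ == "__main__":
    boxes = [parse_box_line(l) for l in SAMPLE.splitlines() if l.strip()]
    for b in boxes:
        print(b.glyph, "width:", b.width)
    print("narrow:", [b.glyph for b in narrow_boxes(boxes)])
```

On the sample lines, the three bold boxes are 31 pixels wide, while the unmarked boxes are only 4 to 5 pixels wide and begin a few pixels to the right of the preceding bold box.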

