Check the Indic language page http://code.google.com/p/tesseractindic/
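On the box-file problem in the quoted message below (one extra, very narrow box after each consonant with a right-side dependent sign), here is a minimal sketch of one possible cleanup. It is not part of Tesseract. It assumes the narrow box is the detached vowel sign and that the wanted result is a single box covering the consonant and the sign. The names `Box`, `parse_box_line`, `merge_narrow_boxes` and the thresholds `max_width` and `max_gap` are made up for illustration and would need tuning for a real page.

```python
from typing import List, NamedTuple


class Box(NamedTuple):
    glyph: str
    left: int
    bottom: int
    right: int
    top: int
    page: int


def parse_box_line(line: str) -> Box:
    # Box file line: <glyph> <left> <bottom> <right> <top> <page>
    glyph, left, bottom, right, top, page = line.strip().rsplit(" ", 5)
    return Box(glyph, int(left), int(bottom), int(right), int(top), int(page))


def merge_narrow_boxes(boxes: List[Box], max_width: int = 10, max_gap: int = 10) -> List[Box]:
    """Fold each narrow box into the box just before it.

    Assumption: a box no wider than max_width that starts within max_gap
    pixels of the previous box is the split-off vowel sign. The label of
    the previous box is kept; the label of the narrow box is discarded.
    """
    merged: List[Box] = []
    for box in boxes:
        prev = merged[-1] if merged else None
        is_narrow = (box.right - box.left) <= max_width
        if (prev is not None and is_narrow and prev.page == box.page
                and 0 <= box.left - prev.right <= max_gap):
            # Grow the previous box so it also covers the narrow one.
            merged[-1] = Box(prev.glyph, prev.left,
                             min(prev.bottom, box.bottom),
                             max(prev.right, box.right),
                             max(prev.top, box.top), prev.page)
        else:
            merged.append(box)
    return merged


def format_box(box: Box) -> str:
    return f"{box.glyph} {box.left} {box.bottom} {box.right} {box.top} {box.page}"


# The six lines from the quoted message.
SAMPLE = """କା 376 3125 407 3162 0
ଛା 412 3125 417 3161 0
ଖା 441 3123 472 3161 0
ଶା 479 3124 484 3159 0
ଗା 508 3123 539 3160 0
ସା 546 3123 550 3158 0"""

if __name__ == "__main__":
    parsed = [parse_box_line(line) for line in SAMPLE.splitlines()]
    for box in merge_narrow_boxes(parsed):
        print(format_box(box))
```

With the six sample lines this leaves three boxes, one per consonant plus sign. Check the result in a box editor before training, since a fixed pixel threshold can also swallow genuinely narrow characters.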
Sven

On Monday, April 22, 2013, mamata nayak wrote:

> Sir,
>
> I didn't get any reply. How was the problem described below resolved
> for the Bangla language?
>
> I have tried to train Tesseract OCR for a new language. Like the Bangla
> script, my language has a large character set. I was able to train
> Tesseract for the vowels and consonants, and as a beginner I edited the
> box file manually.
>
> However, for a character consisting of a consonant with a dependent
> modifier on its right side, the generated box file identifies the
> character but also produces an extra character (2 characters for 1).
> Here I am attaching a few lines of the box file; the bold ones are
> correct. I use Tesseract on Ubuntu.
>
> *କା 376 3125 407 3162 0*
> ଛା 412 3125 417 3161 0
> *ଖା 441 3123 472 3161 0*
> ଶା 479 3124 484 3159 0
> *ଗା 508 3123 539 3160 0*
> ସା 546 3123 550 3158 0
>
> So I can't proceed any further; please help me as soon as possible.
>
> Eagerly waiting for your reply.
>
> Thank you
>
> On Mon, Apr 15, 2013 at 8:38 PM, Sven Pedersen <[email protected]> wrote:
>
>> Please stop sending this message repeatedly. Someone will give you a
>> reply when they have time to figure it out.
>> Thanks,
>> Sven
>>
>> On Mon, Apr 15, 2013 at 9:45 AM, mama <[email protected]> wrote:
>>
>> [...]
>>
>> On Tuesday, February 26, 2013 12:32:51 AM UTC+5:30, Nick White wrote:
>>
>> Hi tesseract folks,
>>
>> Just a note to let you know that an article I wrote about training
>> tesseract for Ancient Greek has now been published. It is aimed to
>> be generally useful for people training tesseract with other
>> languages too, so anybody thinking about training may well find it
>> worth perusing.
>>
>> Find it here:
>> http://eutypon.gr/eutypon/pdf/e2012-29/e29-a01.pdf
>>
>> Any questions or comments would be warmly received.
>>
>> Nick

--
“All that is gold does not glitter,
not all those who wander are lost;
the old that is strong does not wither,
deep roots are not reached by the frost.
From the ashes a fire shall be woken,
a light from the shadows shall spring;
renewed shall be blade that was broken,
the crownless again shall be king.”

--
--
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to [email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

---
You received this message because you are subscribed to the Google Groups
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to [email protected].
For more options, visit https://groups.google.com/groups/opt_out.

