On Wed, Jul 2, 2008 at 9:32 AM, Sayamindu Dasgupta <[EMAIL PROTECTED]> > This guy seems to be doing some interesting progress for a Bangla OCR > - or more precisely, enabling Bangla in Tesseract. > http://debayanin.googlepages.com/hackingtesseract
Yes, it looks definitely interesting. > Looks like he needs some more training data - can we provide him with some ? If I remember correctly, there was a sample file for testing completeness of Bengali fonts. Since it has all letters and conjuncts typed-in, the file might be useful for training Tesseract as well . Deepayan should be able to give some input here. He has working experience with R and may have some training sample as well. Cheers, Golam -- http://gravity.psu.edu/~hossain/ ------------------------------------------------------------------------- Sponsored by: SourceForge.net Community Choice Awards: VOTE NOW! Studies have shown that voting for your favorite open source project, along with a healthy diet, reduces your potential for chronic lameness and boredom. Vote Now at http://www.sourceforge.net/community/cca08 _______________________________________________ Bengalinux-core mailing list Bengalinux-core@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bengalinux-core