On Wed, Jul 2, 2008 at 9:32 AM, Sayamindu Dasgupta <[EMAIL PROTECTED]>
> This guy seems to be doing some interesting progress for a Bangla OCR
> - or more precisely, enabling Bangla in Tesseract.
> http://debayanin.googlepages.com/hackingtesseract

Yes, it looks definitely interesting.

> Looks like he needs some more training data - can we provide him with some ?

If I remember correctly, there was a sample file for testing completeness
of Bengali fonts. Since it has all letters and conjuncts typed-in, the
file might
be useful for training Tesseract as well .

Deepayan should be able to give some input here. He has working experience
with R and may have some training sample as well.

Cheers,
Golam

--
http://gravity.psu.edu/~hossain/

-------------------------------------------------------------------------
Sponsored by: SourceForge.net Community Choice Awards: VOTE NOW!
Studies have shown that voting for your favorite open source project,
along with a healthy diet, reduces your potential for chronic lameness
and boredom. Vote Now at http://www.sourceforge.net/community/cca08
_______________________________________________
Bengalinux-core mailing list
Bengalinux-core@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bengalinux-core

Reply via email to