Dear all, I am working on a bangla preprocessor for OCR. It converts the connected alphabet into separate alphabet then process by tesseract.
I have also made a web-based interface, after upload image in our website, it preprocess the image using our preprocessor and later on convert with tesseract and show the output. (License: GPL). Example: http://dhaka.ankur.org.bd:2121/bangla_preprocessor_demo.pdf http://dhaka.ankur.org.bd:2121/bangla_preprocessor_demo.png I could not find any contact address for "tessdata.ban" people (they did a great job), any idea ? Do you think it might helpful for other people if I start a web-based tesseract-ocr testing place for other languages as well ? Bangla preprocessor is still under development, any comments or suggestion would helpful for us ? Regards, Salahuddin salahuddin66.blogspot.com salahuddin66.deviantart.com --~--~---------~--~----~------------~-------~--~----~ You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en -~----------~----~----~----~------~----~------~--~---

