On Wed, Apr 24, 2013 at 1:04 AM, ABHISHEK GUPTA <[email protected]> wrote:
> I am a 3rd year student at Dhirubhai Ambani Institute of Information &
> Communication Technology. I am interested in doing some work with
> Ankur-India on the topic "Improving information retrieval methods for OCR
> data sets consisting of Indic scripts". I want to know more about the
> project. What is the project's current state? What corpora, tools,
> algorithms and approaches are you using? As the project is aiming at
> improvement of the method, what are the current results?

The idea of the project is to work on an upstream-centric method which
will enable information retrieval with greater accuracy. The list
archive has a couple of threads on OCR and IR related topics; please
have a quick read through them.

To answer your query on the "state of the union": a variety of
divergent approaches exist upstream. The focus of our organization's
GSoC is to extend and enhance existing projects as much as possible.

With regard to the viability of current methods of retrieval, please
read the available literature. A few recent papers have been published
from IIT-KGP, among other places.

-- 
sankarshan mukhopadhyay
<https://twitter.com/#!/sankarshan>
_______________________________________________
Project-ideas mailing list
[email protected]
http://lists.ankur.org.in/listinfo.cgi/project-ideas-ankur.org.in
