Hi,

OLE Nepal, Sanepa is working on http://www.pustakalaya.org. Most of the books are scanned and uploaded. :)
Regards

On 12/4/12, rajeev <[email protected]> wrote:
> Hi Bal Krishna,
> Thanks for the info.
> I have tried using some Hindi OCR software. Some of them are not too bad.
> However, I have been using images extracted from PDF files. I would like to
> see how it works with higher quality image files.
> Regarding research, I myself am not a technical guy. However, I came across
> Dr. Hellwig, who seems to have been quite active in Sanskrit/Hindi OCR.
> Maybe there could be some collaboration on the research side.
> http://www.geschkult.fu-berlin.de/e/indologie/mitarbeiter/drittmittel/hellwig/index.html
> I think if there is a decent OCR software and people are willing to put in
> some time to correct the generated text, it is possible to get a pretty
> decent digital copy. I am in Europe and I would like to read those old
> Nepali books on my Kindle etc. The scanned PDF is just not fun.
> Currently, I have been involved in an OpenStreetMap project (adyota.org or
> osmnepal.org), and would like to do something around open books.
>
> cheers
> rajeev
>
> On Tuesday, December 4, 2012 5:55:59 AM UTC+1, Bal Krishna wrote:
>>
>> Hi,
>> Madan Puraskar Pustakalaya (MPP) has been digitizing its collections of
>> Nepali materials. MPP/LTK has done some research and development on Nepali
>> OCR.
>> http://nepalinux.org/index.php?option=com_content&task=view&id=46&Itemid=53
>> However, this is still an area that needs further research, as segmentation
>> and handling of half and joint Nepali characters is quite challenging and
>> yet to be addressed. I would be happy to guide people who would be
>> interested to take up this research.
>> Regards,
>> Bal Krishna
>>
>> On Tue, Dec 4, 2012 at 2:37 AM, rajeev <[email protected]> wrote:
>>
>>> Hi guys,
>>>
>>> Can somebody update me on the recent work on digitizing old Nepalese
>>> literature books such as Laxmi Nibandha Sangraha etc. I know that many of
>>> them have been scanned and put for public use under CC at
>>> pustakalaya.org.
>>> Anybody working on digitizing them using OCR or some other means?
>>>
>>> Thanks,
>>> Rajeev
>>>
>>> --
>>> FOSS Nepal mailing list: [email protected]
>>> http://groups.google.com/group/foss-nepal
>>> To unsubscribe, e-mail: [email protected]
>>>
>>> Mailing List Guidelines:
>>> http://wiki.fossnepal.org/index.php?title=Mailing_List_Guidelines
>>> Community website: http://www.fossnepal.org/

--
Avash Mulmi
Support Mozilla Project :)
@avashz
http://www.facebook.com/avasz
