Hi Jayanta, I would also suggest that you make a list of books you would like to be uploaded to Commons. If the PDF is not easily available on the Internet, you can upload it to Commons, then I will make a DJVU. BTW, I checked another OCR software (Caminova Document Express 7.5 Enterprise, with Asian languages), but there is no Indian language available in it. Only Chinese, Japanese, Korean, etc.
Regards, Yann 2013/12/6 Jayanta Nath <[email protected]> > Hi Asaf , > > Thank you for helping us. > > Presently I am personally working this for my native Bengali Wikisource. > There some program (1,2,3,4) available on web to download Books from DLI > website. It may help for you too! Sometimes I am facing the big issue with > this utility from service side not available or down for few moments. > > 1) http://sanskritdocuments.org/scannedbooks/dlidownloader/ > 2) http://dlidownloader.wordpress.com/ > 3) http://code.google.com/p/dli-downloader/ > 4) > http://techstunted.blogspot.in/2013/03/downloading-books-from-digital-library.html > > Another issue I found at Internet Archive, I have uploaded some books in > PDF format in AI here [1], but no boos have converted to DJVU , because, > they are saying that A DjVu can only be made if the language of the book is > OCRable. At this time we are not able to OCR Bengali. I know PDF will also > accepted format in WS, but I would preferred DJVU. > > 1) > https://archive.org/search.php?query=uploader%3A%22jayantanth%40gmail.com%22&sort=-publicdate > > I shall send a mail for download list to you off-list. > > Jayanta > > > > On Fri, Dec 6, 2013 at 4:49 AM, Asaf Bartov <[email protected]> wrote: > >> Jayanta, I'm also happy to help, if bandwidth is a problem. If you send >> me a list of URLs of books at the DLI that you'd like me to download and >> upload to the Internet Archive for you, I'm happy to do it. >> >> A. >> >> >> On Wed, Dec 4, 2013 at 2:14 PM, Yann Forget <[email protected]> wrote: >> >>> 2013/12/5 Jayanta Nath <[email protected]> >>> >>>> Hi Yann, >>>> >>>> Thank you for sharing this add-on and website. This site may very >>>> useful for sa.wikisource.org. >>>> >>> >>> Yes, I will upload some of these books and tell them. >>> >>> >>>> I am working on my native wikisource bengali. Can you help us to >>>> develop OCR for Bengali? >>>> >>> >>> Unfortunately, Bengali may not even exist in commercial software, >>> although I know a French company which is making OCR for Indian languages, >>> it will take some time. >>> Bengali is not available in Abby FineReader 11 Professional Editon, >>> which is the leading world software for OCR. However several dozens of >>> languages are available: all European languages, Latin, Greek, Russian, >>> Chinese, Japanese, Korean, Arabic, several African languages, etc., but no >>> Indian language is available in the list. It is what Internet Archive uses. >>> Developing OCR is a very long and complex work. And I don't speak >>> Bengali, so I can't help much. >>> >>> However I can help creating PDF and/or DJVU files, and uploading them. >>> >>> Best regards, >>> >>> Yann >>> >>> _______________________________________________ >>> Wikisource-l mailing list >>> [email protected] >>> https://lists.wikimedia.org/mailman/listinfo/wikisource-l >>> >>> >> >> >> -- >> Asaf Bartov >> Wikimedia Foundation <http://www.wikimediafoundation.org> >> >> Imagine a world in which every single human being can freely share in the >> sum of all knowledge. Help us make it a reality! >> https://donate.wikimedia.org >> >> _______________________________________________ >> Wikisource-l mailing list >> [email protected] >> https://lists.wikimedia.org/mailman/listinfo/wikisource-l >> >> > > _______________________________________________ > Wikisource-l mailing list > [email protected] > https://lists.wikimedia.org/mailman/listinfo/wikisource-l > >
_______________________________________________ Wikisource-l mailing list [email protected] https://lists.wikimedia.org/mailman/listinfo/wikisource-l
