Hi Asaf ,

Thank you for helping us.

Presently I am personally working this for my native Bengali Wikisource.
There some program (1,2,3,4)  available on web to download Books from DLI
website. It may help for you too!  Sometimes I am facing the big issue with
this utility from service side not available or down for few moments.

1) http://sanskritdocuments.org/scannedbooks/dlidownloader/
2) http://dlidownloader.wordpress.com/
3) http://code.google.com/p/dli-downloader/
4)
http://techstunted.blogspot.in/2013/03/downloading-books-from-digital-library.html

Another issue I found at Internet Archive, I have uploaded some books in
PDF format  in AI here [1], but no boos have converted to DJVU , because,
they are saying that A DjVu can only be made if the language of the book is
OCRable. At this time we are not able to OCR Bengali. I know PDF will also
accepted format in WS, but I would preferred DJVU.

1)
https://archive.org/search.php?query=uploader%3A%22jayantanth%40gmail.com%22&sort=-publicdate

I shall send a mail for download list to you off-list.

Jayanta



On Fri, Dec 6, 2013 at 4:49 AM, Asaf Bartov <[email protected]> wrote:

> Jayanta, I'm also happy to help, if bandwidth is a problem.  If you send
> me a list of URLs of books at the DLI that you'd like me to download and
> upload to the Internet Archive for you, I'm happy to do it.
>
>    A.
>
>
> On Wed, Dec 4, 2013 at 2:14 PM, Yann Forget <[email protected]> wrote:
>
>> 2013/12/5 Jayanta Nath <[email protected]>
>>
>>> Hi Yann,
>>>
>>> Thank you for sharing this add-on and website. This site may very useful
>>> for sa.wikisource.org.
>>>
>>
>>  Yes, I will upload some of these books and tell them.
>>
>>
>>> I am working on my native wikisource bengali. Can you  help us to
>>> develop OCR for Bengali?
>>>
>>
>> Unfortunately, Bengali may not even exist in commercial software,
>> although I know a French company which is making OCR for Indian languages,
>> it will take some time.
>> Bengali is not available in Abby FineReader 11 Professional Editon, which
>> is the leading world software for OCR. However several dozens of languages
>> are available: all European languages, Latin, Greek, Russian, Chinese,
>> Japanese, Korean, Arabic, several African languages, etc., but no Indian
>> language is available in the list. It is what Internet Archive uses.
>> Developing OCR is a very long and complex work. And I don't speak
>> Bengali, so I can't help much.
>>
>> However I can help creating PDF and/or DJVU files, and uploading them.
>>
>>  Best regards,
>>
>> Yann
>>
>> _______________________________________________
>> Wikisource-l mailing list
>> [email protected]
>> https://lists.wikimedia.org/mailman/listinfo/wikisource-l
>>
>>
>
>
> --
>     Asaf Bartov
>     Wikimedia Foundation <http://www.wikimediafoundation.org>
>
> Imagine a world in which every single human being can freely share in the
> sum of all knowledge. Help us make it a reality!
> https://donate.wikimedia.org
>
> _______________________________________________
> Wikisource-l mailing list
> [email protected]
> https://lists.wikimedia.org/mailman/listinfo/wikisource-l
>
>
_______________________________________________
Wikisource-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikisource-l

Reply via email to