Yeah!
I'm really happy that the BUB tool is being resurrected, and about the new
OCR script. Thanks, everyone!

Aubrey

On Tue, Jan 5, 2016 at 9:53 PM, Asaf Bartov <[email protected]> wrote:

> On Tue, Jan 5, 2016 at 10:29 AM, Bodhisattwa Mandal <[email protected]> wrote:
>
>> Hi,
>>
>> I am happy to report that Shrinivasan has created a Python script to
>> automate the process on Linux systems. The script uploads the PDF files to
>> Google Drive, downloads the OCRed text, and splits and merges the text
>> files so that they correspond to the PDF file. We have just tested the
>> script on small files in Kannada and Bengali Wikisource, and it was
>> successful. We are going to test the script with different types and
>> sizes of files, and in other Indic languages, in the next few days.
>>
>> The script is at https://github.com/tshrinivasan/OCR4wikisource
>>
>
> Fantastic news!
>
>    A.
>
>
> _______________________________________________
> Wikisource-l mailing list
> [email protected]
> https://lists.wikimedia.org/mailman/listinfo/wikisource-l
>
>
