On Tue, Jan 5, 2016 at 10:29 AM, Bodhisattwa Mandal < [email protected]> wrote:
> Hi, > > I am happy to inform, that Shrinivasan has created a python script to > automate the process in Linux system. This scripts upload the PDF files to > Google Drive, download the OCRed text and split, merge the text files > properly to fit as the PDF file. We have just tested the script for small > files in Kannad and Bengali Wikisource and it was successful. We are going > to test the script for using different types and sizes of files and in > other Indic languages in next few days. > > The script is in https://github.com/tshrinivasan/OCR4wikisource > Fantastic news! A.
_______________________________________________ Wikisource-l mailing list [email protected] https://lists.wikimedia.org/mailman/listinfo/wikisource-l
