Dear Wikimedia India,
As you are probably aware, the Govt. of India, immediately after Independence,
started multiple Indian-language encyclopedia projects to bring in knowledge of
Science and Technology. The Tamil-language encyclopedia was completed:
[http://en.wikipedia.org/wiki/Tamil_Encyclopedia]
I'm pleased to report that Tamil Virtual University has scanned the Tamil 
Kalaikalanjiam / Tamil Encyclopedia [please see Reference 1 below].
I was able to download the material using the wonderful wget command and the 
'convert' tool (from ImageMagick) on GNU/Linux. However, each of the 10 volumes 
is close to 700 MB without compression.
I would imagine the people behind this mammoth task (pre-internet era) would 
have liked it to be merged into a wiki-type format, which would make it a truly 
living document in sync with the times.
I do not have any experience with 1) Tamil OCR software or 2) automated 
updates to Wikipedia. Can anyone take the lead on this project? It would help 
boost the number of quality articles in Indian languages. The Children's 
encyclopedia is being scanned and has a lot of great visual content.
I have uploaded a sample (10 MB) PDF file at 
https://sites.google.com/site/periasamythooran/kalaikalanjiam/kalaikalanjiamWikiMergeAttempt.pdf
if you would like to give it a spin.
Thanks,
Murali.
1. Go to http://www.tamilvu.org/library/libindex.htm and click on 
Kalaikalanjiam / Tamil Encyclopedia.
_______________________________________________
Wikimediaindia-l mailing list
Wikimediaindia-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikimediaindia-l
