What is the present status of Tamil OCR? 20 volumes (8376 pages) typing should not solution for this!
Jayanta On Sun, Aug 31, 2014 at 1:07 AM, ViswaPrabha (വിശ്വപ്രഭ) < [email protected]> wrote: > Let me mention of yet another possible model that we are almost about to > try in ml: > > Apart from the already tried and tremendously successful models of > school-wise student model and general community competition (for prizes in > kind or just for credential certificates), we are now pondering upon a new > idea: > > Use secondary social network collaboration. > > As an example, if you have formed (or if you could grow from now,) a very > vibrant Wikimedia or other language-focused community network in Facebook, > you can open up an appropriately scaled competition model within that. > Many people are now seeking independent localization tools and input > methods just to use Facebook. > > (And BTW, thanks to Facebook, which I always detested, for making a great > invisible revolution among mainstream local community members, that most > other models are still struggling to achieve!) > > In Malayalam, now we have several such large communities each one > specializing in their own arenas (eg. Butterflies, Plants, Birds, Grammar, > Films and Film Songs etc. etc.). They all share the idea that most of such > shared knowledge and digital text and images shall ultimately end up in WM > pools. > > So, imagine a world where, > A Tamil community in Facebook takes up this project. (May or may not be > with an organized competition spirit). Each member picks up a few pages one > at a time, and post it back to Facebook. A small team collects this and > pipes to Wikisource. Eventually, the volume gets completely in. > > Just imagine... and it shall happen! > > PS. Why in FB and then why not directly in Wikisourse? > You know why! The interface makes a big difference. Let's admit that! > > -Viswam > > > > > > > > > On Sun, Aug 31, 2014 at 12:03 AM, Yann Forget <[email protected]> wrote: > >> FYI. Yann >> >> ---------- Forwarded message ---------- >> From: Ravishankar <[email protected]> >> Date: 2014-08-30 20:10 GMT+05:30 >> Subject: [Wikimediaindia-l] 20 volumes (8376 pages) of Tamil >> Encylopedia released under Creative Commons >> To: Wikimedia India Community list <[email protected]> >> >> >> Hi, >> >> Tamil Development Board (an autonomous institution under Government of >> Tamilnadu) releases its Encyclopedia (10 volumes, 7407 pages) and >> Children's Encyclopedia (10 volumes, 969 pages) under Creative Commons >> license. Tamil Wikipedians lead by Prof. C. R. Selvakumar and Prof. P. >> R. Nakkeeran, (Director, Tamil Virtual Academy) spearheaded this >> initiative coinciding with Tamil Wikipedia's 10 years celebrations. >> >> An official confirmation (in Tamil) can be seen at >> >> >> https://upload.wikimedia.org/wikipedia/commons/4/46/Letter_from_Tamil_Development_Board_donating_20_volumes_of_encyclopedia_in_Tamil_under_Creative_Commons_license.jpeg >> >> Scanned copies of these works are already available at >> >> http://tamilvu.org/library/kulandaikal/lku00/html/lku00ind.htm >> >> At Tamil Wikipedia, we are discussing how we can get this content >> typed and transferred to WikiSource. Doing so can be a good model to >> encourage more such works to be released in public domain. >> >> Following are two options I can think of: >> >> 1. Volunteers type all the content. Besides taking years to complete, >> this won't do justice for the value of time of volunteers who can do >> more valuable work than typing mechanically. >> >> A program like IT@School present in Kerala or a contest can encourage >> more people to join this effort but not all communities can't emulate >> this model successfully. >> >> 2. Request WMF to give a grant to the owner of the content and let >> them hand over the typed content to Wikisource volunteers who will >> upload and wikify the content. >> >> This will ensure maintaining the spirit of volunteerism and yet >> getting the work done in a professional and time bound manner. >> >> Numerous works in Wikisource are such ready made content uploaded >> already in the web through other projects like Project Gutenberg. >> >> If providing grants to non-Wikimedia organizations is an issue, a >> grant towards this can be given to community / chapter who will then >> outsource the typing work. >> >> I welcome community's input on any other model for this as India has >> vast amount of literature and works like this are waiting to be >> transfered to Wikisource. This is one area where we can add lot of >> content to Wiki projects at once. >> >> Ravi >> >> >> _______________________________________________ >> Wikimediaindia-l mailing list >> [email protected] >> To unsubscribe from the list / change mailing preferences visit >> https://lists.wikimedia.org/mailman/listinfo/wikimediaindia-l >> >> _______________________________________________ >> Wikisource-l mailing list >> [email protected] >> https://lists.wikimedia.org/mailman/listinfo/wikisource-l >> > > > _______________________________________________ > Wikisource-l mailing list > [email protected] > https://lists.wikimedia.org/mailman/listinfo/wikisource-l > >
_______________________________________________ Wikisource-l mailing list [email protected] https://lists.wikimedia.org/mailman/listinfo/wikisource-l
