A suggestion: don't focus on Google only. Generalize the tool for other
sources. There are plenty of libraries as Opal that share much better
images to grab. The quality of Google shared scans is often very poor.

(PS: I presume, I could grab the whole Alma Mater library content... but
presently I'm busy; Aubrey, "stai sereno" :-) )

Alex


2014-03-04 10:21 GMT+01:00 Andrea Zanni <[email protected]>:

> Rohit Dua just wrote to me regarding that project (I was one of the
> possible mentors).
> if one of you is technically skilled and wants to help, there is plenty of
> room for that :-)
>
> Aubrey
>
>
> On Mon, Mar 3, 2014 at 7:24 PM, Luiz Augusto <[email protected]> wrote:
>
>> FYI \o/
>> Em 03/03/2014 15:13, <[email protected]> escreveu:
>>
>>>  Rohit Dua <[email protected]> changed bug 
>>> 57813<https://bugzilla.wikimedia.org/show_bug.cgi?id=57813>
>>>  What Removed Added  CC   [email protected]
>>>
>>>  *Comment # 2 <https://bugzilla.wikimedia.org/show_bug.cgi?id=57813#c2>
>>> on bug 57813 <https://bugzilla.wikimedia.org/show_bug.cgi?id=57813> from
>>> Rohit Dua <[email protected]> *
>>>
>>> (In reply to vladjohn2013 from comment #0 
>>> <https://bugzilla.wikimedia.org/show_bug.cgi?id=57813#c0>)> Google Books > 
>>> Internet Archive > Commons upload cycle
>>> >
>>> > Wikisources all around the world use heavily GB digitizations for
>>> > transcription and proofreading. As GB provides just the PDF, the usual 
>>> > cycle
>>> > is:
>>> >
>>> >     go to Google Books and look for a book
>>> >     check if the book is already in IA
>>> >     if it's not, upload it there
>>> >     get the djvu from IA
>>> >     upload it on Commons
>>> >     use it on Wikisource
>>> >
>>> > For point 4, we have this awesome tool:
>>> > https://toolserver.org/~tpt/iaUploadBot/step1.php What we miss right now 
>>> > is
>>> > a tool for point 2.1, that would serve many other users outside the
>>> > Wikimedia movement too. Eventually, we could think of a bot/script which
>>> > would do all the work altogether, notifying the user when their help is
>>> > needed (eg metadata polishing, Commons categories, etc.) Mentors: Aubrey 
>>> > is
>>> > available for "design" mentorship, paired with a technical expert. We can
>>> > maybe ask help from a IA expert.
>>> >
>>> > URL:https://www.mediawiki.org/wiki/Mentorship_programs/
>>> > Possible_projects#Google_Books_.3E_Internet_Archive_.3E_Commons_upload_cycle
>>>
>>>
>>> Hi
>>>
>>> This is to inform that I am working on Bug 57813 
>>> <https://bugzilla.wikimedia.org/show_bug.cgi?id=57813> - Google Books > 
>>> Internet
>>> Archive > Commons upload cycle, via GSOC-2014 project.
>>> I'm ready with with the outline of google-books download script.
>>>
>>> --
>>> Rohit Dua
>>> 8ohit.dua
>>> New Delhi,India
>>>
>>>  ------------------------------
>>> You are receiving this mail because:
>>>
>>>    - You voted for the bug.
>>>
>>>
>> _______________________________________________
>> Wikisource-l mailing list
>> [email protected]
>> https://lists.wikimedia.org/mailman/listinfo/wikisource-l
>>
>>
>
> _______________________________________________
> Wikisource-l mailing list
> [email protected]
> https://lists.wikimedia.org/mailman/listinfo/wikisource-l
>
>
_______________________________________________
Wikisource-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikisource-l

Reply via email to