Uh, that sounds very interesting. Right now, we mainly use the OCR from the Internet Archive's DjVu files (that means ABBYY FineReader, which is very nice).
But ideally we could think of a "customizable" OCR software that gets trained language by language: that would be extremely useful for the Wikisources. I can also imagine dividing each language further by century, because languages change over time too ;-)

Aubrey

On Sat, Jul 11, 2015 at 5:44 PM, Nicolas VIGNERON <vigneron.nico...@gmail.com> wrote:
> Hi,
>
> I'm not a techie, so I'm not sure I know what OCR-as-service is, but you
> should ask Tpt and Phe, who have OCR stuff on Tool Labs (to find out what is
> behind tools like http://tools.wmflabs.org/phetools/ocr.php ).
>
> Regards, ~nicolas
_______________________________________________
Wikisource-l mailing list
Wikisource-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikisource-l