Uh, that sounds very interesting.
Right now, we mainly use the OCR from the DjVu files from Internet Archive
(that means ABBYY FineReader, which is very nice).

But ideally we could think of a "customizable" OCR software that gets
trained language by language: that would be extremely useful for
Wikisources.

(I can also imagine dividing further, within each language, by century,
because languages change over time too ;-)

Aubrey

On Sat, Jul 11, 2015 at 5:44 PM, Nicolas VIGNERON <
vigneron.nico...@gmail.com> wrote:

> Hi,
>
> I'm not a techie, so I'm not sure what OCR-as-a-service is, but you
> should ask Tpt and Phe, who have OCR stuff on Tool Labs (to find out what is
> behind tools like http://tools.wmflabs.org/phetools/ocr.php ).
>
> Regards, ~nicolas
>
> _______________________________________________
> Wikisource-l mailing list
> Wikisource-l@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikisource-l
>
>