On Sun, Jul 12, 2015 at 11:25 AM, Asaf Bartov <[email protected]> wrote:

> On Sat, Jul 11, 2015 at 9:59 AM, Andrea Zanni <[email protected]>
> wrote:
>
>> uh, that sounds very interesting.
>> Right now, we mainly use OCR from djvu from Internet Archive (that means
>> ABBYY Finereader, which is very nice).
>>
>
> Yes, the output is generally good.  But as far as I can tell, the
> archive's Open Library API does not offer a way to retrieve the OCR output
> programmatically, and certainly not for an arbitrary page rather than the
> whole item.  What I'm working on requires the ability to OCR a single page
> on demand.
>
True.
I've recently met Giovanni, a new (Italian) guy who's now working with
Internet Archive and Open Library.
We discussed a number of possible partnerships/projects, and this is
definitely one to bring up.

But if we manage to do it directly in the Wikimedia world, that would be
even better.

Aubrey


>
> _______________________________________________
> Wikisource-l mailing list
> [email protected]
> https://lists.wikimedia.org/mailman/listinfo/wikisource-l
>
>
