Re: [Wikisource-l] [cultural-partners] ABBYY Finereader 11 on Toolserver: do we like it?

Alex Brollo Mon, 28 Nov 2011 13:23:51 -0800

I'm going to upgrade my licensed FineReader 10 to FineReader 11 (so that
it.source too will have a volunteer with a legal FineReader 11 software...
:-) ), I downloaded the trial software and I can confirm that it produces a
complete djvu file (images and text layer) within a single step.


Text layer hasn't full range of details, it's organized into two levels
(page and line), while OCR engine on  IA servers produces a very rich
"tree" (page, column, region, paragraph, line and word). Images can't be
finely tuned, but it is possible, given images of the same width/height of
better quality, to "transplant" text layer into a different djvu with a few
DjvuLibre commands.

Is anyone of you interested into a rather deep exploration of djvu text
layer by python? I'm working about it, but I feel that there's so much  to
do, and  so much to gain. I'm currently working into a Windows dropbox
folder, containing DjvuLibre routines too.

Alex_brollo

_______________________________________________
Wikisource-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikisource-l

Re: [Wikisource-l] [cultural-partners] ABBYY Finereader 11 on Toolserver: do we like it?

Reply via email to