>>>>> "Thomas" == Thomas Breuel <[email protected]> writes:

Thomas> The images look like mixed raster content to me (background,
Thomas> foreground, and selection layer).

It looks like the DjVu algorithm was used on the scans.  The site
also has a DjVu version; the PDF may have been created from that.

If the DjVu conversion was done ideally, the foreground image should
be all one needs for OCR.
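
As a rough sketch of how one might pull a single layer out of the DjVu
version for OCR, assuming DjVuLibre's ddjvu tool is installed: the
helper names and the file names below are hypothetical, and the choice
of the bitonal "mask" layer as the default is my assumption about which
layer is most useful to an OCR engine, not something verified on these
scans.

```python
import subprocess

# Layer names accepted by ddjvu's -mode option (DjVuLibre).
DDJVU_MODES = {"color", "black", "mask", "foreground", "background"}


def ddjvu_layer_command(djvu_path, out_path, page=1, mode="mask"):
    """Build the ddjvu argument list that renders one layer of one page.

    Hypothetical helper: it only assembles the command, it does not run it.
    """
    if mode not in DDJVU_MODES:
        raise ValueError(f"unknown ddjvu mode: {mode!r}")
    # The mask and black renderings are bitonal, so PBM is enough;
    # the other layers can carry colour, so PPM is used for them.
    fmt = "pbm" if mode in ("mask", "black") else "ppm"
    return [
        "ddjvu",
        f"-format={fmt}",
        f"-mode={mode}",
        f"-page={page}",
        djvu_path,
        out_path,
    ]


def extract_layer(djvu_path, out_path, page=1, mode="mask"):
    """Run ddjvu to write the chosen layer to out_path (requires DjVuLibre)."""
    subprocess.run(ddjvu_layer_command(djvu_path, out_path, page, mode), check=True)
```

Calling extract_layer("scan.djvu", "page1.pbm") would then give a
bitonal image to hand to the OCR engine; passing mode="foreground"
instead selects the foreground layer.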

-JimC
-- 
James Cloos <[email protected]>         OpenPGP: 1024D/ED7DAEA6
