>>>>> "Thomas" == Thomas Breuel <[email protected]> writes:
Thomas> The images look like mixed raster content to me (background,
Thomas> foreground, and selection layer).

It looks like the djvu algorithm was used on the scans. The site also
has a djvu version; the pdf may have been created from that.

If the djvu-ing was done ideally, the foreground image should be all
one needs for ocr-ing.

-JimC
-- 
James Cloos <[email protected]>         OpenPGP: 1024D/ED7DAEA6
