> I have a large body of images (primarily in JPEG2000) that I want to
> break up into their component parts.  That is, take a 3-column page
> (think dictionary or phone book listing), and output both individual
> columns, and individual lines.

That's pretty easy to do with OCRopus, both from the command line and
programmatically.  Look at the ISegmentPage interface and the "ocropus
pageseg" command.

The segmentation output format is documented on ocropus.org under
"File Formats" (file formats used by OCRopus):

https://docs.google.com/View?id=dfxcv4vc_92c8xxp7
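
As a rough sketch of what you could do with the segmentation output once
you have it: the Python below computes one bounding box per column and per
line from a grid of packed RGB pixel values. It assumes the segmentation
image stores the column number in the red channel, the paragraph number in
green, and the line number in blue, with white as background. That encoding
is an assumption on my part here, so check it against the File Formats
document. The helper names (split_label, bounding_boxes, column_boxes,
line_boxes) are made up for illustration and are not part of OCRopus.

```python
BACKGROUND = 0xFFFFFF  # assumed: white marks non-text pixels


def split_label(value):
    """Split a packed 24-bit pixel into (column, paragraph, line).

    Assumes red = column, green = paragraph, blue = line; verify this
    against the File Formats document before relying on it.
    """
    return (value >> 16) & 0xFF, (value >> 8) & 0xFF, value & 0xFF


def bounding_boxes(pixels, key):
    """Return {key(value): (x0, y0, x1, y1)} for all non-background pixels.

    `pixels` is a list of rows of packed RGB integers.
    Box coordinates are inclusive.
    """
    boxes = {}
    for y, row in enumerate(pixels):
        for x, value in enumerate(row):
            if value == BACKGROUND:
                continue
            k = key(value)
            if k in boxes:
                x0, y0, x1, y1 = boxes[k]
                boxes[k] = (min(x0, x), min(y0, y), max(x1, x), max(y1, y))
            else:
                boxes[k] = (x, y, x, y)
    return boxes


def column_boxes(pixels):
    """One box per column number (red channel under the assumed encoding)."""
    return bounding_boxes(pixels, key=lambda v: split_label(v)[0])


def line_boxes(pixels):
    """One box per distinct (column, paragraph, line) triple."""
    return bounding_boxes(pixels, key=split_label)
```

Cropping the original page image to each of these boxes with whatever
imaging library you prefer would then give you the individual columns and
lines as separate images.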

Tom
