> I have a large body of images (primarily in JPEG2000) that I want to > break up into their component parts. That is, take a 3-column page > (think dictionary or phone book listing), and output both individual > columns, and individual lines.
That's pretty easy to do with OCRopus, both from the command line and programmatically. Look at the ISegmentPage interface and the "ocropus pageseg" command. The format is documented on ocropus.org at File Formats -- file formats used by OCRopus https://docs.google.com/View?id=dfxcv4vc_92c8xxp7 Tom --~--~---------~--~----~------------~-------~--~----~ You received this message because you are subscribed to the Google Groups "ocropus" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/ocropus?hl=en -~----------~----~----~----~------~----~------~--~---
