Hi, > - reordering pages -- Simply by drag&drop in the page list (at left).
People generally want a thumbnail view for this. > > > - removing noise regions (page borders, stains, etc.) -- If Ocropus > misrecognised some noise as an image region, the user can select and delete > it. I think it would be good to let the user mouse out a rectangle or region and say "all of this is noise, remove it". > > > - fixing page thresholding -- Could be manually set in > directory/image/region options. I'd suggest avoiding "options" dialogs. > > - marking regions as images or text (if they have been misclassified) > -- OK, the dropdown menu should have this option. (Text, Photograph, > Drawing, Math...) > > - adding or removing column separators -- The columns could be > detected as two separate text regions, couldn't they? OCRopus segments the page by detecting column separators first. Sometimes it gets it wrong. In that case, the user should be able to delete the column separator or add a new one and tell OCRopus to re-analyze the page. > > > - cutting or joining text lines > > - fixing OCR errors -- Maintaining the Unix philosophy not to > substitute other programs on their own turf, I would pass this work to some > text editor. Depending on the output format, this can be OpenOffice, Kile, > Vim or whatever else the user choses. Fixing OCR errors requires the user to see the original image. Also, it should usually be done on text that is laid out in the same way as the recognized page. Generally, I think a good way of thinking about an OCR user interface is an application like iPhoto or f-spot, with a thumbnail view and a simple edit page view and the ability to go quickly back and forth between them. Tom --~--~---------~--~----~------------~-------~--~----~ You received this message because you are subscribed to the Google Groups "ocropus" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [EMAIL PROTECTED] For more options, visit this group at http://groups.google.com/group/ocropus?hl=en -~----------~----~----~----~------~----~------~--~---
