On 11/28/2011 10:23 PM, Alex Brollo wrote: > [...] FineReader 11 [...] produces a complete djvu file [...] Text > layer hasn't full range of details, it's organized into two levels > (page and line), while OCR engine on IA servers produces a very rich > "tree" (page, column, region, paragraph, line and word).
Has anybody designed a web interface that shows the scanned image and the zones or regions of the Djvu text layer? It would look similar to image annotation on Commons, http://commons.wikimedia.org/wiki/Commons:Image_annotations For a Djvu file uploaded to Commons, could you automatically generate image annotations for the various text columns and illustrations? Does image annotation handle multi-page document formats such as PDF and Djvu? (Shouldn't image annotations and timed text be the same thing?) -- Lars Aronsson ([email protected]) Aronsson Datateknik - http://aronsson.se _______________________________________________ Commons-l mailing list [email protected] https://lists.wikimedia.org/mailman/listinfo/commons-l
