First of all, please forgive my bad english... So, I have some questions about ocropus, which can also be seen as suggestions to improve the ocropus.
As I understand it, one of the strengths of ocropus is the addition of a powerful "Document Layout Analysis" to a powerful OCR (tesseract) am I right? I believe that a positive point for the ocropus would be the development of a frontend (perhaps based on some frontend that already exists, like gscan2pdf) that would enable ordinary people to use the ocropus with ease. I also think that, if possible, adding the feature "text under the image" (such as ABBY FineReader, here is an picture of the finereader's feature in Portuguese: http://www.imagebam.com/image/c1c78f93276763 ) would be very welcome. This feature enables the scanning of old texts without concern for the correction of all errors, because those who are reading the text has access also to the original image (here is an exaple of "text under image" in Spanish: http://www.cepc.es/rap/Publicaciones/Revistas/2/REP_048_184.pdf ). Ocropus can read Portuguese? If not, the tesseract-ocr language files for Brasilian Portuguese text is compatible with ocropus? Thank you! -- You received this message because you are subscribed to the Google Groups "ocropus" group. To post to this group, send email to [email protected]. To unsubscribe from this group, send email to [email protected]. For more options, visit this group at http://groups.google.com/group/ocropus?hl=en.
