First of all, please forgive my bad English.

I have some questions about ocropus, which can also be read as
suggestions for improving it.

As I understand it, one of the strengths of ocropus is that it adds
powerful document layout analysis to a powerful OCR engine
(tesseract). Am I right?

I believe ocropus would benefit from a frontend (perhaps based on an
existing one, such as gscan2pdf) that would let ordinary people use
it with ease.

I also think that, if possible, adding a "text under the image"
feature would be very welcome. ABBYY FineReader has one; here is a
picture of that feature in the Portuguese version of FineReader:
http://www.imagebam.com/image/c1c78f93276763
This feature makes it possible to scan old texts without having to
correct every error, because anyone reading the text also has access
to the original image. Here is an example of "text under image" in
Spanish:
http://www.cepc.es/rap/Publicaciones/Revistas/2/REP_048_184.pdf

Can ocropus read Portuguese? If not, are the tesseract-ocr language
files for Brazilian Portuguese compatible with ocropus?

Thank you!
