Hello everyone,

I'm currently working on a small project that aims to recognize very
small chunks of text (typically blocks of 20 characters, each at a
known location in the document).

I have read a lot about Tesseract, and some points still confuse me.

First, I had a look at the .NET wrapper for Tesseract (which doesn't
really interest me in itself; I only wanted to get a good idea of
"how to"), and, following trails of clues over the net, I ran into a
lot of discussions saying that Tesseract is full of memory leaks, is
kind of unstable, etc. I'd like a clear overview of Tesseract's
reliability and, if possible, a confirmation or refutation that it
leaks memory. I don't feel comfortable launching an external binary
from my application; I'd rather use the Tesseract API directly. Most
of all, my application aims to be cross-platform, so the ideal would
be to include the Tesseract code in my project, so that deploying to
another platform would be just a compilation away.

Secondly, following the same trails the .NET wrapper's author left
behind, I've read that Tesseract's code is not really designed to be
called from other code. It looks like it includes a lot of exit
messages, which leads to the conclusion that it is designed to run as
a standalone binary. Is that true? Would it be a lot of work to
change that if I decided to?

Last thing: I've been looking around for documentation, and the best
I've found so far is maintained by this group of fellow hackers:
http://tesseract-ocr.repairfaq.org . Though it looks very nice, I was
unable to find a good example of a very simple Tesseract recognition
routine called directly from C++ code. Again, I'm mostly looking for
C++ that instantiates objects and calls functions from code included
in my project, not a library or an external binary.

If this is not the right place for such questions, please pardon me,
and feel free to point me in the right direction. I have to admit
that I feel a bit confused about who is maintaining what, how "alive"
the projects are, etc.

Thanks a lot for taking the time to read this,
Pierre.
